Paper Title:
Performance Evaluation and Optimization on GPU
  Abstract

GPU provides higher peak performance with hundreds of cores than CPU counterpart. However, it is a big challenge to take full advantage of their computing power. In order to understand performance bottlenecks of applications on many-core GPU and then optimize parallel programs on GPU architectures, we propose a performance evaluating model based on memory wall and then classify applications into AbM (Application bound-in Memory) and AbC (Application bound-in Computing). Furthermore, we optimize kernels characterized with low memory bandwidth including matrix multiplication and FFT (Fast Fourier Transform) by employing texture cache on NVIDIA GTX280 using CUDA (Compute Unified Device Architecture). Experimental results show that texture cache is helpful for AbM with better data locality, so it is critical to utilize GPU memory hierarchy efficiently for performance improvement.

  Info
Periodical
Advanced Materials Research (Volumes 219-220)
Edited by
Helen Zhang, Gang Shen and David Jin
Pages
1445-1449
DOI
10.4028/www.scientific.net/AMR.219-220.1445
Citation
X. B. Gan, L. Shen, Q. Y. Tan, C. Liu, Z. Y. Wang, "Performance Evaluation and Optimization on GPU", Advanced Materials Research, Vols. 219-220, pp. 1445-1449, 2011
Online since
March 2011
Export
Price
$32.00
Share

In order to see related information, you need to Login.

In order to see related information, you need to Login.

Authors: Xiao Yan Xiong, Miao Zhang, Xiao Ping Li, Shao Juan Yu
Abstract:Based on chaotic characteristics in vertical direction of vibrating screen sides, nonlinear methods were proposed to diagnose crack of...
1258
Authors: Ying Lin Li, Li Hui Cao, Lian He Yang
Abstract:Weft knitted pattern design is one of the most important compositions of textile CAD. Traditional pattern design has a higher request on...
576
Authors: Yong Hua Zhang, Jian Hui He, Guo Qing Zhang
Abstract:This paper aims to understand influence of the obliquity of fin ray on its motion performance. An environment-friendly propulsion system...
267
Authors: Ioana Pintilie, Francesco Moscatelli, Roberta Nipoti, Antonella Poggi, Sandro Solmi, Lars S. Løvlie, Bengt G. Svensson
Abstract:The effect of nitrogen (N) introduced by ion implantation at the SiO2/4H-SiC interface on the capacitance of the MOS capacitors is...
326
Authors: Yi Mei, Fang Ping Wang, Qiao Ying Liu, Yu Tao Mao
Abstract:To solve the thermal deformation caused by thermal load of heavy machinery gearbox, it is established that coupled analysis model to carry...
651