general purpose gpu
Recently Published Documents


TOTAL DOCUMENTS

32
(FIVE YEARS 1)

H-INDEX

8
(FIVE YEARS 0)

10.29007/55pq ◽  
2019 ◽  
Author(s):  
Lifeng Liu ◽  
Meilin Liu ◽  
Chongjun Wang

General purpose GPU (GPGPU) is an effective many-core architecture that can yield high throughput for many scientific applications with thread-level parallelism. However, several challenges still limit further performance improvements and make GPU program- ming challenging for programmers who lack the knowledge of GPU hardware architecture. In this paper, we design a compiler-assisted locality aware CTA (cooperative thread array) mapping scheme for GPUs to take advantage of the inter CTA data reuses in the GPU kernels. Using the data reuse analysis based on the polyhedron model, we can detect inter CTA data reuse patterns in the GPU kernels and control the CTA mapping pattern to improve the data locality on each SM. The compiler-assisted locality aware CTA mapping scheme can also be combined with the programmable warp scheduler to further improve the performance. The experimental results show that our CTA mapping algorithm can improve the overall performance of the input GPU programs by 23.3% on average and by 56.7% when combined with the programmable warp scheduler.


PLoS ONE ◽  
2017 ◽  
Vol 12 (3) ◽  
pp. e0173317 ◽  
Author(s):  
Kwangyong Lim ◽  
Yongwon Hong ◽  
Yeongwoo Choi ◽  
Hyeran Byun

2016 ◽  
Vol 7 (1) ◽  
pp. 17-30 ◽  
Author(s):  
Rashmi Sharan Sinha ◽  
Satvir Singh ◽  
Sarabjeet Singh ◽  
Vijay Kumar Banga

2016 ◽  
Vol 51 (3) ◽  
pp. 75-84 ◽  
Author(s):  
Alexey Kolesnichenko ◽  
Christopher M. Poskitt ◽  
Sebastian Nanz ◽  
Bertrand Meyer

Author(s):  
Rashmi Sharan Sinha ◽  
◽  
Satvir Singh ◽  
Sarabjeet Singh ◽  
Vijay Kumar Banga ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document