gmres algorithm
Recently Published Documents


TOTAL DOCUMENTS

49
(FIVE YEARS 3)

H-INDEX

11
(FIVE YEARS 0)

2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Wenpeng Ma ◽  
Yiwen Hu ◽  
Wu Yuan ◽  
Xiazhen Liu

Solving triangular systems is the building block for preconditioned GMRES algorithm. Inexact preconditioning becomes attractive because of the feature of high parallelism on accelerators. In this paper, we propose and implement an iterative, inexact block triangular solve on multi-GPUs based on PETSc’s framework. In addition, by developing a distributed block sparse matrix-vector multiplication procedure and investigating the optimized vector operations, we form the multi-GPU-enabled preconditioned GMRES with the block Jacobi preconditioner. In the implementation, the GPU-Direct technique is employed to avoid host-device memory copies. The preconditioning step used by PETSc’s structure and the cuSPARSE library are also investigated for performance comparisons. The experiments show that the developed GMRES with inexact preconditioning on 8 GPUs can achieve up to 4.4x speedup over the CPU-only implementation with exact preconditioning using 8 MPI processes.


Author(s):  
Tobias Luiz Marchioro Toassi ◽  
Francisco Augusto Aparecido Gomes ◽  
Igor Sousa

2018 ◽  
Vol 120 ◽  
pp. 869-879 ◽  
Author(s):  
Wen Yang ◽  
Hongchun Wu ◽  
Yunzhao Li ◽  
Liangzhi Cao ◽  
Sicheng Wang

2018 ◽  
Vol 25 (8) ◽  
pp. 1171-1175
Author(s):  
S. M. Zafaruddin ◽  
Surendra Prasad
Keyword(s):  

2018 ◽  
pp. 1-30
Author(s):  
Roman Sergeevich Solomatin ◽  
Ilya Vitalievich Semenov ◽  
Igor Stanislavovich Men'shov

Sign in / Sign up

Export Citation Format

Share Document