Fine-Grained Parallel Solution for Solving Sparse Triangular Systems on Multicore Platform Using OpenMP Interface

Parallel Solution of Magnetotelluric Occam Inversion Algorithm Based on Hybrid MPI/OpenMP Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.602-605.3751 ◽

2014 ◽

Vol 602-605 ◽

pp. 3751-3754

Author(s):

Yu Liu ◽

Yi Xiao

Keyword(s):

Parallel Algorithm ◽

High Efficiency ◽

Programming Model ◽

Operation Time ◽

Coarse Grained ◽

Inversion Algorithm ◽

Parallel Solution ◽

Fine Grained ◽

Parallel Programming Model ◽

Occam Inversion

In order to improve the efficiency of magnetotelluric Occam inversion algorithm (MT Occam), a parallel algorithm is implemented on a hybrid MPI/OpenMP parallel programming model to increase its convergence speed and to decrease the operation time. MT Occam is partitioned to map the task on the parallel model. The parallel algorithm implements the coarse-grained parallelism between computation nodes and fine-grained parallelism between cores within each node. By analyzing the data dependency, the computing tasks are accurately partitioned so as to reduce transmission time. The experimental results show that with the increase of model scale, higher speedup can be obtained. The high efficiency of the parallel partitioning strategy of the model can improve the scalability of the parallel algorithm.

Download Full-text

Optimal Parallel Solution of Sparse Triangular Systems

SIAM Journal on Scientific Computing ◽

10.1137/0914027 ◽

1993 ◽

Vol 14 (2) ◽

pp. 446-460 ◽

Cited By ~ 33

Author(s):

Fernando L. Alvarado ◽

Robert Schreiber

Keyword(s):

Parallel Solution ◽

Triangular Systems

Download Full-text

A Parallel Algorithm for Solving Complex Multibody Problems With Stream Processors

Volume 4: 7th International Conference on Multibody Systems, Nonlinear Dynamics, and Control, Parts A, B and C ◽

10.1115/detc2009-86478 ◽

2009 ◽

Cited By ~ 2

Author(s):

Toby Heyn ◽

Alessandro Tasora ◽

Mihai Anitescu ◽

Dan Negrut

Keyword(s):

Collision Detection ◽

High Performance ◽

Complementarity Problems ◽

Detection Algorithm ◽

Inclusion Problem ◽

Parallel Solution ◽

Fine Grained ◽

Device Architecture ◽

Order Of Magnitude ◽

Cost Efficient

This paper describes a numerical method for the parallel solution of the differential measure inclusion problem posed by mechanical multibody systems containing bilateral and unilateral frictional constraints. The method proposed has been implemented as a set of parallel algorithms leveraging NVIDIA’s Compute Unified Device Architecture (CUDA) library support for multi-core stream computing. This allows the proposed solution to run on a wide variety of GeForce and TESLA NVIDIA graphics cards for high performance computing. Although the methodology relies on the solution of cone complementarity problems known to be fine-grained in terms of data dependency, a suitable approach has been developed to exploit parallelism with low overhead in terms of memory access and thread synchronization. Additionally, a parallel collision detection algorithm has been incorporated to further exploit available parallelism. Initial numerical tests described in this paper demonstrate a speedup of one order of magnitude for the solution time of both the collision detection and the cone complementarity problems when performed in parallel. Since stream multiprocessors are becoming ubiquitous as embedded components of next-generation graphic boards, the solution proposed represents a cost-efficient way to simulate the time evolution of complex mechanical problems with millions of parts and constraints, a task that used to require powerful supercomputers. The proposed methodology facilitates the analysis of extremely complex systems such as granular material flows and off-road vehicle dynamics.

Download Full-text

Parallel Solution Alternatives for Sparse Triangular Systems in Interior Point Methods

High Performance Computing Systems and Applications ◽

10.1007/978-1-4615-5611-4_36 ◽

1998 ◽

pp. 391-404 ◽

Cited By ~ 1

Author(s):

Huseyin Simitci

Keyword(s):

Interior Point ◽

Interior Point Methods ◽

Parallel Solution ◽

Triangular Systems

Download Full-text

The Parallel Solution of Triangular Systems of Equations

IEEE Transactions on Computers ◽

10.1109/tc.1983.1676206 ◽

1983 ◽

Vol C-32 (2) ◽

pp. 201-204 ◽

Cited By ~ 9

Author(s):

Evans ◽

Dunbar

Keyword(s):

Systems Of Equations ◽

Parallel Solution ◽

Triangular Systems

Download Full-text

Parallel solution of triangular systems of equations

Parallel Computing ◽

10.1016/0167-8191(88)90009-9 ◽

1988 ◽

Vol 6 (1) ◽

pp. 109-114 ◽

Cited By ~ 32

Author(s):

Charles H Romine ◽

James M Ortega

Keyword(s):

Systems Of Equations ◽

Parallel Solution ◽

Triangular Systems

Download Full-text

Parallel solution strategies for triangular systems arising from oil reservoir simulations

High-Performance Computing and Networking - Lecture Notes in Computer Science ◽

10.1007/bfb0046623 ◽

1995 ◽

pp. 148-155 ◽

Cited By ~ 2

Author(s):

A. Sunderland

Keyword(s):

Oil Reservoir ◽

Parallel Solution ◽

Solution Strategies ◽

Reservoir Simulations ◽

Triangular Systems

Download Full-text

Stability of the Partitioned Inverse Method for Parallel Solution of Sparse Triangular Systems

SIAM Journal on Scientific Computing ◽

10.1137/0915009 ◽

1994 ◽

Vol 15 (1) ◽

pp. 139-148 ◽

Cited By ~ 10

Author(s):

Nicholas J. Higham ◽

Alex Pothen

Keyword(s):

Inverse Method ◽

Parallel Solution ◽

Triangular Systems

Download Full-text

A New GPU Algorithm to Compute a Level Set-Based Analysis for the Parallel Solution of Sparse Triangular Systems

2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) ◽

10.1109/ipdps.2018.00101 ◽

2018 ◽

Cited By ~ 3

Author(s):

Ernesto Dufrechou ◽

Pablo Ezzatti

Keyword(s):

Level Set ◽

Parallel Solution ◽

Triangular Systems

Download Full-text

Parallel Solution of Triangular Systems on Distributed-Memory Multiprocessors

SIAM Journal on Scientific and Statistical Computing ◽

10.1137/0909037 ◽

1988 ◽

Vol 9 (3) ◽

pp. 558-588 ◽

Cited By ~ 110

Author(s):

Michael T. Heath ◽

Charles H. Romine

Keyword(s):

Distributed Memory ◽

Parallel Solution ◽

Triangular Systems

Download Full-text