PAPMAS: A Novel Prototype System for Parallel Application Performance Monitor and Analysis

Author(s): Ding Yi, Hu Kai, Gao Tao, Zhang Xinyu, Jiang Shu
2013, Vol 562-565, pp. 709-715
Author(s): Xiao Hui Zeng, Jing Zhong Li, Deng Li Bo, Chen Zhang, Wen Lang Luo

Available task scheduling systems cannot suspend running MPI parallel computing applications in order to quickly insert emergency parallel computing tasks. By modifying the TCP/IP protocol, this paper proposes a new method for synchronizing inter-process communication when a parallel application is suspended; moreover, by modifying the signal mechanism of the Linux operating system, it also proposes a method for consistently suspending and recovering parallel applications. A prototype dynamic task scheduling system for parallel computing is implemented, and the experimental results show that the prototype can suspend a running parallel computing application and supports dynamic insertion of emergency MPI parallel computing applications.


2007, Vol 19 (17), pp. 2219-2235
Author(s): Karan Singh, Engin İpek, Sally A. McKee, Bronis R. de Supinski, Martin Schulz, ...
