Dynamic remote memory acquiring for parallel data mining on PC cluster: Preliminary performance results

Author(s):  
Masato Oguchi ◽  
Masaru Kitsuregawa
2014 ◽  
pp. 108-112
Author(s):  
Masato Oguchi ◽  
Masaru Kitsuregawa

In this paper, a PC cluster connected with Storage Area Network (SAN) is built and evaluated. In the case of SAN­connected cluster, each node can access all shared disks directly without LAN; thus, SAN­connected clusters achieve better performance than LAN­connected clusters for disk access operations. However, if a lot of nodes access the same­shared disk simultaneously, application performance degrades due to I/O­bottleneck. A runtime data declustering method, in which data is declustered to several other disks dynamically during the execution of application, is proposed to resolve this problem. Parallel data mining is implemented and evaluated on the SAN­connected PC cluster. This application requires iterative scans of a shared disk, which degrade execution performance severely due to I/O­bottleneck. The runtime data declustering method is applied to this case. According to the results of experiments, the proposed method prevents performance degradation caused by shared disk bottleneck in SAN­connected clusters.


Author(s):  
Masaru Kitsuregawa ◽  
Takahiko Shintani ◽  
Masahisa Tamura ◽  
Iko Pramudiono

Sign in / Sign up

Export Citation Format

Share Document