Reducing synchronization overhead for compiler-parallelized codes on software DSMs (extended abstract)

Using multi-threads to hide deduplication I/O latency with low synchronization overhead

Journal of Central South University ◽

10.1007/s11771-013-1650-4 ◽

2013 ◽

Vol 20 (6) ◽

pp. 1582-1591 ◽

Cited By ~ 2

Author(s):

Rui Zhu ◽

Lei-hua Qin ◽

Jing-li Zhou ◽

Huan Zheng

Keyword(s):

Synchronization Overhead

Download Full-text

Reducing Synchronization Overhead Through Bundled Communication

Lecture Notes in Computer Science - OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools ◽

10.1007/978-3-319-05215-1_12 ◽

2014 ◽

pp. 163-177 ◽

Cited By ~ 8

Author(s):

James Dinan ◽

Clement Cole ◽

Gabriele Jost ◽

Stan Smith ◽

Keith Underwood ◽

...

Keyword(s):

Synchronization Overhead

Download Full-text

Parallel garbage collection without synchronization overhead

ACM SIGARCH Computer Architecture News ◽

10.1145/327070.327134 ◽

1985 ◽

Vol 13 (3) ◽

pp. 84-90 ◽

Cited By ~ 1

Author(s):

Ashwin Ram ◽

Janak H. Patel

Keyword(s):

Garbage Collection ◽

Synchronization Overhead

Download Full-text

Optimization strategies for inter-thread synchronization overhead on NUMA machine

2015 IEEE 34th International Performance Computing and Communications Conference (IPCCC) ◽

10.1109/pccc.2015.7410330 ◽

2015 ◽

Author(s):

Song Wu ◽

Jun Zhang ◽

Yaqiong Peng ◽

Hai Jin ◽

Wenbin Jiang

Keyword(s):

Synchronization Overhead ◽

Thread Synchronization

Download Full-text

Compiler Techniques to Reduce the Synchronization Overhead of GPU Redundant Multithreading

Proceedings of the 54th Annual Design Automation Conference 2017 on - DAC '17 ◽

10.1145/3061639.3062212 ◽

2017 ◽

Cited By ~ 6

Author(s):

Manish Gupta ◽

Daniel Lowell ◽

John Kalamatianos ◽

Steven Raasch ◽

Vilas Sridharan ◽

...

Keyword(s):

Synchronization Overhead ◽

Redundant Multithreading ◽

Compiler Techniques

Download Full-text

Scalable Checkpointing-Based Rollback Recovery Protocol for Geographically Distributed Systems

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.263-266.1492 ◽

2012 ◽

Vol 263-266 ◽

pp. 1492-1496

Author(s):

Jin Ho Ahn

Keyword(s):

Large Scale ◽

Rollback Recovery ◽

Core Networks ◽

Peer Network ◽

Underlying Network ◽

Geographically Distributed ◽

Synchronization Overhead ◽

High Scalability ◽

Coordinated Checkpointing ◽

Peer To Peer Network

Two opposite approaches were proposed to address some scalability problem resulting from coordinated checkpointing's synchronization during failure-free operation: minimizing the number of checkpointing participants and having the checkpointing process non-blocking. However, these previous approaches, oblivious to the underlying network, may not fundamentally provide any breakthrough for ensuring high scalability required in very large-scale P2P-based systems. This paper proposes a non-blocking coordinated checkpointing protocol to significantly reduce checkpointing synchronization overhead by structuring the peer-to-peer network into a set of groups according to a particular criterion. In this protocol, among processes in a group, one is designated as representative with the following special roles, intra-group and inter-group checkpointing coordination. Intra-group checkpointing coordination addresses the checkpointing procedure among processes within a group. On the other hand, inter-group checkpointing coordination is performed only among representatives. Thanks to this beneficial feature, the proposed protocol may considerably reduce the number of checkpointing control messages routed on core networks compared with the existing ones.

Download Full-text

Cell Processing for Two Scientific Computing Kernels

Handbook of Research on Scalable Computing Technologies ◽

10.4018/978-1-60566-661-7.ch014 ◽

2010 ◽

pp. 312-336

Author(s):

Meilian Xu ◽

Parimala Thulasiraman ◽

Ruppa K. Thulasiram

Keyword(s):

High Speed ◽

Scientific Computing ◽

Building Blocks ◽

Data Locality ◽

Data Mapping ◽

Single Chip ◽

Data Intensive ◽

Synchronization Overhead ◽

Simd Processing ◽

On Chip

This chapter uses two scientific computing kernels to illustrate challenges of designing parallel algorithms for one heterogeneous multi-core processor, the Cell Broadband Engine processor (Cell/B.E.). It describes the limitation of the current parallel systems using single-core processors as building blocks. The limitation deteriorates the performance of applications which have data-intensive and computationintensive kernels such as Finite Difference Time Domain (FDTD) and Fast Fourier Transform (FFT). FDTD is a regular problem with nearest neighbour comminuncation pattern under synchronization constraint. FFT based on indirect swap network (ISN) modifies the data mapping in traditional Cooley- Tukey butterfly network to improve data locality, hence reducing the communication and synchronization overhead. The authors hope to unleash the Cell/B.E. and design parallel FDTD and parallel FFT based on ISN by taking into account unique features of Cell/B.E. such as its eight SIMD processing units on the single chip and its high-speed on-chip bus.

Download Full-text

Reducing synchronization overhead in parallel simulation

ACM SIGSIM Simulation Digest ◽

10.1145/238793.238822 ◽

1996 ◽

Vol 26 (1) ◽

pp. 86-95 ◽

Cited By ~ 1

Author(s):

Ulana Legedza ◽

William E. Weihl

Keyword(s):

Parallel Simulation ◽

Synchronization Overhead

Download Full-text

Reducing Synchronization Overhead with Computation Replication in Parallel Agent-Based Road Traffic Simulation

IEEE Transactions on Parallel and Distributed Systems ◽

10.1109/tpds.2017.2714165 ◽

2017 ◽

Vol 28 (11) ◽

pp. 3286-3297 ◽

Cited By ~ 1

Author(s):

Yadong Xu ◽

Vaisagh Viswanathan ◽

Wentong Cai

Keyword(s):

Traffic Simulation ◽

Road Traffic ◽

Agent Based ◽

Synchronization Overhead

Download Full-text

Synchronization overhead in SOC compressed test

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽

10.1109/tvlsi.2004.834238 ◽

2005 ◽

Vol 13 (1) ◽

pp. 140-152 ◽

Cited By ~ 24

Author(s):

P.T. Gonciari ◽

B. Al-Hashimi ◽

N. Nicolici

Keyword(s):

Synchronization Overhead

Download Full-text