sequential code
Recently Published Documents


TOTAL DOCUMENTS: 49 (five years: 2)

H-INDEX: 8 (five years: 0)

2021 ◽  
Vol 5 (OOPSLA) ◽  
pp. 1-30
Author(s):  
Kevin De Porre ◽  
Carla Ferreira ◽  
Nuno Preguiça ◽  
Elisa Gonzalez Boix

To ease the development of geo-distributed applications, replicated data types (RDTs) offer a familiar programming interface while ensuring state convergence, low latency, and high availability. However, RDTs are still designed exclusively by experts using ad-hoc solutions that are error-prone and result in brittle systems. Recent works statically detect conflicting operations on existing data types and coordinate those at runtime to guarantee convergence and preserve application invariants. However, these approaches are too conservative, imposing coordination on a large number of operations. In this work, we propose a principled approach to designing and implementing efficient RDTs that takes application invariants into account. Developers extend sequential data types with a distributed specification, which together form an RDT. We statically analyze the specification to detect conflicts and unravel their cause. This information is then used at runtime to serialize concurrent operations safely and efficiently. Our approach derives a correct RDT from any sequential data type without changes to the data type's implementation and with minimal coordination. We implement our approach in Scala and develop an extensive portfolio of RDTs. The evaluation shows that our approach provides performance similar to conflict-free replicated data types for commutative operations, and considerably improves the performance of non-commutative operations compared to existing solutions.
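The core idea above, that only non-commuting operations need coordination, can be sketched briefly. This is a hypothetical illustration in Python, not the authors' Scala implementation: the paper detects conflicts statically from a specification, whereas this sketch checks commutativity dynamically on one concrete state, and all names here (`commute`, the operations) are mine.

```python
from copy import deepcopy

def commute(state, op_a, op_b):
    """True if op_a and op_b produce the same state in either order.
    Such pairs can run coordination-free; conflicting pairs would be
    serialized by the runtime."""
    s1 = deepcopy(state)
    op_a(s1); op_b(s1)
    s2 = deepcopy(state)
    op_b(s2); op_a(s2)
    return s1 == s2

# Operations on a sequential set, used unchanged.
def add3(s): s.add(3)
def rem3(s): s.discard(3)
def add7(s): s.add(7)

print(commute({1, 2}, add3, add7))  # True: adds of distinct elements commute
print(commute({1, 2}, add3, rem3))  # False: add/remove of the same element conflict
```

Starting from {1, 2}, add-then-remove of 3 returns to {1, 2}, while remove-then-add leaves {1, 2, 3}, which is exactly the kind of order-dependence that forces coordination.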


2021 ◽  
Vol 40 ◽  
pp. 02005
Author(s):  
Ashish A. Jadhav ◽  
Abhijeet D. Kalamkar ◽  
Pritish A. Gaikwad ◽  
Vishwesh Vyawahare ◽  
Navin Singhaniya

This paper deals with GPU computing of special mathematical functions used in Fractional Calculus. The graphics processing unit (GPU) has become an integral part of today's mainstream computing systems, and the special mathematical functions are an integral part of Fractional Calculus. We propose a novel parallel approach for computing these functions, using NVIDIA GPU hardware to speed up the parallel algorithm. A comparison of the sequential, vectorized, and GPU implementations is performed, showing that the parallel computing capabilities of the GPU substantially reduce the computation time of the special mathematical functions.
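The abstract does not name the functions involved, so as an assumed representative example, the Mittag-Leffler function, a standard special function of fractional calculus, gives a sense of the workload. The sequential reference sketch below is mine, not the authors' code; each series term is independent of the others, which is what makes the computation vectorize and map well to a GPU.

```python
from math import gamma

def mittag_leffler(z, alpha, terms=50):
    """Truncated series for the one-parameter Mittag-Leffler function
    E_alpha(z) = sum_{k>=0} z**k / Gamma(alpha*k + 1).
    Sequential reference code: the terms are mutually independent,
    so the sum parallelizes naturally across GPU threads."""
    return sum(z**k / gamma(alpha * k + 1) for k in range(terms))

# Sanity checks against known reductions:
# E_1(z) = exp(z) and E_2(z) = cosh(sqrt(z)).
print(abs(mittag_leffler(1.0, 1.0) - 2.718281828459045) < 1e-9)  # True
print(abs(mittag_leffler(1.0, 2.0) - 1.5430806348152437) < 1e-9)  # True
```

A vectorized version would evaluate all `terms` exponents and gamma values as arrays in one pass; the GPU version assigns term ranges (or distinct evaluation points) to threads and reduces the partial sums.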


2020 ◽  
Vol 15 (177) ◽  
pp. 5-10
Author(s):  
L. M. BORGES ◽  
D. M. TAVARES ◽  
S. J. BACHEGA

2020 ◽  
Vol 48 (4) ◽  
pp. 583-602
Author(s):  
Christopher Brown ◽  
Vladimir Janjic ◽  
M. Goli ◽  
J. McCall

Abstract This paper presents a new technique for introducing and tuning parallelism on heterogeneous shared-memory systems (comprising a mixture of CPUs and GPUs), using a combination of algorithmic skeletons (such as farms and pipelines), Monte Carlo tree search (MCTS) for deriving mappings of tasks to the available hardware resources, and refactoring tool support for applying the patterns and mappings easily and effectively. Using our approach, we demonstrate easily obtainable, significant, and scalable speedups on a number of case studies, of up to 41x over the sequential code on a 24-core machine with one GPU. We also demonstrate that the speedups obtained from mappings derived by the MCTS algorithm are within 5-15% of the best manual parallelisation.
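A farm is the simplest of the skeletons mentioned above: one worker function replicated over a pool, with independent tasks dispatched to whichever replica is free. The sketch below is a minimal Python illustration of the pattern (the names `farm` and `square` are mine; the authors' tool targets C++ and introduces the pattern by refactoring). Threads suffice for the sketch; a CPU-bound farm would use processes or native threads.

```python
from concurrent.futures import ThreadPoolExecutor

def farm(worker, tasks, workers=4):
    """Farm skeleton: replicate `worker` across a pool, dispatch each
    task independently, and gather results in task order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(worker, tasks))

def square(x):
    return x * x

print(farm(square, range(6)))  # [0, 1, 4, 9, 16, 25]
```

The tuning problem the MCTS search addresses is then choosing, for each skeleton instance, how many workers to spawn and on which devices (CPU cores or GPU) they run.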


Author(s):  
Masahiro Nakao ◽  
Tetsuya Odajima ◽  
Hitoshi Murai ◽  
Akihiro Tabuchi ◽  
Norihisa Fujita ◽  
...  

Accelerated clusters, which are cluster systems equipped with accelerators, are among the most common systems in parallel computing. To exploit the performance of such systems, it is important to reduce the communication latency between accelerator memories. There is also a need for a programming language that facilitates maintaining high performance on such systems. The goal of the present article is to evaluate XcalableACC (XACC), a parallel programming language, with tightly coupled accelerators/InfiniBand (TCA/IB) hybrid communication on an accelerated cluster. TCA/IB hybrid communication combines the low latency of TCA with the high bandwidth of IB. The XACC language, which is a directive-based language for accelerated clusters, enables programmers to use TCA/IB hybrid communication with ease. To evaluate the performance of XACC with TCA/IB hybrid communication, we implemented a lattice quantum chromodynamics (LQCD) mini-application and evaluated it on our accelerated cluster using up to 64 compute nodes. For comparison with XACC, we also implemented the LQCD mini-application using a combination of CUDA and MPI (CUDA + MPI) and of OpenACC and MPI (OpenACC + MPI). Performance evaluation revealed that XACC with TCA/IB hybrid communication performs 9% better than CUDA + MPI and 18% better than OpenACC + MPI. Furthermore, the performance of XACC increased by a further 7% through a new extension to XACC. Productivity evaluation revealed that XACC requires far fewer changes to the serial LQCD code to implement the parallel LQCD code than CUDA + MPI and OpenACC + MPI. Moreover, since XACC can perform parallelization while maintaining the sequential code image, it is highly readable and shows excellent portability due to its directive-based approach.
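The rationale for hybrid communication can be captured with a toy cost model: a low-latency path wins for small messages, a high-bandwidth path for large ones, so the runtime picks per message. The sketch below is purely illustrative; the latency and bandwidth figures are invented for the example, not measured values from the article, and `pick_path` is not part of XACC.

```python
def pick_path(size_bytes,
              tca_latency=1e-6, tca_bandwidth=2e9,   # low latency, modest bandwidth
              ib_latency=5e-6, ib_bandwidth=12e9):   # higher latency, high bandwidth
    """Estimate transfer time as latency + size/bandwidth on each path
    and return the cheaper one, mimicking a hybrid TCA/IB selection."""
    t_tca = tca_latency + size_bytes / tca_bandwidth
    t_ib = ib_latency + size_bytes / ib_bandwidth
    return "tca" if t_tca <= t_ib else "ib"

print(pick_path(1024))        # tca: small halo-exchange messages favor low latency
print(pick_path(10_000_000))  # ib: bulk transfers favor high bandwidth
```

With these illustrative numbers the crossover sits near 10 KB; in an LQCD halo exchange, the small boundary messages would take the TCA-like path while bulk data moves over IB.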


Author(s):  
Khalid Alsubhi ◽  
Fawaz Alsolami ◽  
Abdullah Algarni ◽  
Kamal Jambi ◽  
Fathy Eassa ◽  
...  

Author(s):  
Afif J. Almghawish ◽  
Ayman M. Abdalla ◽  
Ahmad B. Marzouq ◽  
...  
