Spill-free parallel scheduling of basic blocks

A Parallel Scheduling Algorithm for Solving Transport Equations

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2010.00833 ◽

2010 ◽

Vol 33 (5) ◽

pp. 833-840

Author(s):

Di-Yu ZHOU ◽

Jie LIU

Keyword(s):

Scheduling Algorithm ◽

Transport Equations ◽

Parallel Scheduling


Correct program parallelisations

International Journal on Software Tools for Technology Transfer ◽

10.1007/s10009-020-00601-z ◽

2021 ◽

Author(s):

S. Blom ◽

S. Darabi ◽

M. Huisman ◽

M. Safari

Keyword(s):

Composition Operators ◽

Parallel Programs ◽

Parallel Program ◽

Tool Support ◽

Intermediate Representation ◽

Data Race ◽

Sequential Program ◽

Functional Correctness ◽

Correct Program ◽

Basic Blocks

Abstract: A commonly used approach to develop deterministic parallel programs is to augment a sequential program with compiler directives that indicate which program blocks may potentially be executed in parallel. This paper develops a verification technique to reason about such compiler directives, in particular to show that they do not change the behaviour of the program. Moreover, the verification technique is tool-supported and can be combined with proving functional correctness of the program. To develop our verification technique, we propose a simple intermediate representation (syntax and semantics) that captures the main forms of deterministic parallel programs. This language distinguishes three kinds of basic blocks: parallel, vectorised and sequential blocks, which can be composed using three different composition operators: sequential, parallel and fusion composition. We show how a widely used subset of OpenMP can be encoded into this intermediate representation. Our verification technique builds on the notion of iteration contract to specify the behaviour of basic blocks; we show that if iteration contracts are manually specified for single blocks, then that is sufficient to automatically reason about data race freedom of the composed program. Moreover, we also show that it is sufficient to establish functional correctness on a linearised version of the original program to conclude functional correctness of the parallel program. Finally, we exemplify our approach on an example OpenMP program, and we discuss how tool support is provided.
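As a rough illustration of the idea behind per-iteration specifications, the sketch below models each loop iteration by the set of locations it reads and the set it writes, and reports a parallel loop as race-free when no iteration writes a location that another iteration reads or writes. This is a simplified stand-in, not the formalism of the paper: the function name `race_free` and the `(array, index)` encoding of memory locations are assumptions made for the example.

```python
def race_free(contracts):
    """Check pairwise independence of loop iterations.

    `contracts` is a list with one (reads, writes) pair of sets per
    iteration. Hypothetical encoding: a location is an (array, index) tuple.
    """
    for i, (reads_i, writes_i) in enumerate(contracts):
        for j in range(i + 1, len(contracts)):
            reads_j, writes_j = contracts[j]
            # A conflict exists if one iteration writes what the other
            # reads or writes.
            if writes_i & (reads_j | writes_j) or writes_j & reads_i:
                return False
    return True


n = 4
# Loop body a[i] = b[i] + 1: every iteration touches its own elements.
independent = [({("b", i)}, {("a", i)}) for i in range(n)]
# Loop body a[i] = a[i-1] + 1: iteration i reads what iteration i-1 writes.
carried = [({("a", i - 1)}, {("a", i)}) for i in range(1, n)]

print(race_free(independent))  # True
print(race_free(carried))      # False
```

The second loop fails the check because of its loop-carried dependence, which is the kind of directive misuse such a verification technique is meant to rule out.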


A Survey of Low-Energy Parallel Scheduling Algorithms

IEEE Transactions on Sustainable Computing ◽

10.1109/tsusc.2021.3057983 ◽

2021 ◽

pp. 1-1

Author(s):

Guoqi Xie ◽

Xiongren Xiao ◽

Hao Peng ◽

Renfa Li ◽

Keqin Li

Keyword(s):

Scheduling Algorithms ◽

Low Energy ◽

Parallel Scheduling


Parallel Scheduling Algorithms Investigation of Support Strict Resource Reservation from Grid

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.519-520.108 ◽

2014 ◽

Vol 519-520 ◽

pp. 108-113 ◽

Cited By ~ 2

Author(s):

Jun Chen ◽

Bo Li ◽

Er Fei Wang

Keyword(s):

Parallel Computing ◽

Resource Utilization ◽

Scheduling Algorithm ◽

Scheduling Algorithms ◽

Resource Reservation ◽

Parallel Scheduling ◽

Scheduling Model ◽

Computing Grid ◽

Simulation Results ◽

Reservation Request

This paper studies strict resource reservation mechanisms in parallel computing grids and proposes a scheduling model and algorithms that support strict resource reservation requests. Building on an analysis of two important parallel scheduling algorithms, FCFS and EASY backfill, it presents four parallel scheduling algorithms that support resource reservation. Simulations compare the four algorithms on resource utilization, bounded job slowdown, and the success rate of Advanced Reservation (AR) jobs. The results show that the EASY backfill + firstfit algorithm can ensure QoS for AR jobs while maintaining good performance for non-AR jobs.


Parallel Scheduling Algorithms

Operations Research ◽

10.1287/opre.31.1.24 ◽

1983 ◽

Vol 31 (1) ◽

pp. 24-49 ◽

Cited By ~ 44

Author(s):

Eliezer Dekel ◽

Sartaj Sahni

Keyword(s):

Scheduling Algorithms ◽

Parallel Scheduling


Verifying the correctness of compiler transformations on basic blocks using abstract interpretation

ACM SIGPLAN Notices ◽

10.1145/115866.115877 ◽

1991 ◽

Vol 26 (9) ◽

pp. 106-115 ◽

Cited By ~ 5

Author(s):

Timothy S. McNerney

Keyword(s):

Abstract Interpretation ◽

Basic Blocks


Parallel Scheduling of Multiple SDF Graphs Onto Heterogeneous Processors

IEEE Access ◽

10.1109/access.2021.3054725 ◽

2021 ◽

Vol 9 ◽

pp. 20493-20507

Author(s):

Dowhan Jeong ◽

Jangryul Kim ◽

Mari-Liis Oldja ◽

Soonhoi Ha

Keyword(s):

Parallel Scheduling ◽

Heterogeneous Processors


Parallel Scheduling of Large-Scale Tasks for Industrial Cloud-Edge Collaboration

IEEE Internet of Things Journal ◽

10.1109/jiot.2021.3139689 ◽

2021 ◽

pp. 1-1

Author(s):

Yuanjun Laili ◽

Fuqiang Guo ◽

Lei Ren ◽

Xiang Li ◽

Yulin Li ◽

...

Keyword(s):

Large Scale ◽

Parallel Scheduling ◽

Industrial Cloud


Dependency Graph-based High-level Synthesis for Maximum Instruction Parallelism

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3468875 ◽

2021 ◽

Vol 14 (4) ◽

pp. 1-15

Author(s):

Zhenghua Gu ◽

Wenqing Wan ◽

Jundong Xie ◽

Chang Wu

Keyword(s):

Performance Optimization ◽

Directed Acyclic Graph ◽

Scheduling Algorithm ◽

Dependency Graph ◽

High Level Synthesis ◽

Limiting Factor ◽

Circuit Performance ◽

State Transition Graph ◽

High Level ◽

Basic Blocks

Performance optimization is an important goal for High-level Synthesis (HLS). Existing HLS scheduling algorithms are all based on Control and Data Flow Graph (CDFG) and will schedule basic blocks in sequential order. Our study shows that the sequential scheduling order of basic blocks is a big limiting factor for achievable circuit performance. In this article, we propose a Dependency Graph (DG) with two important properties for scheduling. First, DG is a directed acyclic graph. Thus, no loop breaking heuristic is needed for scheduling. Second, DG can be used to identify the exact instruction parallelism. Our experiment shows that DG can lead to 76% instruction parallelism increase over CDFG. Based on DG, we propose a bottom-up scheduling algorithm to achieve much higher instruction parallelism than existing algorithms. Hierarchical state transition graph with guard conditions is proposed for efficient implementation of such high parallelism scheduling. Our experimental results show that our DG-based HLS algorithm can outperform the CDFG-based LegUp and the state-of-the-art industrial tool Vivado HLS by 2.88× and 1.29× on circuit latency, respectively.
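To make the role of an acyclic dependency graph concrete, the sketch below groups instructions into steps so that each instruction runs as early as its dependencies allow and every instruction in the same step can execute in parallel. This is a generic as-soon-as-possible levelling, not the bottom-up algorithm the article proposes. The function name `asap_schedule` and the instruction names are hypothetical, and the sketch assumes every instruction appears as a key in `deps`.

```python
def asap_schedule(deps):
    """Group instructions into parallel steps.

    `deps` maps each instruction to the set of instructions it depends on.
    Assumption: every instruction appears as a key, possibly with an
    empty set.
    """
    remaining = {node: set(d) for node, d in deps.items()}
    schedule = []
    while remaining:
        # Instructions with no unmet dependencies can run in this step.
        ready = sorted(node for node, d in remaining.items() if not d)
        if not ready:
            raise ValueError("dependency graph has a cycle")
        schedule.append(ready)
        for node in ready:
            del remaining[node]
        for d in remaining.values():
            d.difference_update(ready)
    return schedule


deps = {
    "a": set(),
    "b": set(),
    "c": {"a", "b"},
    "d": {"a"},
    "e": {"c", "d"},
}
print(asap_schedule(deps))  # [['a', 'b'], ['c', 'd'], ['e']]
```

Five instructions finish in three steps here, whereas a strictly sequential order would take five. Because the graph is acyclic, the loop always finds at least one ready instruction, so no loop-breaking heuristic is needed.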


A highly parallel scheduling model for IT change management

Novel Algorithms and Techniques in Telecommunications and Networking ◽

10.1007/978-90-481-3662-9_62 ◽

2009 ◽

pp. 361-366

Author(s):

Denílson Cursino Oliveira ◽

Raimir Holanda Filho

Keyword(s):

Change Management ◽

Parallel Scheduling ◽

Scheduling Model
