INHIBITOR: An intrusion tolerant scheduling algorithm in cloud-based scientific workflow system

A fault-intrusion-tolerant system and deadline-aware algorithm for scheduling scientific workflow in the cloud

PeerJ Computer Science ◽

10.7717/peerj-cs.747 ◽

2021 ◽

Vol 7 ◽

pp. e747

Author(s):

Mazen Farid ◽

Rohaya Latip ◽

Masnida Hussin ◽

Nor Asilah Wati Abdul Hamid

Keyword(s):

Success Rate ◽

Completion Time ◽

Virtual Machines ◽

Scientific Workflow ◽

Scientific Workflows ◽

Completion Rate ◽

Task Completion ◽

Great Cost ◽

Workflow System ◽

Intrusion Tolerant

Background Recent technological developments have enabled the execution of more scientific solutions on cloud platforms. Cloud-based scientific workflows are subject to various risks, such as security breaches and unauthorized access to resources. By attacking side channels or virtual machines, attackers may destroy servers, causing interruption and delay or incorrect output. Although cloud-based scientific workflows are often used for vital computational-intensive tasks, their failure can come at a great cost. Methodology To increase workflow reliability, we propose the Fault and Intrusion-tolerant Workflow Scheduling algorithm (FITSW). The proposed workflow system uses task executors consisting of many virtual machines to carry out workflow tasks. FITSW duplicates each sub-task three times, uses an intermediate data decision-making mechanism, and then employs a deadline partitioning method to determine sub-deadlines for each sub-task. This way, dynamism is achieved in task scheduling using the resource flow. The proposed technique generates or recycles task executors, keeps the workflow clean, and improves efficiency. Experiments were conducted on WorkflowSim to evaluate the effectiveness of FITSW using metrics such as task completion rate, success rate and completion time. Results The results show that FITSW not only raises the success rate by about 12%, it also improves the task completion rate by 6.2% and minimizes the completion time by about 15.6% in comparison with intrusion tolerant scientific workflow ITSW system.

Download Full-text

Process-oriented ecological modeling approach and scientific workflow system

Biodiversity Science ◽

10.3724/sp.j.1003.2014.13267 ◽

2014 ◽

Vol 22 (3) ◽

pp. 277

Author(s):

Qiao Huijie ◽

Lin Congtian ◽

Wang Jiangning ◽

Ji Liqiang

Keyword(s):

Ecological Modeling ◽

Scientific Workflow ◽

Modeling Approach ◽

Workflow System ◽

Process Oriented

Download Full-text

Workspace – a Scientific Workflow System with commercial impact

El Sawah, S. (ed.) MODSIM2019, 23rd International Congress on Modelling and Simulation. ◽

10.36334/modsim.2019.d2.oakes ◽

2019 ◽

Keyword(s):

Scientific Workflow ◽

Workflow System

Download Full-text

Performance Driven Design Optimisation with Scientific Workflow System

International Conference on Green Buildings and Optimization Design (GBOD 2012) ◽

10.1115/1.860137_ch25 ◽

2012 ◽

pp. 189-196

Keyword(s):

Scientific Workflow ◽

Design Optimisation ◽

Workflow System

Download Full-text

Application Scenarios Using Serpens Suite for Kepler Scientific Workflow System

Procedia Computer Science ◽

10.1016/j.procs.2012.04.176 ◽

2012 ◽

Vol 9 ◽

pp. 1604-1613 ◽

Cited By ~ 1

Author(s):

Marcin Płóciennik ◽

Michał Owsiak ◽

Tomasz Zok ◽

Bartek Palak ◽

Antonio Gómez-Iglesias ◽

...

Keyword(s):

Scientific Workflow ◽

Workflow System

Download Full-text

Early Cloud Experiences with the Kepler Scientific Workflow System

Procedia Computer Science ◽

10.1016/j.procs.2012.04.179 ◽

2012 ◽

Vol 9 ◽

pp. 1630-1634 ◽

Cited By ~ 15

Author(s):

Jianwu Wang ◽

Ilkay Altintas

Keyword(s):

Scientific Workflow ◽

Workflow System

Download Full-text

Adapting Medical Image Processing Tasks to a Scalable Scientific Workflow System

2014 IEEE World Congress on Services ◽

10.1109/services.2014.74 ◽

2014 ◽

Cited By ~ 1

Author(s):

Hajar Hamidian ◽

Shiyong Lu ◽

Satyendra Rana ◽

Farshad Fotouhi ◽

Hamid Soltanian-Zadeh

Keyword(s):

Image Processing ◽

Medical Image ◽

Medical Image Processing ◽

Scientific Workflow ◽

Workflow System

Download Full-text

The Research of Execute Unit's State Mechanism in Scientific Workflow System

2011 4th International Conference on Intelligent Networks and Intelligent Systems ◽

10.1109/icinis.2011.63 ◽

2011 ◽

Author(s):

Yanbo Geng ◽

Hui Deng ◽

Feng Wang ◽

Kaifan Ji ◽

Bo Liang ◽

...

Keyword(s):

Scientific Workflow ◽

Workflow System

Download Full-text

A GPU-based high performance computing infrastructure for specialized NGS analyses

10.7287/peerj.preprints.2175v1 ◽

2016 ◽

Author(s):

Andrea Manconi ◽

Marco Moscatelli ◽

Matteo Gnocchi ◽

Giuliano Armano ◽

Luciano Milanesi

Keyword(s):

Gpu Computing ◽

Scientific Workflow ◽

Biological Data ◽

Single Server ◽

Web Based ◽

Continuous Increase ◽

Gpu Cluster ◽

Workflow System ◽

The Galaxy ◽

Workload Manager

Motivation Recent advances in genome sequencing and biological data analysis technologies used in bioinformatics have led to a fast and continuous increase in biological data. The difficulty of managing the huge amounts of data currently available to researchers and the need to have results within a reasonable time have led to the use of distributed and parallel computing infrastructures for their analysis. Recently, bioinformatics is exploring new approaches based on the use of hardware accelerators as GPUs. From an architectural perspective, GPUs are very different from traditional CPUs. Indeed, the latter are devices composed of few cores with lots of cache memory able to handle a few software threads at a time. Conversely, the former are devices equipped with hundreds of cores able to handle thousands of threads simultaneously, so that a very high level of parallelism can be reached. Use of GPUs over the last years has resulted in significant increases in the performance of certain applications. Despite GPUs are increasingly used in bioinformatics most laboratories do not have access to a GPU cluster or server. In this context, it is very important to provide useful services to use these tools. Methods A web-based platform has been implemented with the aim to enable researchers to perform their analysis through dedicated GPU-based computing resources. To this end, a GPU cluster equipped with 16 NVIDIA Tesla k20c cards has been configured. The infrastructure has been built upon the Galaxy technology [1]. Galaxy is an open web-based scientific workflow system for data intensive biomedical research accessible to researchers that do not have programming experience. Let us recall that Galaxy provides a public server, but it does not provide support to GPU-computing. By default, Galaxy is designed to run jobs on local systems. However, it can also be configured to run jobs on a cluster. The front-end Galaxy application runs on a single server, but tools are run on cluster nodes instead. To this end, Galaxy supports different distributed resource managers with the aim to enable different clusters. For the specific case, in our opinion SLURM [2] represents the most suitable workload manager to manage and control jobs. SLURM is a highly configurable workload and resource manager and it is currently used on six of the ten most powerful computers in the world including the Piz Daint, utilizing over 5000 NVIDIA Tesla K20 GPUs. Results GPU-based tools [3] devised by our group for quality control of NGS data have been used to test the infrastructure. Initially, this activity required to make changes to the tools with the aim to optimize the parallelization on the cluster according to the adopted workload manager. Successively, the tools have been converted into web-based services accessible through the Galaxy portal. Abstract truncated at 3,000 characters - the full version is available in the pdf file.

Download Full-text

Cost Effective Heuristic workflow scheduling algorithm in Cloud under Deadline Constraint

Recent Patents on Computer Science ◽

10.2174/2213275912666190822113039 ◽

2019 ◽

Vol 12 ◽

Author(s):

Jasraj Meena ◽

Manu Vardhan

Keyword(s):

Cloud Computing ◽

Scheduling Algorithm ◽

Cost Effective ◽

Scientific Workflow ◽

Scientific Workflows ◽

Workflow Scheduling ◽

Performance Variation ◽

Acquisition Delay ◽

Very High ◽

Deadline Constraint

Cloud computing is used to deliver IT resources over the internet. Due to the popularity of cloud computing, nowadays, most of the scientific workflows are shifted towards this environment. There are lots of algorithms has been proposed in the literature to schedule scientific workflows in the cloud, but their execution cost is very high as well as they are not meeting the user-defined deadline constraint. This paper focuses on satisfying the userdefined deadline of a scientific workflow while minimizing the total execution cost. So, to achieve this, we have proposed a Cost-Effective under Deadline (CEuD) constraint workflow scheduling algorithm. The proposed CEuD algorithm considers all the essential features of Cloud and resolves the major issues such as performance variation, and acquisition delay. We have compared the proposed CEuD algorithm with the existing literature algorithms for scientific workflows (i.e., Montage, Epigenomics, and CyberShake) and getting better results for minimizing the overall execution cost of the workflow while satisfying the user-defined deadline.

Download Full-text