scholarly journals BioShake: a Haskell EDSL for bioinformatics workflows

PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e7223 ◽  
Author(s):  
Justin Bedő

Typical bioinformatics analyses comprise of long running computational workflows. An important part of reproducible research is the management and execution of these workflows to allow robust execution and to minimise errors. BioShake is an embedded domain specific language in Haskell for specifying and executing computational workflows for bioinformatics that significantly reduces the possibility of errors occurring. Unlike other workflow frameworks, BioShake raises many properties to the type level allowing the correctness of a workflow to be statically checked during compilation, catching errors before any lengthy execution process. BioShake builds on the Shake build tool to provide robust dependency tracking, parallel execution, reporting, and resumption capabilities. Finally, BioShake abstracts execution so that jobs can either be executed directly or submitted to a cluster. BioShake is available at http://github.com/PapenfussLab/bioshake.

2019 ◽  
Author(s):  
Justin Bedő

AbstractBackgroundTypical bioinformatics analysis comprise long running computational pipelines. An important part of producing reproducible research is the management and execution of these computational pipelines to allow robust execution and to minimise errors. Bioshake is an embedded domain specific language embedded in Haskell for specifying and executing computational pipelines in bioinformatics that significantly reduces the possibility of errors occurring.ResultsUnlike other pipeline frameworks, Bioshake raises many properties to the type level to allow the correctness of a pipeline to be statically checked during compilation, catching errors before any lengthy execution process. Bioshake builds on the Shake build tool to provide robust dependency tracking, parallel execution, reporting, and resumption capabilities. Finally, Bioshake abstracts execution so that jobs can either be executed directly or submitted to a cluster.ConclusionsBioshake is available at http://github.com/papenfusslab/bioshake.


2018 ◽  
Author(s):  
Luis Pedro Coelho ◽  
Renato Alves ◽  
Paulo Monteiro ◽  
Jaime Huerta-Cepas ◽  
Ana Teresa Freitas ◽  
...  

AbstractNGLess is a domain specific language for describing next-generation sequence processing pipelines. It was developed with the goal of enabling user-friendly computational reproducibility.Using this framework, we developed NG-meta-profiler, a fast profiler for metagenomes which performs sequence preprocessing, mapping to bundled databases, filtering of the mapping results, and profiling (taxonomic and functional). It is significantly faster than either MOCAT2 or htseq-count and (as it builds on NGLess) its results are perfectly reproducible. These pipelines can easily be customized and extended with other tools.NGLess and NG-meta-profiler are open source software (under the liberal MIT licence) and can be downloaded from http://ngless.embl.de or installed through bioconda.


Author(s):  
Jessica Ray ◽  
Ajav Brahmakshatriya ◽  
Richard Wang ◽  
Shoaib Kamil ◽  
Albert Reuther ◽  
...  

2021 ◽  
Vol 205 ◽  
pp. 102610
Author(s):  
Davide Ancona ◽  
Luca Franceschini ◽  
Angelo Ferrando ◽  
Viviana Mascardi

2021 ◽  
pp. 102642
Author(s):  
Xiomarah Guzmán-Guzmán ◽  
Edward Rolando Núñez-Valdez ◽  
Raysa Vásquez-Reynoso ◽  
Angel Asencio ◽  
Vicente García-Díaz

Sign in / Sign up

Export Citation Format

Share Document