MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads

Toshiaki Namiki; Tsuyoshi Hachiya; Hideaki Tanaka; Yasubumi Sakakibara

doi:10.1093/nar/gks678

MetaVelvet : An extension of Velvet assembler to de novo metagenome assembly from short sequence reads v1

protocols.io ◽

10.17504/protocols.io.d7q9mv ◽

2015 ◽

Author(s):

Afiahayati not provided ◽

Sato K ◽

Namiki T ◽

Hachiya T ◽

Tanaka H ◽

...

Keyword(s):

De Novo ◽

Short Sequence ◽

Metagenome Assembly

Download Full-text

METAMVGL: a multi-view graph-based metagenomic contig binning algorithm by integrating assembly and paired-end graphs

BMC Bioinformatics ◽

10.1186/s12859-021-04284-4 ◽

2021 ◽

Vol 22 (S10) ◽

Author(s):

Zhenmiao Zhang ◽

Lu Zhang

Keyword(s):

De Novo ◽

Label Propagation ◽

Next Generation Sequencing Data ◽

Metagenomic Sequencing ◽

Sequencing Data ◽

Fecal Samples ◽

Microbial Genomes ◽

Metagenome Assembly ◽

High Chance ◽

Mock Communities

Abstract Background Due to the complexity of microbial communities, de novo assembly on next generation sequencing data is commonly unable to produce complete microbial genomes. Metagenome assembly binning becomes an essential step that could group the fragmented contigs into clusters to represent microbial genomes based on contigs’ nucleotide compositions and read depths. These features work well on the long contigs, but are not stable for the short ones. Contigs can be linked by sequence overlap (assembly graph) or by the paired-end reads aligned to them (PE graph), where the linked contigs have high chance to be derived from the same clusters. Results We developed METAMVGL, a multi-view graph-based metagenomic contig binning algorithm by integrating both assembly and PE graphs. It could strikingly rescue the short contigs and correct the binning errors from dead ends. METAMVGL learns the two graphs’ weights automatically and predicts the contig labels in a uniform multi-view label propagation framework. In experiments, we observed METAMVGL made use of significantly more high-confidence edges from the combined graph and linked dead ends to the main graph. It also outperformed many state-of-the-art contig binning algorithms, including MaxBin2, MetaBAT2, MyCC, CONCOCT, SolidBin and GraphBin on the metagenomic sequencing data from simulation, two mock communities and Sharon infant fecal samples. Conclusions Our findings demonstrate METAMVGL outstandingly improves the short contig binning and outperforms the other existing contig binning tools on the metagenomic sequencing data from simulation, mock communities and infant fecal samples.

Download Full-text

Practical evaluation of 11 de novo assemblers in metagenome assembly

Journal of Microbiological Methods ◽

10.1016/j.mimet.2018.06.007 ◽

2018 ◽

Vol 151 ◽

pp. 99-105 ◽

Cited By ~ 9

Author(s):

Esmaeil Forouzan ◽

Parvin Shariati ◽

Masoumeh Sadat Mousavi Maleki ◽

Ali Asghar Karkhane ◽

Bagher Yakhchali

Keyword(s):

De Novo ◽

Practical Evaluation ◽

Metagenome Assembly

Download Full-text

Tetrasomy 18p de novo: Identification by FISH with conventional and microdissection probes and analysis of parental origin and formation by short sequence repeat typing

Human Genetics ◽

10.1007/bf02281862 ◽

1996 ◽

Vol 97 (5) ◽

pp. 568-572 ◽

Cited By ~ 13

Author(s):

Thomas Eggermann ◽

Hartmut Engels ◽

Barbara Moskalonek ◽

Markus M. Nöthen ◽

Jutta Müller-Navia ◽

...

Keyword(s):

De Novo ◽

Parental Origin ◽

Short Sequence ◽

Sequence Repeat ◽

Short Sequence Repeat ◽

Tetrasomy 18P

Download Full-text

REPdenovo: Inferring De Novo Repeat Motifs from Short Sequence Reads

PLoS ONE ◽

10.1371/journal.pone.0150719 ◽

2016 ◽

Vol 11 (3) ◽

pp. e0150719 ◽

Cited By ~ 28

Author(s):

Chong Chu ◽

Rasmus Nielsen ◽

Yufeng Wu

Keyword(s):

De Novo ◽

Short Sequence

Download Full-text

Faucet: streaming de novo assembly graph construction

10.1101/125658 ◽

2017 ◽

Author(s):

Roye Rozov ◽

Gil Goldshlager ◽

Eran Halperin ◽

Ron Shamir

Keyword(s):

Resource Use ◽

De Novo ◽

State Of The Art ◽

Supplementary Information ◽

De Bruijn Graph ◽

Assembly Quality ◽

Metagenome Assembly ◽

Streaming Algorithm ◽

Supplementary Material ◽

De Bruijn

AbstractMotivationWe present Faucet, a 2-pass streaming algorithm for assembly graph construction. Faucet builds an assembly graph incrementally as each read is processed. Thus, reads need not be stored locally, as they can be processed while downloading data and then discarded. We demonstrate this functionality by performing streaming graph assembly of publicly available data, and observe that the ratio of disk use to raw data size decreases as coverage is increased.ResultsFaucet pairs the de Bruijn graph obtained from the reads with additional meta-data derived from them. We show these metadata - coverage counts collected at junction k-mers and connections bridging between junction pairs - contain most salient information needed for assembly, and demonstrate they enable cleaning of metagenome assembly graphs, greatly improving contiguity while maintaining accuracy. We compared Faucet’s resource use and assembly quality to state of the art metagenome assemblers, as well as leading resource-efficient genome assemblers. Faucet used orders of magnitude less time and disk space than the specialized metagenome assemblers MetaSPAdes and Megahit, while also improving on their memory use; this broadly matched performance of other assemblers optimizing resource efficiency - namely, Minia and LightAssembler. However, on metagenomes tested, Faucet’s outputs had 14-110% higher mean NGA50 lengths compared to Minia, and 2-11-fold higher mean NGA50 lengths compared to LightAssembler, the only other streaming assembler available.AvailabilityFaucet is available at https://github.com/Shamir-Lab/[email protected],[email protected] information:Supplementary data are available at Bioinformatics online.

Download Full-text

Metagenomes and Assembled Genomes from Diarrhea-Affected Cattle (Bos taurus)

Microbiology Resource Announcements ◽

10.1128/mra.01411-20 ◽

2021 ◽

Vol 10 (7) ◽

Author(s):

Tshepiso Pleasure Ateba ◽

Kazeem Adekunle Alayande ◽

Mulunda Mwanza

Keyword(s):

Bacterial Communities ◽

De Novo ◽

Bos Taurus ◽

Metagenome Assembly

ABSTRACT The de novo metagenome assembly for C1-TPA is 68,577,389 bp long spread over 10,108 contigs, while that of C3-TPA is 55,517,929 bp distributed over 9,415 contigs. A total of 8 metagenome-assembled genomes (MAGs) were extracted from C1-TPA, and 10 were extracted from C3-TPA. Both samples have a Flavobacterium sp. and a Pseudomonas sp. in common among their bacterial communities.

Download Full-text

MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features

Nucleic Acids Research ◽

10.1093/nar/gkab831 ◽

2021 ◽

Author(s):

Rei Kajitani ◽

Hideki Noguchi ◽

Yasuhiro Gotoh ◽

Yoshitoshi Ogura ◽

Dai Yoshimura ◽

...

Keyword(s):

Long Range ◽

Large Scale ◽

De Novo ◽

Gene Clusters ◽

Human Saliva ◽

Bacterial Genomes ◽

Specific Sequence ◽

Metagenome Assembly ◽

Downstream Analysis ◽

Species Specific

Abstract De novo metagenome assembly is effective in assembling multiple draft genomes, including those of uncultured organisms. However, heterogeneity in the metagenome hinders assembly and introduces interspecies misassembly deleterious for downstream analysis. For this purpose, we developed a hybrid metagenome assembler, MetaPlatanus. First, as a characteristic function, it assembles the basic contigs from accurate short reads and then iteratively utilizes long-range sequence links, species-specific sequence compositions, and coverage depth. The binning information was also used to improve contiguity. Benchmarking using mock datasets consisting of known bacteria with long reads or mate pairs revealed the high contiguity MetaPlatanus with a few interspecies misassemblies. For published human gut data with nanopore reads from potable sequencers, MetaPlatanus assembled many biologically important elements, such as coding genes, gene clusters, viral sequences, and over-half bacterial genomes. In the benchmark with published human saliva data with high-throughput nanopore reads, the superiority of MetaPlatanus was considerably more evident. We found that some high-abundance bacterial genomes were assembled only by MetaPlatanus as near-complete. Furthermore, MetaPlatanus can circumvent the limitations of highly fragmented assemblies and frequent interspecies misassembles obtained by the other tools. Overall, the study demonstrates that MetaPlatanus could be an effective approach for exploring large-scale structures in metagenomes.

Download Full-text

Metagenomic analysis of ancient dental calculus reveals unexplored diversity of oral archaeal Methanobrevibacter

Microbiome ◽

10.1186/s40168-021-01132-8 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Lena Granehäll ◽

Kun D. Huang ◽

Adrian Tett ◽

Paolo Manghi ◽

Alice Paladin ◽

...

Keyword(s):

Middle Ages ◽

De Novo ◽

Oral Microbiome ◽

Dental Calculus ◽

Human Skeletal Remains ◽

Metagenome Assembly ◽

Metagenomic Assembly ◽

Source Of Information ◽

Archaeal Genus ◽

Geographical Locations

Abstract Background Dental calculus (mineralised dental plaque) preserves many types of microfossils and biomolecules, including microbial and host DNA, and ancient calculus are thus an important source of information regarding our ancestral human oral microbiome. In this study, we taxonomically characterised the dental calculus microbiome from 20 ancient human skeletal remains originating from Trentino-South Tyrol, Italy, dating from the Neolithic (6000–3500 BCE) to the Early Middle Ages (400–1000 CE). Results We found a high abundance of the archaeal genus Methanobrevibacter in the calculus. However, only a fraction of the sequences showed high similarity to Methanobrevibacter oralis, the only described Methanobrevibacter species in the human oral microbiome so far. To further investigate the diversity of this genus, we used de novo metagenome assembly to reconstruct 11 Methanobrevibacter genomes from the ancient calculus samples. Besides the presence of M. oralis in one of the samples, our phylogenetic analysis revealed two hitherto uncharacterised and unnamed oral Methanobrevibacter species that are prevalent in ancient calculus samples sampled from a broad range of geographical locations and time periods. Conclusions We have shown the potential of using de novo metagenomic assembly on ancient samples to explore microbial diversity and evolution. Our study suggests that there has been a possible shift in the human oral microbiome member Methanobrevibacter over the last millennia.

Download Full-text

Yet another de novo genome assembler

10.1101/656306 ◽

2019 ◽

Cited By ~ 4

Author(s):

Robert Vaser ◽

Mile Šikić

Keyword(s):

De Novo ◽

Sequence Classification ◽

De Novo Genome Assembly ◽

Development Fund ◽

European Regional Development Fund ◽

Sequencing Technologies ◽

Single Genome ◽

Long Read ◽

Metagenome Assembly ◽

Genome Assemblies

AbstractAdvances in sequencing technologies have pushed the limits of genome assemblies beyond imagination. The sheer amount of long read data that is being generated enables the assembly for even the largest and most complex organism for which efficient algorithms are needed. We present a new tool, called Ra, for de novo genome assembly of long uncorrected reads. It is a fast and memory friendly assembler based on sequence classification and assembly graphs, developed with large genomes in mind. It is freely available at https://github.com/lbcb-sci/ra.This work has been supported in part by the Croatian Science Foundation under the project Single genome and metagenome assembly (IP-2018-01-5886), and in part by the European Regional Development Fund under the grant KK.01.1.1.01.0009 (DATACROSS). In addition, M.Š. is partly supported by funding from A*STAR, Singapore.

Download Full-text