scholarly journals De novo species identification using 16S rRNA gene nanopore sequencing

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e10029
Author(s):  
Inga Leena Angell ◽  
Morten Nilsen ◽  
Karin C. Lødrup Carlsen ◽  
Kai-Håkon Carlsen ◽  
Gunilla Hedlin ◽  
...  

Nanopore sequencing is rapidly becoming more popular for use in various microbiota-based applications. Major limitations of current approaches are that they do not enable de novo species identification and that they cannot be used to verify species assignments. This severely limits applicability of the nanopore sequencing technology in taxonomic applications. Here, we demonstrate the possibility of de novo species identification and verification using hexamer frequencies in combination with k-means clustering for nanopore sequencing data. The approach was tested on the human infant gut microbiota of 3-month-old infants. Using the hexamer k-means approach we identified two new low abundant species associated with vaginal delivery. In addition, we confirmed both the vaginal delivery association for two previously identified species and the overall high levels of bifidobacteria. Taxonomic assignments were further verified by mock community analyses. Therefore, we believe our de novo species identification approach will have widespread application in analyzing microbial communities in the future.

2017 ◽  
Author(s):  
Joe Parker ◽  
Andrew J. Helmstetter ◽  
Dion Devey ◽  
Alexander S.T. Papadopulos

Advances in DNA sequencing and informatics have revolutionised biology over the past four decades, but technological limitations have left many applications unexplored1,2. Recently, portable, real-time, nanopore sequencing (RTnS) has become available. This offers opportunities to rapidly collect and analyse genomic data anywhere3–5. However, the generation of datasets from large, complex genomes has been constrained to laboratories6,7. The portability and long DNA sequences of RTnS offer great potential for field-based species identification, but the feasibility and accuracy of these technologies for this purpose have not been assessed. Here, we show that a field-based RTnS analysis of closely-related plant species (Arabidopsis spp.)8 has many advantages over laboratory-based high-throughput sequencing (HTS) methods for species level identification-by-sequencing and de novo phylogenomics. Samples were collected and sequenced in a single day by RTnS using a portable, “al fresco” laboratory. Our analyses demonstrate that correctly identifying unknown reads from matches to a reference database with RTnS reads enables rapid and confident species identification. Individually annotated RTnS reads can be used to infer the evolutionary relationships of A. thaliana. Furthermore, hybrid genome assembly with RTnS and HTS reads substantially improved upon a genome assembled from HTS reads alone. Field-based RTnS makes real-time, rapid specimen identification and genome wide analyses possible. These technological advances are set to revolutionise research in the biological sciences9 and have broad implications for conservation, taxonomy, border agencies and citizen science.


2021 ◽  
Vol 7 (2) ◽  
pp. 831-834
Author(s):  
Chiara Becht ◽  
Jonas Schmidt ◽  
Frithjof Blessing ◽  
Folker Wenzel

Abstract INTRODUCTION: Long-read sequencing techniques such as Oxford Nanopore sequencing, are representing a promising novel approach in molecular-biological methodology, enabling potential facilitation in mapping and de novo assembly. In comparison to conventional sequencing methods, novel alignment tools are mandated to compensate differing data structures (especially high error rate) to achieve acceptably accurate analysis results. METHODS: In this study, benchmarking for long read aligners BLASR, GraphMap, LAST, minimap2, NGMLR and the short-read aligner BWA MEM on three experimental datasets was conducted. Obtained alignment results were compared for various quality and performance criteria, such as match rate, mismatch rate, error rate, working memory usage and computational time. RESULTS: The comparison yielded differences in alignment quality and performance of tools under test. Tool LAST showed the largest differences among all tools. Minimap2 achieved constant quality with good performance. BLASR, GraphMap, BWA MEM and NGMLR showed slight differences only. CONCLUSION: Differences among the tools could be reasoned with dataset characteristics and algorithm approaches of individual tools. All tools except BLASR seem applicable for Nanopore sequencing data. Therefore, selection of the tool should be done under consideration of the experimental design and the further downstream analysis


2022 ◽  
Vol 10 (1) ◽  
pp. 142
Author(s):  
Hye Won Kim ◽  
Na Kyung Kim ◽  
Alex P. R. Phillips ◽  
David A. Parker ◽  
Ping Liu ◽  
...  

Verrucomicrobiotal methanotrophs are thermoacidophilic methane oxidizers that have been isolated from volcanic and geothermal regions of the world. We used a metagenomic approach that entailed obtaining the whole genome sequence of a verrucomicrobiotal methanotroph from a microbial consortium enriched from samples obtained from Nymph Lake (89.9 °C, pH 2.73) in Yellowstone National Park in the USA. To identify and reconstruct the verrucomicrobiotal genome from Illumina NovaSeq 6000 sequencing data, we constructed a bioinformatic pipeline with various combinations of de novo assembly, alignment, and binning algorithms. Based on the marker gene (pmoA), we identified and assembled the Candidatus Methylacidiphilum sp. YNP IV genome (2.47 Mbp, 2392 ORF, and 41.26% GC content). In a comparison of average nucleotide identity between Ca. Methylacidiphilum sp. YNP IV and Ca. Methylacidiphilum fumariolicum SolV, its closest 16S rRNA gene sequence relative, is lower than 95%, suggesting that Ca. Methylacidiphilum sp. YNP IV can be regarded as a different species. The Ca. Methylacidiphilum sp. YNP IV genome assembly showed most of the key genes for methane metabolism, the CBB pathway for CO2 fixation, nitrogen fixation and assimilation, hydrogenases, and rare earth elements transporter, as well as defense mechanisms. The assembly and reconstruction of a thermoacidophilic methanotroph belonging to the Verrucomicrobiota phylum from a geothermal environment adds further evidence and knowledge concerning the diversity of biological methane oxidation and on the adaptation of this geochemically relevant reaction in extreme environments.


2017 ◽  
Author(s):  
Maximilian H.-W. Schmidt ◽  
Alxander Vogel ◽  
Alisandra K. Denton ◽  
Benjamin Istace ◽  
Alexandra Wormit ◽  
...  

Recent updates in sequencing technology have made it possible to obtain Gigabases of sequence data from one single flowcell. Prior to this update, the nanopore sequencing technology was mainly used to analyze and assemble microbial samples1-3. Here, we describe the generation of a comprehensive nanopore sequencing dataset with a median fragment size of 11,979 bp for the wild tomato species Solanum pennellii featuring an estimated genome size of ca 1.0 to 1.1 Gbases. We describe its genome assembly to a contig N50 of 2.5 MB using a pipeline comprising a Canu4 pre-processing and a subsequent assembly using SMARTdenovo. We show that the obtained nanopore based de novo genome reconstruction is structurally highly similar to that of the reference S. pennellii LA7165 genome but has a high error rate caused mostly by deletions in homopolymers. After polishing the assembly with Illumina short read data we obtained an error rate of <0.02 % when assessed versus the same Illumina data. More importantly however we obtained a gene completeness of 96.53% which even slightly surpasses that of the reference S. pennellii genome5. Taken together our data indicate such long read sequencing data can be used to affordably sequence and assemble Gbase sized diploid plant genomes.Raw data is available at http://www.plabipd.de/portal/solanum-pennellii and has been deposited as PRJEB19787.


2015 ◽  
Vol 12 (8) ◽  
pp. 733-735 ◽  
Author(s):  
Nicholas J Loman ◽  
Joshua Quick ◽  
Jared T Simpson

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Oksana Kutsyr ◽  
Lucía Maestre-Carballa ◽  
Mónica Lluesma-Gomez ◽  
Manuel Martinez-Garcia ◽  
Nicolás Cuenca ◽  
...  

AbstractThe gut microbiome is known to influence the pathogenesis and progression of neurodegenerative diseases. However, there has been relatively little focus upon the implications of the gut microbiome in retinal diseases such as retinitis pigmentosa (RP). Here, we investigated changes in gut microbiome composition linked to RP, by assessing both retinal degeneration and gut microbiome in the rd10 mouse model of RP as compared to control C57BL/6J mice. In rd10 mice, retinal responsiveness to flashlight stimuli and visual acuity were deteriorated with respect to observed in age-matched control mice. This functional decline in dystrophic animals was accompanied by photoreceptor loss, morphologic anomalies in photoreceptor cells and retinal reactive gliosis. Furthermore, 16S rRNA gene amplicon sequencing data showed a microbial gut dysbiosis with differences in alpha and beta diversity at the genera, species and amplicon sequence variants (ASV) levels between dystrophic and control mice. Remarkably, four fairly common ASV in healthy gut microbiome belonging to Rikenella spp., Muribaculaceace spp., Prevotellaceae UCG-001 spp., and Bacilli spp. were absent in the gut microbiome of retinal disease mice, while Bacteroides caecimuris was significantly enriched in mice with RP. The results indicate that retinal degenerative changes in RP are linked to relevant gut microbiome changes. The findings suggest that microbiome shifting could be considered as potential biomarker and therapeutic target for retinal degenerative diseases.


2021 ◽  
Vol 22 (S10) ◽  
Author(s):  
Zhenmiao Zhang ◽  
Lu Zhang

Abstract Background Due to the complexity of microbial communities, de novo assembly on next generation sequencing data is commonly unable to produce complete microbial genomes. Metagenome assembly binning becomes an essential step that could group the fragmented contigs into clusters to represent microbial genomes based on contigs’ nucleotide compositions and read depths. These features work well on the long contigs, but are not stable for the short ones. Contigs can be linked by sequence overlap (assembly graph) or by the paired-end reads aligned to them (PE graph), where the linked contigs have high chance to be derived from the same clusters. Results We developed METAMVGL, a multi-view graph-based metagenomic contig binning algorithm by integrating both assembly and PE graphs. It could strikingly rescue the short contigs and correct the binning errors from dead ends. METAMVGL learns the two graphs’ weights automatically and predicts the contig labels in a uniform multi-view label propagation framework. In experiments, we observed METAMVGL made use of significantly more high-confidence edges from the combined graph and linked dead ends to the main graph. It also outperformed many state-of-the-art contig binning algorithms, including MaxBin2, MetaBAT2, MyCC, CONCOCT, SolidBin and GraphBin on the metagenomic sequencing data from simulation, two mock communities and Sharon infant fecal samples. Conclusions Our findings demonstrate METAMVGL outstandingly improves the short contig binning and outperforms the other existing contig binning tools on the metagenomic sequencing data from simulation, mock communities and infant fecal samples.


Pathogens ◽  
2021 ◽  
Vol 10 (4) ◽  
pp. 396
Author(s):  
Ewa Sajnaga ◽  
Marcin Skowronek ◽  
Agnieszka Kalwasińska ◽  
Waldemar Kazimierczak ◽  
Karolina Ferenc ◽  
...  

This study focused on the potential relationships between midgut microbiota of the common cockchafer Melolontha melolontha larvae and their resistance to entomopathogenic nematodes (EPN) infection. We investigated the bacterial community associated with control and unsusceptible EPN-exposed insects through nanopore sequencing of the 16S rRNA gene. Firmicutes, Proteobacteria, Actinobacteria, and Bacteroidetes were the most abundant bacterial phyla within the complex and variable midgut microbiota of the wild M. melolontha larvae. The core microbiota was found to include 82 genera, which accounted for 3.4% of the total number of identified genera. The EPN-resistant larvae differed significantly from the control ones in the abundance of many genera belonging to the Actinomycetales, Rhizobiales, and Clostridiales orders. Additionally, the analysis of the microbiome networks revealed different sets of keystone midgut bacterial genera between these two groups of insects, indicating differences in the mutual interactions between bacteria. Finally, we detected Xenorhabdus and Photorhabdus as gut residents and various bacterial species exhibiting antagonistic activity against these entomopathogens. This study paves the way to further research aimed at unravelling the role of the host gut microbiota on the output of EPN infection, which may contribute to enhancement of the efficiency of nematodes used in eco-friendly pest management.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Lidong Guo ◽  
Mengyang Xu ◽  
Wenchao Wang ◽  
Shengqiang Gu ◽  
Xia Zhao ◽  
...  

Abstract Background Synthetic long reads (SLR) with long-range co-barcoding information are now widely applied in genomics research. Although several tools have been developed for each specific SLR technique, a robust standalone scaffolder with high efficiency is warranted for hybrid genome assembly. Results In this work, we developed a standalone scaffolding tool, SLR-superscaffolder, to link together contigs in draft assemblies using co-barcoding and paired-end read information. Our top-to-bottom scheme first builds a global scaffold graph based on Jaccard Similarity to determine the order and orientation of contigs, and then locally improves the scaffolds with the aid of paired-end information. We also exploited a screening algorithm to reduce the negative effect of misassembled contigs in the input assembly. We applied SLR-superscaffolder to a human single tube long fragment read sequencing dataset and increased the scaffold NG50 of its corresponding draft assembly 1349 fold. Moreover, benchmarking on different input contigs showed that this approach overall outperformed existing SLR scaffolders, providing longer contiguity and fewer misassemblies, especially for short contigs assembled by next-generation sequencing data. The open-source code of SLR-superscaffolder is available at https://github.com/BGI-Qingdao/SLR-superscaffolder. Conclusions SLR-superscaffolder can dramatically improve the contiguity of a draft assembly by integrating a hybrid assembly strategy.


Sign in / Sign up

Export Citation Format

Share Document