scholarly journals Alpha diversity metrics for noisy OTUs

2018 ◽  
Author(s):  
Robert C. Edgar ◽  
Henrik Flyvbjerg

AbstractNext-generation sequencing (NGS) of marker genes such as 16S ribosomal RNA is widely used to survey microbial communities. The in-sample (alpha) diversity of Operational Taxonomic Units (OTUs) is often summarized by metrics such as richness or entropy which are calculated from observed abundances, or by estimators such as Chao1 which extrapolate to unobserved OTUs. Most such measures are adopted from traditional biodiversity studies, where observational error can often be neglected. However, errors introduced by next-generation amplicon sequencing tend to induce spurious OTUs and spurious counts in OTU tables, both of which are especially prevalent at low abundances. In consequence, traditional metrics may be grossly inaccurate if they are naively applied to NGS OTU tables. In this work, we describe two novel alpha diversity estimators which are calculated from OTU abundances above a specified threshold. The singleton-free estimator (SFE) is a non-parametric estimator which is derived from a similar approach to Chao1 but extrapolates using doublet and triplet abundances rather than singletons and doublets. The octave estimator (OE) fits a log-normal distribution to non-singleton bars of an octave plot. We show that these estimators are effective under suitable conditions, but these conditions rarely apply in practice. We conclude that extrapolating to unobserved OTUs remains an open problem which is unlikely to be solved in the near future.

2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Ruoyao Ni ◽  
Nian Liu ◽  
Mei Li ◽  
Weiping Qian ◽  
Xinghui Qiu

Abstract Background Anopheles sinensis is a dominant vector for malaria transmission in Asian countries. Voltage-gated sodium channel (VGSC) mutation-mediated knock-down resistance (kdr) has developed in many A. sinensis populations because of intensive and long-term use of pyrethroids. Our previous study showed that multiple mutations at position 1014 of the VGSC were heterogeneously distributed in A. sinensis populations across Sichuan, China. Methods To understand resistance genotypes at the haplotype level and reconstruct the phylogenetic relationship of VGSC haplotypes, a cost-effective next-generation sequencing (NGS)-based amplicon sequencing approach was established to clarify haplotypes containing codon 1014 of the VGSC gene from a total of 446 adults collected in 12 locations of Sichuan, China. Results Nineteen (19) haplotypes were identified, including 11 wild 1014L, 6 resistance 1014F, and 2 resistance 1014C haplotypes. We found that resistance haplotypes of A. sinensis VGSC were widely distributed at frequencies ranging from 3.67 to 92.61%. The frequencies of the 1014C haplotype in the southeast of Sichuan (Luzhou, Guangan, and Suining) were relatively higher than those in other sampling locations. Phylogenetic analyses support that kdr-type mutation at position 1014 is not singly originated and resistance 1014C haplotypes evolve from TTT-encoding 1014F. Conclusions A cost-effective next-generation sequencing (NGS)-based amplicon sequencing approach has been established in this study. The data revealed the patchy distribution of VGSC resistance haplotypes with overall high frequencies in Sichuan, China. Phylogenetic analyses support multiple origins and sequential evolution (1014L → 1014F → 1014C) for kdr-type mutations in A. sinensis. Graphical abstract


2018 ◽  
Author(s):  
Robert C. Edgar ◽  
Henrik Flyvbjerg

AbstractNext-generation sequencing of marker genes such as 16S ribosomal RNA is widely used to survey microbial communities. The abundance distribution (AD) of Operational Taxonomic Units (OTUs) in a sample is typically summarized by alpha diversity metrics, e.g. richness and entropy, discarding information about the AD shape. In this work, we describe octave plots, histograms which visualize the shape of microbial ADs by binning on a logarithmic scale with base 2. Optionally, histogram bars are colored to indicate possible spurious OTUs due to sequence error and cross-talk. Octave plots enable assessment of (a) the shape and completeness of the distribution, (b) the effects of noise on measured diversity, (c) whether low-abundance OTUs should be discarded, (d) whether alpha diversity metrics and estimators are reliable, and (e) the additional sampling effort (i.e., read depth) required to obtain a complete census of the community. The utility of octave plots is illustrated in a re-analysis of a prostate cancer study showing that the reported core microbiome is most likely an artifact of experimental error.


Author(s):  
Altuğ Koç ◽  
Elçin Bora ◽  
Tayfun Cinleti ◽  
Gizem Yıldız ◽  
Meral Torun Bayram ◽  
...  

2020 ◽  
Vol 16 ◽  
Author(s):  
Pelin Telkoparan-Akillilar ◽  
Dilek Cevik

Background: Numerous sequencing techniques have been progressed since the 1960s with the rapid development of molecular biology studies focusing on DNA and RNA. Methods: a great number of articles, book chapters, websites are reviewed, and the studies covering NGS history, technology and applications to cancer therapy are included in the present article. Results: High throughput next-generation sequencing (NGS) technologies offer many advantages over classical Sanger sequencing with decreasing cost per base and increasing sequencing efficiency. NGS technologies are combined with bioinformatics software to sequence genomes to be used in diagnostics, transcriptomics, epidemiologic and clinical trials in biomedical sciences. The NGS technology has also been successfully used in drug discovery for the treatment of different cancer types. Conclusion: This review focuses on current and potential applications of NGS in various stages of drug discovery process, from target identification through to personalized medicine.


Diagnostics ◽  
2020 ◽  
Vol 10 (11) ◽  
pp. 962
Author(s):  
Dario de Biase ◽  
Matteo Fassan ◽  
Umberto Malapelle

Next-Generation Sequencing (NGS) allows for the sequencing of multiple genes at a very high depth of coverage [...]


Sign in / Sign up

Export Citation Format

Share Document