scholarly journals Using High-Throughput Amplicon Sequencing to Evaluate Intragenomic Variation and Accuracy in Species Identification of Cordyceps Species

2021 ◽  
Vol 7 (9) ◽  
pp. 767
Author(s):  
Soumitra Paloi ◽  
Wuttichai Mhuantong ◽  
Janet Jennifer Luangsa-ard ◽  
Noppol Kobmoo

While recent sequencing technologies (third generation sequencing) can successfully sequence all copies of nuclear ribosomal DNA (rDNA) markers present within a genome and offer insights into the intragenomic variation of these markers, high intragenomic variation can be a source of confusion for high-throughput species identification using such technologies. High-throughput (HT) amplicon sequencing via PacBio SEQUEL I was used to evaluate the intragenomic variation of the ITS region and D1–D2 LSU domains in nine Cordyceps species, and the accuracy of such technology to identify these species based on molecular phylogenies was also assessed. PacBio sequences within strains showed variable level of intragenomic variation among the studied Cordyceps species with C. blackwelliae showing greater variation than the others. Some variants from a mix of species clustered together outside their respective species of origin, indicative of intragenomic variation that escaped concerted evolution shared between species. Proper selection of consensus sequences from HT amplicon sequencing is a challenge for interpretation of correct species identification. PacBio consensus sequences with the highest number of reads represent the major variants within a genome and gave the best results in terms of species identification.

2019 ◽  
Author(s):  
Michael Bradshaw ◽  
Felix Grewe ◽  
Anne Thomas ◽  
Cody H. Harrison ◽  
Hanna Lindgren ◽  
...  

Abstract Background: Regions within the nuclear ribosomal operon are a major tool for inferring evolutionary relationships and investigating diversity in fungi. In spite of the prevalent use of ribosomal markers in fungal research, central features of nuclear ribosomal DNA (nrDNA) evolution are poorly characterized for fungi in general, including lichenized fungi. The internal transcribed spacer (ITS) region of the nrDNA has been adopted as the primary DNA barcode identification marker for fungi. However, little is known about intragenomic variation in the nrDNA in symbiotic fungi. In order to better understand evolution of nrDNA and the utility of the ITS region for barcode identification of lichen-forming fungal species, we generated nearly complete nuclear ribosomal operon sequences from nine species in the Rhizoplaca melanophthalma species complex using short reads from high-throughput sequencing. Results: We estimated copy numbers for the nrDNA operon, ranging from nine to 48 copies for members of thiscomplex, and found low levels of intragenomic variation in the standard barcode region (ITS). Monophyly of currently described species in this complex was supported in phylogenetic inferences based on the ITS, 28S, IGS, and some intronic regions; however, a phylogenetic inference based on the 18S provided much lower resolution. Phylogenetic analysis of concatenated ITS and intergenic spacer sequence data generated from 496 specimens collected worldwide revealed previously unrecognized lineages in the nrDNA phylogeny. Conclusions: The results from our study support the general assumption that the ITS region of the nrDNA is an effective barcoding marker for fungi. For the R. melanophthalma group, the limited amount of potential intragenomic variability in the ITS region did not correspond to fixed diagnostic nucleotide position characters separating taxa within this species complex. Previously unrecognized lineages inferred from ITS sequence data may represent undescribed species-level lineages or reflect uncharacterized aspects of nrDNA evolution in the R. melanophthalma species complex.


2019 ◽  
Author(s):  
Michael Bradshaw ◽  
Felix Grewe ◽  
Anne Thomas ◽  
Cody H. Harrison ◽  
Hanna Lindgren ◽  
...  

Abstract Background Regions within the nuclear ribosomal operon are a major tool for inferring evolutionary relationships and investigating diversity in fungi. In spite of the prevalent use of ribosomal markers in fungal research, central features of nuclear ribosomal DNA (nrDNA) evolution are poorly characterized for fungi in general, including lichenized fungi. The internal transcribed spacer (ITS) region of the nrDNA has been adopted as the primary DNA barcode identification marker for fungi. However, little is known about intragenomic variation in the nrDNA in symbiotic fungi. In order to better understand evolution of nrDNA and the utility of the ITS region for barcode identification of lichen-forming fungal species, we generated nearly complete nuclear ribosomal operon sequences from approximate nine species in the Rhizoplaca melanophthalma species complex using short reads from high-throughput sequencing.Results We estimated copy numbers for the nrDNA operon, ranging from nine to 48 copies for members of this complex, and found low levels of intragenomic variation in the standard barcode region (ITS). Monophyly of currently described species in this complex was supported in phylogenetic reconstructions of the ITS, 28S, IGS, and some intronic regions; however, phylogenetic reconstructions based on the 18S provided much lower resolution. Phylogenetic analysis of concatenated ITS and intergenic spacer sequence data generated from 496 specimens collected worldwide revealed, previously unrecognized lineages in the nrDNA phylogeny.Conclusions The results from our study support the general assumption that the ITS region of the nrDNA is an effective barcoding marker for fungi. For the R. melanophthalma group, the limited number of potentially polymorphic sites generally do not correspond to fixed diagnostic nucleotide position characters separating taxa within this species complex. Previously unrecognized lineages inferred from ITS sequence data may represent undescribed species-level lineages or reflect uncharacterized aspects of nrDNA evolution.


2019 ◽  
Author(s):  
Søren M. Karst ◽  
Ryan M. Ziels ◽  
Rasmus H. Kirkegaard ◽  
Emil A. Sørensen ◽  
Daniel McDonald ◽  
...  

AbstractHigh-throughput amplicon sequencing of large genomic regions remains challenging for short-read technologies. Here, we report a high-throughput amplicon sequencing approach combining unique molecular identifiers (UMIs) with Oxford Nanopore Technologies or Pacific Biosciences CCS sequencing, yielding high accuracy single-molecule consensus sequences of large genomic regions. Our approach generates amplicon and genomic sequences of >10,000 bp in length with a mean error-rate of 0.0049-0.0006% and chimera rate <0.022%.


2019 ◽  
Author(s):  
Michael Bradshaw ◽  
Felix Grewe ◽  
Anne Thomas ◽  
Cody H. Harrison ◽  
Hanna Lindgren ◽  
...  

Abstract Background: Regions within the nuclear ribosomal operon are a major tool for inferring evolutionary relationships and investigating diversity in fungi. In spite of the prevalent use of ribosomal markers in fungal research, central features of nuclear ribosomal DNA (nrDNA) evolution are poorly characterized for fungi in general, including lichenized fungi. The internal transcribed spacer (ITS) region of the nrDNA has been adopted as the primary DNA barcode identification marker for fungi. However, little is known about intragenomic variation in the nrDNA in symbiotic fungi. In order to better understand evolution of nrDNA and the utility of the ITS region for barcode identification of lichen-forming fungal species, we generated nearly complete nuclear ribosomal operon sequences from nine species in the Rhizoplaca melanophthalma species complex using short reads from high-throughput sequencing. Results: We estimated copy numbers for the nrDNA operon, ranging from nine to 48 copies for members of thiscomplex, and found low levels of intragenomic variation in the standard barcode region (ITS). Monophyly of currently described species in this complex was supported in phylogenetic inferences based on the ITS, 28S, intergenic spacer region, and some intronic regions, independently; however, a phylogenetic inference based on the 18S provided much lower resolution. Phylogenetic analysis of concatenated ITS and intergenic spacer sequence data generated from 496 specimens collected worldwide revealed previously unrecognized lineages in the nrDNA phylogeny. Conclusions: The results from our study support the general assumption that the ITS region of the nrDNA is an effective barcoding marker for fungi. For the R. melanophthalma group, the limited amount of potential intragenomic variability in the ITS region did not correspond to fixed diagnostic nucleotide position characters separating taxa within this species complex. Previously unrecognized lineages inferred from ITS sequence data may represent undescribed species-level lineages or reflect uncharacterized aspects of nrDNA evolution in the R. melanophthalma species complex.


2017 ◽  
Author(s):  
Daniel W. Bellott ◽  
Ting-Jan Cho ◽  
Jennifer F. Hughes ◽  
Helen Skaletsky ◽  
David C. Page

AbstractReference sequence of structurally complex regions can only be obtained through highly accurate clone-based approaches. We and others have successfully employed Single-Haplotype Iterative Mapping and Sequencing (SHIMS 1.0) to assemble structurally complex regions across the sex chromosomes of several vertebrate species and in targeted improvements to the reference sequences of human autosomes. However, SHIMS 1.0 was expensive and time consuming, requiring the resources that only a genome center could command. Here we introduce SHIMS 2.0, an improved SHIMS protocol to allow even a small laboratory to generate high-quality reference sequence from complex genomic regions. Using a streamlined and parallelized library preparation protocol, and taking advantage of high-throughput, inexpensive, short-read sequencing technologies, a small group can sequence and assemble hundreds of clones in a week. Relative to SHIMS 1.0, SHIMS 2.0 reduces the cost and time required by two orders of magnitude, while preserving high sequencing accuracy.


2020 ◽  
Vol 20 (1) ◽  
Author(s):  
Michael Bradshaw ◽  
Felix Grewe ◽  
Anne Thomas ◽  
Cody H. Harrison ◽  
Hanna Lindgren ◽  
...  

Abstract Background Regions within the nuclear ribosomal operon are a major tool for inferring evolutionary relationships and investigating diversity in fungi. In spite of the prevalent use of ribosomal markers in fungal research, central features of nuclear ribosomal DNA (nrDNA) evolution are poorly characterized for fungi in general, including lichenized fungi. The internal transcribed spacer (ITS) region of the nrDNA has been adopted as the primary DNA barcode identification marker for fungi. However, little is known about intragenomic variation in the nrDNA in symbiotic fungi. In order to better understand evolution of nrDNA and the utility of the ITS region for barcode identification of lichen-forming fungal species, we generated nearly complete nuclear ribosomal operon sequences from nine species in the Rhizoplaca melanophthalma species complex using short reads from high-throughput sequencing. Results We estimated copy numbers for the nrDNA operon, ranging from nine to 48 copies for members of this complex, and found low levels of intragenomic variation in the standard barcode region (ITS). Monophyly of currently described species in this complex was supported in phylogenetic inferences based on the ITS, 28S, intergenic spacer region, and some intronic regions, independently; however, a phylogenetic inference based on the 18S provided much lower resolution. Phylogenetic analysis of concatenated ITS and intergenic spacer sequence data generated from 496 specimens collected worldwide revealed previously unrecognized lineages in the nrDNA phylogeny. Conclusions The results from our study support the general assumption that the ITS region of the nrDNA is an effective barcoding marker for fungi. For the R. melanophthalma group, the limited amount of potential intragenomic variability in the ITS region did not correspond to fixed diagnostic nucleotide position characters separating taxa within this species complex. Previously unrecognized lineages inferred from ITS sequence data may represent undescribed species-level lineages or reflect uncharacterized aspects of nrDNA evolution in the R. melanophthalma species complex.


2019 ◽  
Author(s):  
Michael Bradshaw ◽  
Felix Grewe ◽  
Anne Thomas ◽  
Cody H. Harrison ◽  
Hanna Lindgren ◽  
...  

Abstract Regions within the nuclear ribosomal operon are a major tool for inferring evolutionary relationships and investigating diversity in fungi. In spite of the prevalent use of ribosomal markers in fungal research, central features of nuclear ribosomal DNA (nrDNA) evolution are poorly characterized for fungi in general, including lichenized fungi. The internal transcribed spacer (ITS) region of the nrDNA has been adopted as the primary DNA barcode identification marker for fungi. However, little is known about intragenomic variation in the nrDNA in symbiotic fungi. In order to better understand evolution of nrDNA and the utility of the ITS region for barcode identification of lichen-forming fungal species, we generated nearly complete nuclear ribosomal operon sequences from nine species in the Rhizoplaca melanophthalma species complex using short reads from high-throughput sequencing. Results: We estimated copy numbers for the nrDNA operon, ranging from nine to 48 copies for members of thiscomplex, and found low levels of intragenomic variation in the standard barcode region (ITS). Monophyly of currently described species in this complex was supported in phylogenetic inferences based on the ITS, 28S, IGS, and some intronic regions; however, a phylogenetic inference based on the 18S provided much lower resolution. Phylogenetic analysis of concatenated ITS and intergenic spacer sequence data generated from 496 specimens collected worldwide revealed previously unrecognized lineages in the nrDNA phylogeny. Conclusions: The results from our study support the general assumption that the ITS region of the nrDNA is an effective barcoding marker for fungi. For the R. melanophthalma group, the limited amount of potential intragenomic variability in the ITS region did not correspond to fixed diagnostic nucleotide position characters separating taxa within this species complex. Previously unrecognized lineages inferred from ITS sequence data may represent undescribed species-level lineages or reflect uncharacterized aspects of nrDNA evolution in the R. melanophthalma species complex.


Author(s):  
Kristoffer Sahlin ◽  
Marisa Lim ◽  
Stefan Prost

Third generation sequencing technologies, such as Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio), have gained popularity over the last years. These platforms can generate millions of long read sequences. This is not only advantageous for genome sequencing projects, but also for amplicon-based high-throughput sequencing experiments, such as DNA barcoding. However, the relatively high error rates associated with these technologies still pose challenges for generating high quality consensus sequences. Here we present NGSpeciesID, a program which can generate highly accurate consensus sequences from long-read amplicon sequencing technologies, including ONT and PacBio. The tool includes clustering of the reads to help filter out contaminants or reads with high error rates and employs polishing strategies specific to the appropriate sequencing platform. We show that NGSpeciesID produces consensus sequences with improved usability by minimizing preprocessing and software installation and scalability by enabling rapid processing of hundreds to thousands of samples, while maintaining similar consensus accuracy as current pipelines


Fuels ◽  
2021 ◽  
Vol 2 (2) ◽  
pp. 241-252
Author(s):  
Dyah Asri Handayani Taroepratjeka ◽  
Tsuyoshi Imai ◽  
Prapaipid Chairattanamanokorn ◽  
Alissara Reungsang

Extreme halophiles offer the advantage to save on the costs of sterilization and water for biohydrogen production from lignocellulosic waste after the pretreatment process with their ability to withstand extreme salt concentrations. This study identifies the dominant hydrogen-producing genera and species among the acclimatized, extremely halotolerant microbial communities taken from two salt-damaged soil locations in Khon Kaen and one location from the salt evaporation pond in Samut Sakhon, Thailand. The microbial communities’ V3–V4 regions of 16srRNA were analyzed using high-throughput amplicon sequencing. A total of 345 operational taxonomic units were obtained and the high-throughput sequencing confirmed that Firmicutes was the dominant phyla of the three communities. Halanaerobium fermentans and Halanaerobacter lacunarum were the dominant hydrogen-producing species of the communities. Spatial proximity was not found to be a determining factor for similarities between these extremely halophilic microbial communities. Through the study of the microbial communities, strategies can be developed to increase biohydrogen molar yield.


Author(s):  
Stella C. Yuan ◽  
Eric Malekos ◽  
Melissa T. R. Hawkins

AbstractThe use of museum specimens held in natural history repositories for population and conservation genetic research is increasing in tandem with the use of massively parallel sequencing technologies. Short Tandem Repeats (STRs), or microsatellite loci, are commonly used genetic markers in wildlife and population genetic studies. However, they traditionally suffered from a host of issues including length homoplasy, high costs, low throughput, and difficulties in reproducibility across laboratories. Massively parallel sequencing technologies can address these problems, but the incorporation of museum specimen derived DNA suffers from significant fragmentation and exogenous DNA contamination. Combatting these issues requires extra measures of stringency in the lab and during data analysis, yet there have not been any high-throughput sequencing studies evaluating microsatellite allelic dropout from museum specimen extracted DNA. In this study, we evaluate genotyping errors derived from mammalian museum skin DNA extracts for previously characterized microsatellites across PCR replicates utilizing high-throughput sequencing. We found it useful to classify samples based on DNA concentration, which determined the rate by which genotypes were accurately recovered. Longer microsatellites performed worse in all museum specimens. Allelic dropout rates across loci were dependent on sample quantity, with high concentration museum specimens performing as well and recovering quality metrics nearly as high as the frozen tissue sample. Based on our results, we provide a set of best practices for quality assurance and incorporation of reliable genotypes from museum specimens.


Sign in / Sign up

Export Citation Format

Share Document