Environmental DNA analysis shows high potential as a tool for estimating intraspecific genetic diversity in a wild fish population

Estimating intraspecific genetic diversity from community DNA metabarcoding data

PeerJ ◽

10.7717/peerj.4644 ◽

2018 ◽

Vol 6 ◽

pp. e4644 ◽

Cited By ~ 51

Author(s):

Vasco Elbrecht ◽

Ecaterina Edith Vamos ◽

Dirk Steinke ◽

Florian Leese

Keyword(s):

Genetic Diversity ◽

Large Scale ◽

High Throughput Sequencing ◽

Intraspecific Diversity ◽

Great Promise ◽

Mock Community ◽

Dna Metabarcoding ◽

Intraspecific Genetic Diversity ◽

Primer Sets ◽

Haplotype Information

BackgroundDNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high-throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTUs), losing information on intraspecific diversity in the process. While Cytochrome c oxidase subunit I (COI) haplotype information is limited in resolving intraspecific diversity it is nevertheless often useful e.g. in a phylogeographic context, helping to formulate hypotheses on taxon distribution and dispersal.MethodsThis study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotype information from freshwater macroinvertebrate metabarcoding datasets. This novel approach was added to the R package “JAMP” and can be applied to COI amplicon datasets. We tested our haplotyping method by sequencing (i) a single-species mock community composed of 31 individuals with 15 different haplotypes spanning three orders of magnitude in biomass and (ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates.ResultsWe detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177–200 OTUs, each containing an average of 2.40–3.30 haplotypes per OTU. The derived intraspecific diversity data showed population structures that were consistent between replicates and similar between primer pairs but resolution depended on the primer length. A closer look at abundant taxa in the dataset revealed various population genetic patterns, e.g. the stoneflyTaeniopteryx nebulosaand the caddisflyHydropsyche pellucidulashowed a distinct north–south cline with respect to haplotype distribution, while the beetleOulimnius tuberculatusand the isopodAsellus aquaticusdisplayed no clear population pattern but differed in genetic diversity.DiscussionWe developed a strategy to infer intraspecific genetic diversity from bulk invertebrate metabarcoding data. It needs to be stressed that at this point this metabarcoding-informed haplotyping is not capable of capturing the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding datasets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about species diversity but also underlying genetic diversity.

Download Full-text

Evaluating intraspecific genetic diversity of a fish population using environmental DNA: An approach to distinguish true haplotypes from erroneous sequences

10.1101/429993 ◽

2018 ◽

Cited By ~ 1

Author(s):

Satsuki Tsuji ◽

Masaki Miya ◽

Masayuki Ushio ◽

Hirotoshi Sato ◽

Toshifumi Minamoto ◽

...

Keyword(s):

Genetic Diversity ◽

False Positive ◽

High Throughput Sequencing ◽

Environmental Dna ◽

Detection Accuracy ◽

Sequence Variant ◽

Plecoglossus Altivelis ◽

Detection Rates ◽

Loop Region ◽

Intraspecific Genetic Diversity

AbstractRecent advances in environmental DNA (eDNA) analysis using high-throughput sequencing (HTS) provide a non-invasive way to evaluate the intraspecific genetic diversity of aquatic macroorganisms. However, erroneous sequences present in HTS data can result in false positive haplotypes; therefore, reliable strategies are necessary to eliminate such erroneous sequences when evaluating intraspecific genetic diversity using eDNA metabarcoding. In this study, we propose an approach combining denoising using amplicon sequence variant (ASV) method and the removal of haplotypes with low detection rates. A mixture of rearing water of Ayu (Plecoglossus altivelis altivelis) was used as an eDNA sample. In total, nine haplotypes of Ayu mitochondrial D-loop region were contained in the sample and amplified by two-step tailed PCR. The 15 PCR replicates indexed different tags were prepared from the eDNA sample to compare the detection rates between true haplotypes and false positive haplotypes. All PCR replications were sequenced by HTS, and the total number of detected true haplotypes and false positive haplotypes were compared with and without denoising using the two types of ASV methods, Divisive Amplicon Denoising Algorithm 2 (DADA2) and UNOISE3. The use of both ASV methods considerably reduced the number of false positive haplotypes. Moreover, all true haplotypes were detected in all 15 PCR, whereas false positive haplotypes had detection rates varying from 1/15 to 15/15. Thus, by removing haplotypes with lower detection rates than 15/15, the number of false positive haplotypes were further reduced. The approach proposed in this study successfully eliminated most of false positive haplotypes in the HTS data obtained from eDNA samples, which allowed us to improve the detection accuracy for evaluating intraspecific genetic diversity using eDNA analysis.

Download Full-text

Quantitative evaluation of intraspecific genetic diversity in a natural fish population using environmental DNA analysis

Molecular Ecology Resources ◽

10.1111/1755-0998.13200 ◽

2020 ◽

Vol 20 (5) ◽

pp. 1323-1332 ◽

Cited By ~ 2

Author(s):

Satsuki Tsuji ◽

Naoki Shibata ◽

Hayato Sawada ◽

Masayuki Ushio

Keyword(s):

Genetic Diversity ◽

Quantitative Evaluation ◽

Dna Analysis ◽

Environmental Dna ◽

Fish Population ◽

Intraspecific Genetic Diversity

Download Full-text

Environmental DNA analysis shows high potential as a tool for estimating intraspecific genetic diversity in a wild fish population

Molecular Ecology Resources ◽

10.1111/1755-0998.13165 ◽

2020 ◽

Vol 20 (5) ◽

pp. 1248-1258 ◽

Cited By ~ 2

Author(s):

Satsuki Tsuji ◽

Atsushi Maruyama ◽

Masaki Miya ◽

Masayuki Ushio ◽

Hirotoshi Sato ◽

...

Keyword(s):

Genetic Diversity ◽

Dna Analysis ◽

Environmental Dna ◽

Fish Population ◽

High Potential ◽

Wild Fish ◽

Intraspecific Genetic Diversity

Download Full-text

Estimating intraspecific genetic diversity from community DNA metabarcoding data

10.7287/peerj.preprints.3269v4 ◽

2018 ◽

Author(s):

Vasco Elbrecht ◽

Ecaterina Edith Vamos ◽

Dirk Steinke ◽

Florian Leese

Keyword(s):

Genetic Diversity ◽

Biological Diversity ◽

Intraspecific Diversity ◽

Great Promise ◽

Data Sets ◽

Mock Community ◽

Data Set ◽

Dna Metabarcoding ◽

Intraspecific Genetic Diversity ◽

Primer Sets

Background. DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTU), losing information on intraspecific diversity in the process. While COI haplotype information is limited in resolution, it is nevertheless useful in a phylogeographic context, helping to formulate hypothesis on taxon dispersal. Methods. This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotypes from freshwater macroinvertebrate metabarcoding data sets. This novel approach was added to the R package "JAMP" and can be applied to Cytochrome c oxidase subunit I (COI) amplicon datasets. We tested our haplotyping method by sequencing i) a single-species mock community composed of 31 individuals with different haplotypes spanning three orders of magnitude in biomass and ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results. We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177 - 200 OTUs, each containing an average of 2.40 to 3.30 haplotypes per OTU. Population structures were consistent between replicates, and similar between primer pairs, depending on the primer length. A closer look at abundant taxa in the data set revealed various population genetic patterns, e.g. Taeniopteryx nebulosa and Hydropsyche pellucidula with a difference in north-south haplotype distribution, while Oulimnius tuberculatus and Asellus aquaticus display no clear population pattern but differ in genetic diversity. Discussion. We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate monitoring samples using metabarcoding data. It needs to be stressed that at this point metabarcoding-informed haplotyping is not capable of capture the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and potential taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding data sets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about biological diversity but also underlying genetic diversity.

Download Full-text

Estimating intraspecific genetic diversity from community DNA metabarcoding data

10.7287/peerj.preprints.3269v3 ◽

2018 ◽

Author(s):

Vasco Elbrecht ◽

Ecaterina Edith Vamos ◽

Dirk Steinke ◽

Florian Leese

Keyword(s):

Genetic Diversity ◽

Biological Diversity ◽

Intraspecific Diversity ◽

Great Promise ◽

Data Sets ◽

Mock Community ◽

Data Set ◽

Dna Metabarcoding ◽

Intraspecific Genetic Diversity ◽

Primer Sets

Background. DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTU), losing information on intraspecific diversity in the process. While COI haplotype information is limited in resolution, it is nevertheless useful in a phylogeographic context, helping to formulate hypothesis on taxon dispersal. Methods. This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotypes from freshwater macroinvertebrate metabarcoding data sets. This novel approach was added to the R package "JAMP" and can be applied to Cytochrome c oxidase subunit I (COI) amplicon datasets. We tested our haplotyping method by sequencing i) a single-species mock community composed of 31 individuals with different haplotypes spanning three orders of magnitude in biomass and ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results. We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177 - 200 OTUs, each containing an average of 2.40 to 3.30 haplotypes per OTU. Population structures were consistent between replicates, and similar between primer pairs, depending on the primer length. A closer look at abundant taxa in the data set revealed various population genetic patterns, e.g. Taeniopteryx nebulosa and Hydropsyche pellucidula with a difference in north-south haplotype distribution, while Oulimnius tuberculatus and Asellus aquaticus display no clear population pattern but differ in genetic diversity. Discussion. We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate monitoring samples using metabarcoding data. It needs to be stressed that at this point metabarcoding-informed haplotyping is not capable of capture the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and potential taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding data sets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about biological diversity but also underlying genetic diversity.

Download Full-text

Intra-genomic rDNA gene variability of Nassellaria and Spumellaria (Rhizaria, Radiolaria) assessed by Sanger, MinION and Illumina sequencing

10.1101/2021.10.05.463214 ◽

2021 ◽

Author(s):

Miguel Mendez Sandin ◽

Sarah Romac ◽

Fabrice Not

Keyword(s):

Large Scale ◽

High Throughput Sequencing ◽

Phylogenetic Reconstruction ◽

Environmental Dna ◽

Genomic Diversity ◽

Genomic Variability ◽

Rdna Gene ◽

Sequencing Errors ◽

Sequencing Platforms ◽

Gene Variability

Ribosomal DNA (rDNA) genes are known to be valuable markers for the barcoding of eukaryotic life and its phylogenetic classification at various taxonomic levels. The large scale exploration of environmental microbial diversity through metabarcoding approaches have been focused mainly on the hypervariable regions V4 and V9 of the 18S rDNA gene. Yet, the accurate interpretation of such environmental surveys is hampered by technical (e.g., PCR and sequencing errors) and biological biases (e.g., intra-genomic variability). Here we explored the intra-genomic diversity of Nassellaria and Spumellaria specimens (Radiolaria) by comparing Sanger sequencing with two different high-throughput sequencing platforms: Illumina and Oxford Nanopore Technologies (MinION). Our analysis determined that intra-genomic variability of Nassellaria and Spumellaria is generally low, yet in some Spumellaria specimens we found two different copies of the V4 with a similarity lower than 97%. From the different sequencing methods, Illumina showed the highest number of contaminations (i.e., environmental DNA, cross-contamination, tag-jumping), revealed by its high sequencing depth; and Minion showed the highest sequencing rate error (~14%). Yet the long reads produced by MinION (~2900 bp) allowed accurate phylogenetic reconstruction studies. These results, highlight the requirement for a careful interpretation of Illumina based metabarcoding studies, in particular regarding low abundant amplicons, and open future perspectives towards full environmental rDNA metabarcoding surveys.

Download Full-text

Evaluating intraspecific genetic diversity using environmental DNA and denoising approach: A case study using tank water

Environmental DNA ◽

10.1002/edn3.44 ◽

2019 ◽

Vol 2 (1) ◽

pp. 42-52 ◽

Cited By ~ 10

Author(s):

Satsuki Tsuji ◽

Masaki Miya ◽

Masayuki Ushio ◽

Hirotoshi Sato ◽

Toshifumi Minamoto ◽

...

Keyword(s):

Genetic Diversity ◽

Environmental Dna ◽

Intraspecific Genetic Diversity ◽

Tank Water

Download Full-text

Genetic diversity and molecular phylogeography of Chinese domestic goats by large-scale mitochondrial DNA analysis

Molecular Biology Reports ◽

10.1007/s11033-014-3234-2 ◽

2014 ◽

Vol 41 (6) ◽

pp. 3695-3704 ◽

Cited By ~ 14

Author(s):

Yongju Zhao ◽

Runze Zhao ◽

Zhongquan Zhao ◽

Huizhong Xu ◽

Erhu Zhao ◽

...

Keyword(s):

Genetic Diversity ◽

Mitochondrial Dna ◽

Large Scale ◽

Dna Analysis ◽

Molecular Phylogeography ◽

Mitochondrial Dna Analysis

Download Full-text

Assessing intraspecific genetic diversity from community DNA metabarcoding data

10.7287/peerj.preprints.3269v2 ◽

2018 ◽

Author(s):

Vasco Elbrecht ◽

Ecaterina Edith Vamos ◽

Dirk Steinke ◽

Florian Leese

Keyword(s):

Genetic Diversity ◽

Biological Diversity ◽

Intraspecific Diversity ◽

Great Promise ◽

Data Sets ◽

Mock Community ◽

Data Set ◽

Dna Metabarcoding ◽

Intraspecific Genetic Diversity ◽

Primer Sets

Background. DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTU), loosing information on intraspecific diversity in the process. Methods. This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance based filtering to extract haplotypes from freshwater macroinvertebrate metabarcoding data sets. This novel approach is implemented in the R package "JAMP" and can be applied to Cytochrome c oxidase subunit I (COI) amplicon datasets. We tested our haplotyping method by sequencing i) a single-species mock community composed of 31 individuals with different haplotypes spanning three orders of magnitude in biomass and ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results. We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177 - 200 OTUs, each containing an average of 2.40 to 3.30 haplotypes per OTU. Population structures were consistent between replicates, and similar between primer pairs, depending on the primer length. A closer look at abundant taxa in the data set revealed various population genetic patterns, e.g. Taeniopteryx nebulosa and Hydropsyche pellucidula with a difference in north-south haplotype distribution, while Oulimnius tuberculatus and Asellus aquaticus display no clear population pattern but differ in genetic diversity. Discussion. We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate samples using metabarcoding data. It needs to be stressed that at this point metabarcoding-informed haplotyping is not capable to capture the full diversity present in bulk samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity is recovered, identifying potentially isolated populations and potential taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding data sets to fully take advantage our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about biological diversity but also underlying genetic diversity.

Download Full-text