scholarly journals Assessment of metrics in next-generation sequencing experiments for use in core-genome multilocus sequence type

PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11842
Author(s):  
Yen-Yi Liu ◽  
Bo-Han Chen ◽  
Chih-Chieh Chen ◽  
Chien-Shun Chiou

With the reduction in the cost of next-generation sequencing, whole-genome sequencing (WGS)–based methods such as core-genome multilocus sequence type (cgMLST) have been widely used. However, gene-based methods are required to assemble raw reads to contigs, thus possibly introducing errors into assemblies. Because the robustness of cgMLST depends on the quality of assemblies, the results of WGS should be assessed (from sequencing to assembly). In this study, we investigated the robustness of different read lengths, read depths, and assemblers in recovering genes from reference genomes. Different combinations of read lengths and read depths were simulated from the complete genomes of three common food-borne pathogens: Escherichia coli, Listeria monocytogenes, and Salmonella enterica. We found that the quality of assemblies was mainly affected by read depth, irrespective of the assembler used. In addition, we suggest several cutoff values for future cgMLST experiments. Furthermore, we recommend the combinations of read lengths, read depths, and assemblers that can result in a higher cost/performance ratio for cgMLST.

2020 ◽  
Vol 66 (6) ◽  
pp. 832-841
Author(s):  
Young Kyu Min ◽  
Young Kee Lee ◽  
Seong-Hyeuk Nam ◽  
Jae Kyung Kim ◽  
Kyung Sun Park ◽  
...  

Abstract Background Because next-generation sequencing (NGS) for detecting somatic mutations has been adopted in clinical fields, both qualitative and quantitative QC of the somatic variants through whole coding regions detected by NGS is crucial. However, specific applications or guidelines, especially for quantitative QC, are currently insufficient. Our goal was to devise a practical approach for both quantitative and qualitative QC using an example of detecting clonal hematopoiesis of indeterminate potential (CHIP). Methods We applied the QC scheme using commercial reference materials and in-house QC materials (IQCM) composed of haplotype map and cancer cell lines for monitoring CHIP. Results This approach efficiently validated a customized CHIP NGS assay. Accuracy, analytical sensitivity, analytical specificity, qualitative precision (concordance), and limit of detection achieved were 99.87%, 98.53%, 100.00%, 100.00%, and 1.00%, respectively. The quantitative precision analysis also had a higher CV percentage at a lower alternative read depth (R2 = 0.749∼0.858). Use of IQCM ensured more than 100-fold reduction in the cost per run compared with that achieved using commercial reference materials. Conclusion Our approach determined the general analytical performance of NGS for detecting CHIP and recognized limitations such as lower precision at a lower level of variant burden. This approach could also be theoretically expanded to a general NGS assay for detecting somatic variants. Considering the reliable NGS results and cost-effectiveness, we propose the use of IQCM for QC of NGS assays at clinical laboratories.


2019 ◽  
Author(s):  
◽  
Morgan Gueuning

Wild bees are essential pollinators and therefore play a key role in both natural and agricultural ecosystems. However, bees have often been neglected in conservation studies and policies worldwide, which is surprising given their ecological importance. As a result, little is known on the conservation status of the vast majority of wild bee species in Europe, and even less worldwide. Limited surveys suggest important declines in the abundance and diversity of most wild bee communities worldwide. It is therefore urgent to implement targeted measures for the conservation of these keystone species. Once implemented, the effectiveness of these measures must be evaluated using adequate monitoring programs. To date, wild bee surveys are entirely based on morphological identification, which is both labor intensive and time consuming. Consequently, an affordable, high-throughput identification method is needed to reduce costs and improve bee monitoring. The objective of this thesis was to evaluate novel genetic techniques based on Next Generation Sequencing (NGS) methods for facilitating surveys of wild bees. NGS tools were mainly investigated for bridging two important impediments to wild bee conservation efforts, i.e., the cost of biodiversity assessment schemes and taxonomic incompleteness. With the development of NGS techniques, DNA barcoding has gained enormous momentum, enabling cost-effective, fast and accurate identifications. Before these methods can be routinely used in monitoring programs, there are however still important knowledge gaps to fill. These gaps mainly concern the detection of rare species and the acquisition of accurate quantitative data on species abundance; more generally the cost and labour effectiveness of these methods need to be evaluated. To provide a comprehensive presentation of the advantages and weaknesses of different NGS-based identification methods, we assessed three of the most promising ones, namely metabarcoding, mitogenomics and NGS barcoding. Using a regular monitoring data, we found that NGS barcoding performed best for both species’ presence/absence and abundance data, producing only few false positives and no false negatives. The other methods investigated were less reliable in term of species detection and inference of abundance data, and partly led to erroneous ecological conclusions. In terms of workload and cost, we showed that NGS techniques were more expensive than morphological identification with our dataset, although these techniques would become slightly more economical in large-scale monitoring programs. A second aim of this thesis was to provide an easy and robust genomic solution to alleviate taxonomical incompleteness, one of the major impediments to the effective conservation of many insect taxa. For conservation purposes, having stable and well-delimited species hypotheses is essential. Currently, most species are delimitated based on morphology and/or DNA barcoding. These methods are however associated with important limitations, and it is widely accepted that species delimitation should rely on multi-locus genomic markers. To overcome these limitations, ultraconserved elements (UCEs) were tested as a fast and robust approach using different species-complexes harbouring cryptic diversity, mitochondrial introgression, or mitochondrial paraphyly. Phylogenetic analyses of UCEs were highly conclusive and yielded meaningful species delimitation hypotheses in all cases. These results provide strong evidence for the potential of UCEs as a fast method for delimiting species even in cases of recently diverged lineages. Advantages and limitations of UCEs for shallow phylogenetic studies are further discussed.


2017 ◽  
Vol 7 (1) ◽  
Author(s):  
Mike J. Wilkinson ◽  
Claudia Szabo ◽  
Caroline S. Ford ◽  
Yuval Yarom ◽  
Adam E. Croxford ◽  
...  

2020 ◽  
Vol 79 (2) ◽  
pp. 105-113
Author(s):  
Abdul Bari Muneera Parveen ◽  
Divya Lakshmanan ◽  
Modhumita Ghosh Dasgupta

The advent of next-generation sequencing has facilitated large-scale discovery and mapping of genomic variants for high-throughput genotyping. Several research groups working in tree species are presently employing next generation sequencing (NGS) platforms for marker discovery, since it is a cost effective and time saving strategy. However, most trees lack a chromosome level genome map and validation of variants for downstream application becomes obligatory. The cost associated with identifying potential variants from the enormous amount of sequence data is a major limitation. In the present study, high resolution melting (HRM) analysis was optimized for rapid validation of single nucleotide polymorphisms (SNPs), insertions or deletions (InDels) and simple sequence repeats (SSRs) predicted from exome sequencing of parents and hybrids of Eucalyptus tereticornis Sm. ? Eucalyptus grandis Hill ex Maiden generated from controlled hybridization. The cost per data point was less than 0.5 USD, providing great flexibility in terms of cost and sensitivity, when compared to other validation methods. The sensitivity of this technology in variant detection can be extended to other applications including Bar-HRM for species authentication and TILLING for detection of mutants.


2010 ◽  
Vol 76 (12) ◽  
pp. 3863-3868 ◽  
Author(s):  
J. Kirk Harris ◽  
Jason W. Sahl ◽  
Todd A. Castoe ◽  
Brandie D. Wagner ◽  
David D. Pollock ◽  
...  

ABSTRACT Constructing mixtures of tagged or bar-coded DNAs for sequencing is an important requirement for the efficient use of next-generation sequencers in applications where limited sequence data are required per sample. There are many applications in which next-generation sequencing can be used effectively to sequence large mixed samples; an example is the characterization of microbial communities where ≤1,000 sequences per samples are adequate to address research questions. Thus, it is possible to examine hundreds to thousands of samples per run on massively parallel next-generation sequencers. However, the cost savings for efficient utilization of sequence capacity is realized only if the production and management costs associated with construction of multiplex pools are also scalable. One critical step in multiplex pool construction is the normalization process, whereby equimolar amounts of each amplicon are mixed. Here we compare three approaches (spectroscopy, size-restricted spectroscopy, and quantitative binding) for normalization of large, multiplex amplicon pools for performance and efficiency. We found that the quantitative binding approach was superior and represents an efficient scalable process for construction of very large, multiplex pools with hundreds and perhaps thousands of individual amplicons included. We demonstrate the increased sequence diversity identified with higher throughput. Massively parallel sequencing can dramatically accelerate microbial ecology studies by allowing appropriate replication of sequence acquisition to account for temporal and spatial variations. Further, population studies to examine genetic variation, which require even lower levels of sequencing, should be possible where thousands of individual bar-coded amplicons are examined in parallel.


PLoS ONE ◽  
2016 ◽  
Vol 11 (2) ◽  
pp. e0149393 ◽  
Author(s):  
Kendrick B. Turner ◽  
Jennifer Naciri ◽  
Jinny L. Liu ◽  
George P. Anderson ◽  
Ellen R. Goldman ◽  
...  

2017 ◽  
Vol 55 (2) ◽  
pp. 114-121 ◽  
Author(s):  
Jamie M Ellingford ◽  
Bradley Horn ◽  
Christopher Campbell ◽  
Gavin Arno ◽  
Stephanie Barton ◽  
...  

BackgroundDiagnostic use of gene panel next-generation sequencing (NGS) techniques is commonplace for individuals with inherited retinal dystrophies (IRDs), a highly genetically heterogeneous group of disorders. However, these techniques have often failed to capture the complete spectrum of genomic variation causing IRD, including CNVs. This study assessed the applicability of introducing CNV surveillance into first-tier diagnostic gene panel NGS services for IRD.MethodsThree read-depth algorithms were applied to gene panel NGS data sets for 550 referred individuals, and informatics strategies used for quality assurance and CNV filtering. CNV events were confirmed and reported to referring clinicians through an accredited diagnostic laboratory.ResultsWe confirmed the presence of 33 deletions and 11 duplications, determining these findings to contribute to the confirmed or provisional molecular diagnosis of IRD for 25 individuals. We show that at least 7% of individuals referred for diagnostic testing for IRD have a CNV within genes relevant to their clinical diagnosis, and determined a positive predictive value of 79% for the employed CNV filtering techniques.ConclusionIncorporation of CNV analysis increases diagnostic yield of gene panel NGS diagnostic tests for IRD, increases clarity in diagnostic reporting and expands the spectrum of known disease-causing mutations.


2021 ◽  
Vol 8 (Supplement_1) ◽  
pp. S436-S436
Author(s):  
Rachel Downey Quick ◽  
Kelli A Martinez ◽  
Susan M Russo ◽  
Sarah E McGwier ◽  
Rachel A Quirt ◽  
...  

Abstract Background Pathogen testing using next-generation sequencing of microbial cell-free DNA (NGS cfDNA) is a promising diagnostic tool to identify pathogens that might not be detected using conventional lab evaluation. Considering the cost of this test, it is important to determine when it is most useful to the plan of care (POC). Figure 1. Unit of admission among cases Figure 2. Patient characteristics in cases determined to be valuable and not valuable to the plan of care (POC) Methods In this retrospective study, we collected data from the medical charts of 50 consecutive NGS cfDNA tests in a free-standing children’s hospital. We evaluated patients for demographics, underlying conditions, diagnosis at time of testing, conventional laboratory testing and timing, medical treatment, and NGS cfDNA test results for clinical relevance or false negative results compared to conventional testing. The primary goal was to identify patients for whom the NGS cfDNA testing affected the POC. Charts were reviewed, and determinations regarding whether the result influenced the POC were confirmed by a provider. Results We were unable to differentiate patients with clinically valuable NGS cfDNA results (Fig 1 & 2). Among those with NGS cfDNA results valuable to the POC (n=22), both negative and positive testing guided POC (13 valuable negative vs. 9 diagnostic cases). In the total sample, 5 cases (10%) had a clinically relevant pathogen identified through conventional testing, but not through NGS cfDNA and 2 cases had antimicrobial resistance on culture, which is not detected by NGS cfDNA. Conclusion While we did not find a specific clinical profile for NGS cfDNA use, positive results were essential to the diagnosis in 18% of cases with otherwise negative laboratory evaluation for the pathogen identified in NGS cfDNA. Negative tests affected the POC in 26% of cases by avoiding unnecessary antimicrobials in high risk immunocompromised patients and patients that presented with low-risk of infection, but unclear disease process. Caution must be exercised with reliance on this test with respect to antimicrobial resistance and risk of false negative results. Disclosures All Authors: No reported disclosures


Sign in / Sign up

Export Citation Format

Share Document