scholarly journals Comparison of Illumina MiSeq and the Ion Torrent PGM and S5 platforms for whole-genome sequencing of picornaviruses and caliciviruses

2019 ◽  
Author(s):  
Rachel L. Marine ◽  
Laura C. Magaña ◽  
Christina J. Castro ◽  
Kun Zhao ◽  
Anna M. Montmayeur ◽  
...  

ABSTRACTNext-generation sequencing is a powerful tool for virological surveillance. While Illumina® and Ion Torrent® sequencing platforms are used extensively for generating viral RNA genome sequences, there is limited data comparing different platforms. We evaluated the Illumina MiSeq, Ion Torrent PGM and Ion Torrent S5 platforms using a panel of sixteen specimens containing picornaviruses and human caliciviruses (noroviruses and sapoviruses). The specimens were processed, using combinations of three library preparation and five sequencing kits, to assess the quality and completeness of assembled viral genomes, and an estimation of cost per sample to generate the data was calculated. The choice of library preparation kit and sequencing platform was found to impact the breadth of genome coverage and accuracy of consensus viral genomes. The Ion Torrent S5 outperformed the older Ion Torrent PGM platform in data quality and cost, and generated the highest proportion of reads for enterovirus D68 samples. However, indels at homopolymer regions impacted the accuracy of consensus genome sequences. For lower throughput sequencing runs (i.e., Ion Torrent 510 or Illumina MiSeq Nano V2), the cost per sample was lower on the MiSeq platform, whereas with higher throughput runs (Ion Torrent 530 or Illumina MiSeq V2) the cost per sample was comparable. These findings suggest that the Ion Torrent S5 and Illumina MiSeq platforms are both viable options for genomic sequencing of RNA viruses, each with specific advantages and tradeoffs.

Viruses ◽  
2021 ◽  
Vol 13 (10) ◽  
pp. 2093
Author(s):  
Shen-Yuan Hsieh ◽  
Mohammad A. Tariq ◽  
Andrea Telatin ◽  
Rebecca Ansorge ◽  
Evelien M. Adriaenssens ◽  
...  

The human intestinal microbiota is abundant in viruses, comprising mainly bacteriophages, occasionally outnumbering bacteria 10:1 and is termed the virome. Due to their high genetic diversity and the lack of suitable tools and reference databases, the virome remains poorly characterised and is often referred to as “viral dark matter”. However, the choice of sequencing platforms, read lengths and library preparation make study design challenging with respect to the virome. Here we have compared the use of PCR and PCR-free methods for sequence-library construction on the Illumina sequencing platform for characterising the human faecal virome. Viral DNA was extracted from faecal samples of three healthy donors and sequenced. Our analysis shows that most variation was reflecting the individually specific faecal virome. However, we observed differences between PCR and PCR-free library preparation that affected the recovery of low-abundance viral genomes. Using three faecal samples in this study, the PCR library preparation samples led to a loss of lower-abundance vOTUs evident in their PCR-free pairs (vOTUs 128, 6202 and 8364) and decreased the alpha-diversity indices (Chao1 p-value = 0.045 and Simpson p-value = 0.044). Thus, differences between PCR and PCR-free methods are important to consider when investigating “rare” members of the gut virome, with these biases likely negligible when investigating moderately and highly abundant viruses.


Author(s):  
Champika Fernando ◽  
Janet E. Hill

Abstract This protocol can be applied to determine the composition of a microbial community. The cpn60 gene (also known as groEL, hsp60) is present in almost all bacteria and a 552-558 bp region of the gene has been established as a barcode for species level identification of bacteria. The primer cocktail used in this protocol amplifies cpn60 barcode sequences from bacteria with a wide range of G+C content. Some species of Mycoplasma lack the cpn60 gene and therefore this method is not recommended to detect Mycoplasma. DNA sequences generated from this method could be compared to cpnDB, a public database of cpn60 sequences, for identification. Library preparation involves cpn60 amplicon generation, PCR clean-up, index PCR, index PCR clean-up, library quantification, normalization, pooling, library denaturation and loading. Time taken to complete depends on the number of samples included. If using 96 samples, the procedure takes 8 hours but there are several stages where the samples could be stored and continued the next day. Specific instructions are provided for the Illumina MiSeq platform, but the protocol could easily be adapted for other sequencing platforms.


2020 ◽  
Author(s):  
Xiaohuan Sun ◽  
Jingjing Wang ◽  
Chao Fang ◽  
Jiguang Li ◽  
Mo Han ◽  
...  

ABSTRACTMetabarcoding has become the de facto method for characterizing the structure of microbial communities in complex environmental samples. To determine how sequencing platform may influence microbial community characterization, we present a large-scale comparison of two sequencing platforms; Illumina MiSeq and a new platform DNBSEQ-G400 developed by MGI Tech. The accuracy of DNBSEQ-G400 on bacterial and fungal mock samples and compared sequencing consistency and precision between DNBSEQ-G400 and MiSeq platforms by sequencing the fungal ITS2 region from 1144 soil samples with 3 technical replicates. The DNBSEQ-G400 showed a high accuracy in reproducing mock communities containing different proportions of bacteria and fungi, respectively. The taxonomic profiles of the 1144 soil samples generated by the two DNBSEQ-G400 modes closely resembled each other and were highly correlated with those generated by the MiSeq platform. Analyses of technical replicates demonstrated a run bias against certain taxa on the MiSeq but not DNBSEQ-G400 platform. Based on lower cost, greater capacity, and less bias, we conclude that DNBSEQ-G400 is an optimal platform for short-term metabarcoding of microbial communities.IMPORTANCEExperimental steps that generate sequencing bias during amplicon sequencing have been intensively evaluated, including the choice of primer pair, polymerase, PCR cycle and technical replication. However, few studies have assessed the accuracy and precision of different sequencing platforms. Here, we compared the performance of newly released DNBSEQ-G400 sequencer with that of the commonly used Illumina MiSeq platform by leveraging amplicon sequencing of a large number of soil samples. Significant sequencing bias among major fungal genera was found in parallel MiSeq runs, which can be easily neglected without the use of sequencing controls. We emphasize the importance of technical controls in large-scale sequencing efforts and provide DNBSEQ-G400 as an alternative with increased sequencing capacity and more stable reproducibility for amplicon sequencing.


2018 ◽  
Author(s):  
Thomas W. A. Braukmann ◽  
Natalia V. Ivanova ◽  
Sean W. J. Prosser ◽  
Vasco Elbrecht ◽  
Dirk Steinke ◽  
...  

AbstractDNA metabarcoding is an attractive approach for monitoring biodiversity. However, it is subject to biases that often impede detection of all species in a sample. In particular, the proportion of sequences recovered from each species depends on its biomass, mitome copy number, and primer set employed for PCR. To examine these variables, we constructed a mock community of terrestrial arthropods comprised of 374 BINs, a species proxy. We used this community to examine how species recovery was impacted when amplicon pools were constructed in four ways. The first two protocols involved the construction of bulk DNA extracts from different body partitions (Bulk Abdomen, Bulk Leg). The other protocols involved the production of DNA extracts from single legs which were then merged prior to PCR (Composite Leg) or PCR-amplified separately (Single Leg) and then pooled. The amplicon generated by these four treatments were then sequenced on three platforms (Illumina MiSeq, Ion Torrent PGM and Ion Torrent S5). The choice of sequencing platform did not substantially influence species recovery, other variables did. As expected, the best recovery was obtained from the Single Leg treatment, but the Bulk Abdomen produced a more uniform read abundance than the Bulk Leg or Composite Leg samples. Primer choice also influenced species recovery. Our results reveal how variation in protocols can have substantive impacts on perceived diversity unless sequencing coverage is sufficient to reach an asymptote. Although metabarcoding is a powerful approach, further optimization of analytical protocols is crucial to obtain reproducible results and increase its cost-effectiveness.


2020 ◽  
Vol 7 (1) ◽  
Author(s):  
Kira Moon ◽  
Suhyun Kim ◽  
Ilnam Kang ◽  
Jang-Cheon Cho

Abstract A high number of viral metagenomes have revealed countless genomes of putative bacteriophages that have not yet been identified due to limitations in bacteriophage cultures. However, most virome studies have been focused on marine or gut environments, thereby leaving the viral community structure of freshwater lakes unclear. Because the lakes located around the globe have independent ecosystems with unique characteristics, viral community structures are also distinctive but comparable. Here, we present data on viral metagenomes that were seasonally collected at a depth of 1 m from Lake Soyang, the largest freshwater reservoir in South Korea. Through shotgun metagenome sequencing using the Illumina MiSeq platform, 3.08 to 5.54-Gbps of reads per virome were obtained. To predict the viral genome sequences within Lake Soyang, contigs were constructed and 648 to 1,004 putative viral contigs were obtained per sample. We expect that both viral metagenome reads and viral contigs would contribute in comparing and understanding of viral communities among different freshwater lakes depending on seasonal changes.


2020 ◽  
Vol 280 ◽  
pp. 113865 ◽  
Author(s):  
Rachel L. Marine ◽  
Laura C. Magaña ◽  
Christina J. Castro ◽  
Kun Zhao ◽  
Anna M. Montmayeur ◽  
...  

2018 ◽  
Author(s):  
Madeline Y Mayday ◽  
Lillian M Khan ◽  
Eric D Chow ◽  
Matt S Zinter ◽  
Joseph L DeRisi

AbstractPreparation of high-quality sequencing libraries is a costly and time-consuming component of metagenomic next generation sequencing (mNGS). While the overall cost of sequencing has dropped significantly over recent years, the reagents needed to prepare sequencing samples are likely to become the dominant expense in the process. Furthermore, libraries prepared by hand are subject to human variability and needless waste due to limitations of manual pipetting volumes. Reduction of reaction volumes, combined with sub-microliter automated dispensing of reagents without consumable pipette tips, has the potential to provide significant advantages. Here, we describe the integration of several instruments, including the Labcyte Echo 525 acoustic liquid handler and the iSeq and NovaSeq Illumina sequencing platforms, to miniaturize and automate mNGS library preparation, significantly reducing the cost and the time required to prepare samples. Through the use of External RNA Controls Consortium (ERCC) spike-in RNAs, we demonstrated the fidelity of the miniaturized preparation to be equivalent to full volume reactions. Furthermore, detection of viral and microbial species from cell culture and patient samples was also maintained in the miniaturized libraries. For 384-well mNGS library preparations, we achieved a savings of over 80% in materials and reagents alone, and reduced preparation time by 90% compared to manual approaches, without compromising quality or representation within the library.


Author(s):  
Elizabeth M. Batty ◽  
Theerarat Kochakarn ◽  
Arporn Wangwiwatsin ◽  
Khajohn Joonlasak ◽  
Angkana T. Huang ◽  
...  

AbstractGenomic surveillance has a key role in tracking the ongoing COVID-19 pandemic, but information on how different sequencing library preparation approaches affect the data produced are lacking. We compared three library preparation methods using both tagmentation (Nextera XT and Nextera Flex) and ligation-based (KAPA HyperPrep) approaches on both positive and negative samples to provide insights into any methodological differences between the methods, and validate their use in SARS-CoV-2 amplicon sequencing. We show that all three library preparation methods allow us to recover near-complete SARS-CoV-2 genomes with identical SNP calls. The Nextera Flex and KAPA library preparation methods gave better coverage than libraries prepared with Nextera XT, which required more reads to call the same number of genomic positions. The KAPA ligation-based approach shows the lowest levels of human contamination, but contaminating reads had no effect on the downstream analysis. We found some examples of library preparation-specific differences in minority variant calling. Overall our data shows that the choice of Illumina library preparation method has minimal effects on consensus base calling and downstream phylogenetic analysis, and suggests that all methods would be suitable for use if specific reagents are difficult to obtain.


2020 ◽  
Author(s):  
Karamese Murat ◽  
Ozgur Didem ◽  
Tutuncu Emin Ediz

ABSTRACTIntroductionWe present the sequence analysis for 47 complete genomes for SARS-CoV-2 isolates on Turkish patients. To identify their genetic similarity, phylogenetic analysis was performed by comparing the worldwide SARS-CoV-2 sequences, selected from GISAID, to the complete genomes from Turkish isolates. In addition, we focused on the variation analysis to show the mutations on SARS-CoV-2 genomes.MethodsIllumina MiSeq platform was used for sequencing the libraries. The raw reads were aligned to the known SARS-CoV-2 genome (GenBank: MN908947.3) using the Burrows-Wheeler aligner (v.0.7.1). The phylogenetic tree was constructer using Phylip v.3.6 with Neighbor-Joining and composite likelihood method. The variants were detected by using Genome Analysis Toolkit-HaplotypeCaller v.3.8.0 and were inspected on GenomeBrowse v2.1.2.ResultsAll viral genome sequences of our isolates was located in lineage B under the different clusters such as B.1 (n=3), B.1.1 (n=28), and B.1.9 (n=16). According to the GISAID nomenclature, all our complete genomes were placed in G, GR and GH clades. Five hundred forty-nine total and 53 unique variants were detected. All 47 genomes exhibited different kinds of variants. The distinct variants consist of 274 missense, 225 synonymous, and 50 non-coding alleles.ConclusionThe results indicated that the SARS-CoV-2 sequences of our isolates have great similarity with all Turkish and European sequences. Further studies should be performed for better comparison of strains, after more complete genome sequences will be released. We also believe that collecting and sharing any data about SARS-CoV-2 virus and COVID-19 will be effective and may help the related studies.


Sign in / Sign up

Export Citation Format

Share Document