Fact and Fiction about 1%: Next Generation Sequencing and the Detection of Minor Drug Resistant Variants in HIV-1 Populations with and without Unique Molecular Identifiers

Shuntai Zhou; Ronald Swanstrom

doi:10.3390/v12080850

Fact and Fiction about 1%: Next Generation Sequencing and the Detection of Minor Drug Resistant Variants in HIV-1 Populations with and without Unique Molecular Identifiers

Viruses ◽

10.3390/v12080850 ◽

2020 ◽

Vol 12 (8) ◽

pp. 850 ◽

Cited By ~ 1

Author(s):

Shuntai Zhou ◽

Ronald Swanstrom

Keyword(s):

Next Generation Sequencing ◽

High Throughput Sequencing ◽

Consensus Sequence ◽

Viral Population ◽

Drug Resistant ◽

Next Generation ◽

Data Set ◽

Method Error ◽

Generation Sequencing ◽

Hiv 1

Next generation sequencing (NGS) platforms have the ability to generate almost limitless numbers of sequence reads starting with a PCR product. This gives the illusion that it is possible to analyze minor variants in a viral population. However, including a PCR step obscures the sampling depth of the viral population, the key parameter needed to understand the utility of the data set for finding minor variants. Also, these high throughput sequencing platforms are error prone at the level where minor variants are of interest, confounding the interpretation of detected minor variants. A simple strategy has been applied in multiple applications of NGS to solve these problems. Prior to PCR, individual molecules are “tagged” with a unique molecular identifier (UMI) that can be used to establish the actual sample size of viral genomes sequenced after PCR and sequencing. In addition, since PCR generates many copies of each sequence tagged to a specific UMI, a template consensus sequence (TCS) can be created from the many reads of each template, removing virtually all of the method error. From this perspective we examine our own use of a UMI, called Primer ID, in the detection of minor drug resistant variants in HIV-1 populations.

Download Full-text

Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations

Journal of Virology ◽

10.1128/jvi.00522-15 ◽

2015 ◽

Vol 89 (16) ◽

pp. 8540-8555 ◽

Cited By ~ 77

Author(s):

Shuntai Zhou ◽

Corbin Jones ◽

Piotr Mieczkowski ◽

Ronald Swanstrom

Keyword(s):

Next Generation Sequencing ◽

Error Rate ◽

Consensus Sequence ◽

Next Generation ◽

Sampling Depth ◽

Data Set ◽

Sequencing Errors ◽

Next Generation Sequencing Ngs ◽

Generation Sequencing ◽

Hiv 1

ABSTRACTValidating the sampling depth and reducing sequencing errors are critical for studies of viral populations using next-generation sequencing (NGS). We previously described the use of Primer ID to tag each viral RNA template with a block of degenerate nucleotides in the cDNA primer. We now show that low-abundance Primer IDs (offspring Primer IDs) are generated due to PCR/sequencing errors. These artifactual Primer IDs can be removed using a cutoff model for the number of reads required to make a template consensus sequence. We have modeled the fraction of sequences lost due to Primer ID resampling. For a typical sequencing run, less than 10% of the raw reads are lost to offspring Primer ID filtering and resampling. The remaining raw reads are used to correct for PCR resampling and sequencing errors. We also demonstrate that Primer ID reveals bias intrinsic to PCR, especially at low template input or utilization. cDNA synthesis and PCR convert ca. 20% of RNA templates into recoverable sequences, and 30-fold sequence coverage recovers most of these template sequences. We have directly measured the residual error rate to be around 1 in 10,000 nucleotides. We use this error rate and the Poisson distribution to define the cutoff to identify preexisting drug resistance mutations at low abundance in an HIV-infected subject. Collectively, these studies show that >90% of the raw sequence reads can be used to validate template sampling depth and to dramatically reduce the error rate in assessing a genetically diverse viral population using NGS.IMPORTANCEAlthough next-generation sequencing (NGS) has revolutionized sequencing strategies, it suffers from serious limitations in defining sequence heterogeneity in a genetically diverse population, such as HIV-1 due to PCR resampling and PCR/sequencing errors. The Primer ID approach reveals the true sampling depth and greatly reduces errors. Knowing the sampling depth allows the construction of a model of how to maximize the recovery of sequences from input templates and to reduce resampling of the Primer ID so that appropriate multiplexing can be included in the experimental design. With the defined sampling depth and measured error rate, we are able to assign cutoffs for the accurate detection of minority variants in viral populations. This approach allows the power of NGS to be realized without having to guess about sampling depth or to ignore the problem of PCR resampling, while also being able to correct most of the errors in the data set.

Download Full-text

A Pan-HIV Strategy for Complete Genome Sequencing

Journal of Clinical Microbiology ◽

10.1128/jcm.02479-15 ◽

2015 ◽

Vol 54 (4) ◽

pp. 868-882 ◽

Cited By ~ 36

Author(s):

Michael G. Berg ◽

Julie Yamaguchi ◽

Elodie Alessandri-Gradt ◽

Robert W. Tell ◽

Jean-Christophe Plantier ◽

...

Keyword(s):

Next Generation Sequencing ◽

Rna Virus ◽

Consensus Sequence ◽

Sample Type ◽

Full Genome ◽

Next Generation ◽

Viral Loads ◽

Subtype B ◽

Generation Sequencing ◽

Hiv 1

Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e.,switchingmechanismat 5′ end ofRNAtranscript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance.

Download Full-text

FDA authorizes marketing of first next-generation sequencing test for detecting HIV-1 drug resistance mutations

Case Medical Research ◽

10.31525/cmr-2106c4c ◽

2019 ◽

Author(s):

Keyword(s):

Drug Resistance ◽

Next Generation Sequencing ◽

Next Generation ◽

Resistance Mutations ◽

Drug Resistance Mutations ◽

Generation Sequencing ◽

Hiv 1

Download Full-text

Universal Amplification, Next-Generation Sequencing, and Assembly of HIV-1 Genomes

Journal of Clinical Microbiology ◽

10.1128/jcm.01516-12 ◽

2012 ◽

Vol 50 (12) ◽

pp. 3838-3844 ◽

Cited By ~ 85

Author(s):

A. Gall ◽

B. Ferns ◽

C. Morris ◽

S. Watson ◽

M. Cotten ◽

...

Keyword(s):

Next Generation Sequencing ◽

Next Generation ◽

Generation Sequencing ◽

Hiv 1

Download Full-text

Profiling DNA Methylation Based on Next-Generation Sequencing Approaches: New Insights and Clinical Applications

Genes ◽

10.3390/genes9090429 ◽

2018 ◽

Vol 9 (9) ◽

pp. 429 ◽

Cited By ~ 38

Author(s):

Daniela Barros-Silva ◽

C. Marques ◽

Rui Henrique ◽

Carmen Jerónimo

Keyword(s):

Dna Methylation ◽

Next Generation Sequencing ◽

High Throughput Sequencing ◽

Epigenetic Modification ◽

Response To Therapy ◽

Next Generation ◽

Sequencing Technologies ◽

Prognosis And Prediction ◽

Novel Biomarkers ◽

Generation Sequencing

DNA methylation is an epigenetic modification that plays a pivotal role in regulating gene expression and, consequently, influences a wide variety of biological processes and diseases. The advances in next-generation sequencing technologies allow for genome-wide profiling of methyl marks both at a single-nucleotide and at a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, coverage, and bioinformatics analysis. Thus, the selection of the most feasible method according with the project’s purpose requires in-depth knowledge of those techniques. Currently, high-throughput sequencing techniques are intensively used in epigenomics profiling, which ultimately aims to find novel biomarkers for detection, diagnosis prognosis, and prediction of response to therapy, as well as to discover new targets for personalized treatments. Here, we present, in brief, a portrayal of next-generation sequencing methodologies’ evolution for profiling DNA methylation, highlighting its potential for translational medicine and presenting significant findings in several diseases.

Download Full-text

Targeted next-generation sequencing of sputum for diagnosis of drug-resistant TB: results of a national survey in Democratic Republic of the Congo

Scientific Reports ◽

10.1038/s41598-020-67479-4 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Michel Kaswa Kayomo ◽

Vital Nkake Mbula ◽

Muriel Aloni ◽

Emmanuel André ◽

Leen Rigouts ◽

...

Keyword(s):

Next Generation Sequencing ◽

National Survey ◽

Democratic Republic ◽

Drug Resistant ◽

Next Generation ◽

Targeted Next Generation Sequencing ◽

Republic Of The Congo ◽

Generation Sequencing

Download Full-text

Hi-C chromosome conformation capture sequencing of avian genomes using the BGISEQ-500 platform

GigaScience ◽

10.1093/gigascience/giaa087 ◽

2020 ◽

Vol 9 (8) ◽

Author(s):

Marcela Sandoval-Velasco ◽

Juan Antonio Rodríguez ◽

Cynthia Perez Estrada ◽

Guojie Zhang ◽

Erez Lieberman Aiden ◽

...

Keyword(s):

Next Generation Sequencing ◽

High Throughput Sequencing ◽

Data Generation ◽

Next Generation ◽

Sequencing Data ◽

Yield Data ◽

Chromosome Conformation ◽

Sequencing Platform ◽

Sequencing Platforms ◽

Generation Sequencing

Abstract Background Hi-C experiments couple DNA-DNA proximity with next-generation sequencing to yield an unbiased description of genome-wide interactions. Previous methods describing Hi-C experiments have focused on the industry-standard Illumina sequencing. With new next-generation sequencing platforms such as BGISEQ-500 becoming more widely available, protocol adaptations to fit platform-specific requirements are useful to give increased choice to researchers who routinely generate sequencing data. Results We describe an in situ Hi-C protocol adapted to be compatible with the BGISEQ-500 high-throughput sequencing platform. Using zebra finch (Taeniopygia guttata) as a biological sample, we demonstrate how Hi-C libraries can be constructed to generate informative data using the BGISEQ-500 platform, following circularization and DNA nanoball generation. Our protocol is a modification of an Illumina-compatible method, based around blunt-end ligations in library construction, using un-barcoded, distally overhanging double-stranded adapters, followed by amplification using indexed primers. The resulting libraries are ready for circularization and subsequent sequencing on the BGISEQ series of platforms and yield data similar to what can be expected using Illumina-compatible approaches. Conclusions Our straightforward modification to an Illumina-compatible in situHi-C protocol enables data generation on the BGISEQ series of platforms, thus expanding the options available for researchers who wish to utilize the powerful Hi-C techniques in their research.

Download Full-text

Routine drug resistance testing in HIV-1 proviral DNA, using an automated next- generation sequencing assay

Journal of Clinical Virology ◽

10.1016/j.jcv.2019.104207 ◽

2019 ◽

Vol 121 ◽

pp. 104207 ◽

Cited By ~ 2

Author(s):

Enagnon Kazali Alidjinou ◽

Pauline Coulon ◽

Christophe Hallaert ◽

Olivier Robineau ◽

Agnès Meybeck ◽

...

Keyword(s):

Drug Resistance ◽

Next Generation Sequencing ◽

Resistance Testing ◽

Next Generation ◽

Proviral Dna ◽

Drug Resistance Testing ◽

Generation Sequencing ◽

Hiv 1

Download Full-text

Next generation sequencing of HIV-1 protease in the PIVOT trial of protease inhibitor monotherapy

Journal of Clinical Virology ◽

10.1016/j.jcv.2018.02.003 ◽

2018 ◽

Vol 101 ◽

pp. 63-65 ◽

Cited By ~ 2

Author(s):

David T. Dunn ◽

Wolfgang Stöhr ◽

Alejandro Arenas-Pinto ◽

Anna Tostevin ◽

Jean L. Mbisa ◽

...

Keyword(s):

Next Generation Sequencing ◽

Protease Inhibitor ◽

Next Generation ◽

Protease Inhibitor Monotherapy ◽

Generation Sequencing ◽

Hiv 1

Download Full-text

Screening and Identification of PLK1-Polo Box Binding Peptides by High-Throughput Sequencing of Phage-Selected Libraries

Protein and Peptide Letters ◽

10.2174/0929866526666190318101054 ◽

2019 ◽

Vol 26 (8) ◽

pp. 620-633 ◽

Cited By ~ 1

Author(s):

Nousheen Bibi ◽

Hafsa Niaz ◽

Ted Hupp ◽

Mohammad Amjad Kamal ◽

Sajid Rashid

Keyword(s):

Next Generation Sequencing ◽

Phage Display ◽

High Throughput ◽

High Throughput Sequencing ◽

B Lymphocyte ◽

Fluorescent Protein ◽

Next Generation ◽

Peptide Phage Display ◽

Binding Peptides ◽

Generation Sequencing

Background: Human proteome contains a plethora of short linear peptide motifs that is crucial for signaling and other cellular processes. These motifs are difficult to identify due to lack of systematic approach for their detection. Objective: Here we demonstrate the use of peptide phage display in combination with high throughput next generation sequencing to identify enriched peptide sequences through biopanning process against polo box domain (PBD) of mitotic polo like kinase 1 (Plk1). Methods: Purified recombinant Plk1 and two unrelated controls namely B-lymphocyte antigen (CD20) and fluorescent protein (mCherry) were subjected to peptide phage display analysis. Bacterially-propagated phage DNA was amplified by PCR using triplet bar coded primers to tag the pool from each amplicon. Results: Proteomic peptide phage display along with next generation sequencing and Bioinformatics analysis demonstrated several known and putative novel interactions which were potentially related to Plk1-PBD. With our strategy, we were able to identify and characterize several Plk1-PBD binding peptides, as well as define more precisely, consensus sequences. Conclusion: We believe that this information could provide valuable tools for exploring novel interaction involved in Plk1 signaling as well as to choose peptides for Plk1 specific drug development.

Download Full-text