Determining Exon Connectivity in Complex mRNAs by Nanopore Sequencing

Plastid gene expression involves many post-transcriptional maturation steps resulting in a complex transcriptome composed of multiple isoforms. Although short read RNA-seq has considerably improved our understanding of the molecular mechanisms controlling these processes, it is unable to sequence full-length transcripts. This information is however crucial when it comes to understand the interplay between the various steps of plastid gene expression. Here, the study of the Arabidopsis leaf plastid transcriptome using Nanopore sequencing showed that many splicing and editing events were not independent but co-occurring. For a given transcript, maturation events also appeared to be chronologically ordered with splicing happening after most sites are edited.

Download Full-text

Detection of Clinically Relevant Molecular Alterations in Chronic Lymphocytic Leukemia (CLL) By Nanopore Sequencing

Blood ◽

10.1182/blood-2018-99-110948 ◽

2018 ◽

Vol 132 (Supplement 1) ◽

pp. 1847-1847 ◽

Cited By ~ 1

Author(s):

Adam Burns ◽

David Robert Bruce ◽

Pauline Robbe ◽

Adele Timbs ◽

Basile Stamatopoulos ◽

...

Keyword(s):

Error Correction ◽

Low Cost ◽

Nanopore Sequencing ◽

Sequencing Data ◽

Mutation Status ◽

Short Read ◽

Short Read Sequencing ◽

Oxford Nanopore ◽

Low Coverage ◽

Oxford Nanopore Technologies

Abstract Introduction Chronic Lymphocytic Leukaemia (CLL) is the most prevalent leukaemia in the Western world and characterised by clinical heterogeneity. IgHV mutation status, mutations in the TP53 gene and deletions of the p-arm of chromosome 17 are currently used to predict an individual patient's response to therapy and give an indication as to their long-term prognosis. Current clinical guidelines recommend screening patients prior to initial, and any subsequent, treatment. Routine clinical laboratory practices for CLL involve three separate assays, each of which are time-consuming and require significant investment in equipment. Nanopore sequencing offers a rapid, low-cost alternative, generating a full prognostic dataset on a single platform. In addition, Nanopore sequencing also promises low failure rates on degraded material such as FFPE and excellent detection of structural variants due to long read length of sequencing. Importantly, Nanopore technology does not require expensive equipment, is low-maintenance and ideal for patient-near testing, making it an attractive DNA sequencing device for low-to-middle-income countries. Methods Eleven untreated CLL samples were selected for the analysis, harbouring both mutated (n=5) and unmutated (n=6) IgHV genes, seven TP53 mutations (five missense, one stop gain and one frameshift) and two del(17p) events. Primers were designed to amplify all exons of TP53, along with the IgHV locus, and each primer included universal tails for individual sample barcoding. The resulting PCR amplicons were prepared for sequencing using a ligation sequencing kit (SQK-LSK108, Oxford Nanopore Technologies, Oxford, UK). All IgHV libraries were pooled and sequenced on one R9.4 flowcell, with the TP53 libraries pooled and sequenced on a second R9.4 flowcell. Whole genome libraries were prepared from 400ng genomic DNA for each sample using a rapid sequencing kit (SQK-RAD004, Oxford Nanopore Technologies, Oxford, UK), and each sample sequenced on individual flowcells on a MinION mk1b instrument (Oxford Nanopore Technologies, Oxford, UK). We developed a bespoke bioinformatics pipeline to detect copy-number changes, TP53 mutations and IgHV mutation status from the Nanopore sequencing data. Results were compared to short-read sequencing data obtained earlier by targeted deep sequencing (MiSeq, Illumina Inc, San Diego, CA, USA) and whole genome sequencing (HiSeq 2500, Illumina Inc, San Diego CA, USA). Results Following basecalling and adaptor trimming, the raw data were submitted to the IMGT database. In the absence of error correction, it was possible to identify the correct VH family for each sample; however the germline homology was not sufficient to differentiate between IgHVmut and IgHVunmut CLL cases. Following bio-informatic error correction and consensus building, the percentage to germline homology was the same as that obtained from short-read sequencing and nanopore sequencing also called the same productive rearrangements in all cases. A total of 77 TP53 variants were identified, including 68 in non-coding regions, and three synonymous SNVs. The remaining 6 were predicted to be functional variants (eight missense and two stop-gains) and had all been identified in early MiSeq targeted sequencing. However, the frameshift mutation was not called by the analysis pipeline, although it is present in the aligned reads. Using the low-coverage WGS data, we were able to identify del(17p) events, of 19Mb and 20Mb length, in both patients with high confidence. Conclusions Here we demonstrate that characterization of the IgHV locus in CLL cases is possible using the MinION platform, provided sufficient downstream analysis, including error correction, is applied. Furthermore, somatic SNVs in TP53 can be identified, although similar to second generation sequencing, variant calling of small insertions and deletions is more problematic. Identification of del(17p) is possible from low-coverage WGS on the MinION and is inexpensive. Our data demonstrates that Nanopore sequencing can be a viable, patient-near, low-cost alternative to established screening methods, with the potential of diagnostic implementation in resource-poor regions of the world. Disclosures Schuh: Giles, Roche, Janssen, AbbVie: Honoraria.

Download Full-text

Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis

10.1101/483693 ◽

2018 ◽

Cited By ~ 8

Author(s):

Adrian Viehweger ◽

Sebastian Krautwurst ◽

Kevin Lamkiewicz ◽

Ramakanth Madhugiri ◽

John Ziebuhr ◽

...

Keyword(s):

Rna Sequencing ◽

Rna Virus ◽

Consensus Sequence ◽

Full Length ◽

Viral Rna ◽

Genome Replication ◽

Nanopore Sequencing ◽

Human Coronavirus ◽

Viral Rnas ◽

Virus Genomes

Sequence analyses of RNA virus genomes remain challenging due to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called 'quasispecies'. Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (i) RNA virus genomes and (ii) subgenome-length (sg) RNAs comprised of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. Using DRS, we were able to map the longest (~26 kb) contiguous read to the viral reference genome. By combining Illumina and nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV) 229E genome (27.3 kb). Furthermore, using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by demonstrating the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that direct RNA sequencing may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.

Download Full-text

Direct RNA Nanopore Sequencing of SARS-CoV-2 Extracted from Critical Material from Swabs

Life ◽

10.3390/life12010069 ◽

2022 ◽

Vol 12 (1) ◽

pp. 69

Author(s):

Davide Vacca ◽

Antonino Fiannaca ◽

Fabio Tramuto ◽

Valeria Cancila ◽

Laura La Paglia ◽

...

Keyword(s):

Rna Sequencing ◽

Systematic Errors ◽

N Gene ◽

Nanopore Sequencing ◽

Rna Seq ◽

Bioinformatics Pipeline ◽

Critical Material ◽

Oxford Nanopore ◽

Oropharyngeal Swab ◽

Generation Sequencing

In consideration of the increasing prevalence of COVID-19 cases in several countries and the resulting demand for unbiased sequencing approaches, we performed a direct RNA sequencing (direct RNA seq.) experiment using critical oropharyngeal swab samples collected from Italian patients infected with SARS-CoV-2 from the Palermo region in Sicily. Here, we identified the sequences SARS-CoV-2 directly in RNA extracted from critical samples using the Oxford Nanopore MinION technology without prior cDNA retrotranscription. Using an appropriate bioinformatics pipeline, we could identify mutations in the nucleocapsid (N) gene, which have been reported previously in studies conducted in other countries. In conclusion, to the best of our knowledge, the technique used in this study has not been used for SARS-CoV-2 detection previously owing to the difficulties in the extraction of RNA of sufficient quantity and quality from routine oropharyngeal swabs. Despite these limitations, this approach provides the advantages of true native RNA sequencing and does not include amplification steps that could introduce systematic errors. This study can provide novel information relevant to the current strategies adopted in SARS-CoV-2 next-generation sequencing.

Download Full-text

Nanopore sequencing reveals endogenous NMD-targeted isoforms in human cells

10.1101/2021.04.30.442116 ◽

2021 ◽

Author(s):

Evangelos D. Karousis ◽

Foivos Gypas ◽

Mihaela Zavolan ◽

Oliver Muehlemann

Keyword(s):

Regulation Of Gene Expression ◽

Degradation Pathway ◽

Human Cells ◽

Full Length ◽

Main Function ◽

Nanopore Sequencing ◽

Sequencing Data ◽

Short Read ◽

Expression Levels ◽

Short Read Sequencing

Background: Nonsense-mediated mRNA decay (NMD) is a eukaryotic, translation-dependent degradation pathway that targets mRNAs with premature termination codons and also regulates the expression of some mRNAs that encode full-length proteins. Although many genes express NMD-sensitive transcripts, identifying them based on short-read sequencing data remains a challenge. Results: To identify and analyze endogenous targets of NMD, we applied cDNA Nanopore sequencing and short-read sequencing to human cells with varying expression levels of NMD factors. Our approach detects full-length NMD substrates that are highly unstable and increase in levels or even only appear when NMD is inhibited. Among the many new NMD-targeted isoforms that our analysis identified, most derive from alternative exon usage. The isoform-aware analysis revealed many genes with significant changes in splicing but no significant changes in overall expression levels upon NMD knockdown. NMD-sensitive mRNAs have more exons in the 3΄UTR and, for those mRNAs with a termination codon in the last exon, the length of the 3΄UTR per se does not correlate with NMD sensitivity. Analysis of splicing signals reveals isoforms where NMD has been co-opted in the regulation of gene expression, though the main function of NMD still seems to be ridding the transcriptome of isoforms resulting from spurious splicing events. Conclusions: Long-read sequencing enabled the identification of many novel NMD-sensitive mRNAs and revealed both known and unexpected features concerning their biogenesis and their biological role. Our data provide a highly valuable resource of human NMD transcript targets for future genomic and transcriptomic applications.

Download Full-text

Direct RNA nanopore sequencing of SARS-CoV-2 extracted from critical material from swabs

10.1101/2020.12.21.20191346 ◽

2020 ◽

Author(s):

Davide Vacca ◽

Antonino Fiannaca ◽

Fabio Tramuto ◽

Valeria Cancila ◽

Laura La Paglia ◽

...

Keyword(s):

Rna Sequencing ◽

Gene Sequence ◽

N Gene ◽

Nanopore Sequencing ◽

Bioinformatics Pipeline ◽

Critical Material ◽

Oxford Nanopore ◽

Sequencing Experiment ◽

Oropharyngeal Swab ◽

Generation Sequencing

ABSTRACTBackgroundIn consideration of the increasing prevalence of COVID-19 cases in several countries and the resulting demand for unbiased sequencing approaches, we performed a direct RNA sequencing experiment using critical oropharyngeal swab samples collected from Italian patients infected with SARS-CoV-2 from the Palermo region in Sicily.MethodsHere, we identified the sequences SARS-CoV-2 directly in RNA extracted from critical samples using the Oxford Nanopore MinION technology without prior cDNA retro-transcription.ResultsUsing an appropriate bioinformatics pipeline, we could identify mutations in the nucleocapisid (N) gene, which have been reported previously in studies conducted in other countries.ConclusionTo the best of our knowledge, the technique used in this study has not been used for SARS-CoV-2 detection previously owing to the difficulties in the extraction of RNA of sufficient quantity and quality from routine oropharyngeal swabs.Despite these limitations, this approach provides the advantages of true native RNA sequencing, and does not include amplification steps that could introduce systematic errors.This study can provide novel information relevant to the current strategies adopted in SARS-CoV-2 next-generation sequencing.We deposited the gene sequence in the NCBI database under the following URL:https://www.ncbi.nlm.nih.gov/nuccore/MT457389

Download Full-text

Nanopore sequencing reveals endogenous NMD-targeted isoforms in human cells

Genome Biology ◽

10.1186/s13059-021-02439-3 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Evangelos D. Karousis ◽

Foivos Gypas ◽

Mihaela Zavolan ◽

Oliver Mühlemann

Keyword(s):

Regulation Of Gene Expression ◽

Degradation Pathway ◽

Human Cells ◽

Full Length ◽

Main Function ◽

Nanopore Sequencing ◽

Sequencing Data ◽

Short Read ◽

Expression Levels ◽

Short Read Sequencing

Abstract Background Nonsense-mediated mRNA decay (NMD) is a eukaryotic, translation-dependent degradation pathway that targets mRNAs with premature termination codons and also regulates the expression of some mRNAs that encode full-length proteins. Although many genes express NMD-sensitive transcripts, identifying them based on short-read sequencing data remains a challenge. Results To identify and analyze endogenous targets of NMD, we apply cDNA Nanopore sequencing and short-read sequencing to human cells with varying expression levels of NMD factors. Our approach detects full-length NMD substrates that are highly unstable and increase in levels or even only appear when NMD is inhibited. Among the many new NMD-targeted isoforms that our analysis identifies, most derive from alternative exon usage. The isoform-aware analysis reveals many genes with significant changes in splicing but no significant changes in overall expression levels upon NMD knockdown. NMD-sensitive mRNAs have more exons in the 3΄UTR and, for those mRNAs with a termination codon in the last exon, the length of the 3΄UTR per se does not correlate with NMD sensitivity. Analysis of splicing signals reveals isoforms where NMD has been co-opted in the regulation of gene expression, though the main function of NMD seems to be ridding the transcriptome of isoforms resulting from spurious splicing events. Conclusions Long-read sequencing enables the identification of many novel NMD-sensitive mRNAs and reveals both known and unexpected features concerning their biogenesis and their biological role. Our data provide a highly valuable resource of human NMD transcript targets for future genomic and transcriptomic applications.

Download Full-text

A streamlined protocol for high-throughput amplification-based analysis of DNA samples via nanopore sequencing (based on the 96-well PCR barcoding kit) v1

10.17504/protocols.io.bw2cpgaw ◽

2021 ◽

Author(s):

Jordan P Cuff ◽

HooperAS not provided ◽

Shrinivas Nivrutti Dighe ◽

Angela Marchbank ◽

Peter Kille

Keyword(s):

High Throughput ◽

Genomic Dna ◽

Dna Amplification ◽

Cost Effective ◽

Nanopore Sequencing ◽

Optimal Method ◽

Adapter Ligation ◽

Large Numbers ◽

Oxford Nanopore

Nanopore sequencing facilitates the rapid and cost-effective sequencing of long fragment DNA for a massive range of applications. When looking to holistically analyse low-yield DNA samples using nanopore sequencing, the optimal method is likely to involve the PCR Barcoding Kit. This effectively involves blunt end ligation of priming sites onto all extant DNA for holistic amplification to achieve yields suitable for nanopore sequencing. The currently available kits from nanopore facilitate the multiplexing of 96 samples in one sequencing run using this method, but the reagent costs are inherently multiplicative. This protocol is designed to streamline (in terms of cost, reagents and time) the process of sequencing up to 96 samples of genomic DNA through nanopore sequencing. This protocol is best applied to large numbers of samples (up to 96). For smaller numbers of samples, consider the smaller "PCR Barcoding" kits provided by nanopore which similarly achieve holistic DNA amplification and sequencing, but without the need for additional adapter ligation. The protocol is best suited to samples with low DNA yields (100 ng input is recommended). If you can input 1000 ng of DNA from each of your samples, consider using the 96-well Ligation Kit from Oxford Nanopore which can similarly be streamlined in terms of cost and time, but avoids the amplification step.

Download Full-text