scholarly journals A bioinformatics approach to identifyingWolbachiainfections in arthropods

PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e5486 ◽  
Author(s):  
Jane Pascar ◽  
Christopher H. Chandler

Wolbachiais the most widespread endosymbiont, infecting >20% of arthropod species, and capable of drastically manipulating the host’s reproductive mechanisms. Conventionally, diagnosis has relied on PCR amplification; however, PCR is not always a reliable diagnostic technique due to primer specificity, strain diversity, degree of infection and/or tissue sampled. Here, we look for evidence ofWolbachiainfection across a wide array of arthropod species using a bioinformatic approach to detect theWolbachiagenesftsZ, wsp,and thegroEoperon in next-generation sequencing samples available through the NCBI Sequence Read Archive. For samples showing signs of infection, we attempted to assemble entireWolbachiagenomes, and in order to better understand the relationships between hosts and symbionts, phylogenies were constructed using the assembled gene sequences. Out of the 34 species with positively identified infections, eight species of arthropod had not previously been recorded to harborWolbachiainfection. All putative infections cluster with known representative strains belonging to supergroup A or B, which are known to only infect arthropods. This study presents an efficient bioinformatic approach for post-sequencing diagnosis and analysis ofWolbachiainfection in arthropods.

F1000Research ◽  
2019 ◽  
Vol 8 ◽  
pp. 532 ◽  
Author(s):  
Saket Choudhary

The NCBI Sequence Read Archive (SRA) is the primary archive of next-generation sequencing datasets. SRA makes metadata and raw sequencing data available to the research community to encourage reproducibility and to provide avenues for testing novel hypotheses on publicly available data. However, methods to programmatically access this data are limited. We introduce the Python package, pysradb, which provides a collection of command line methods to query and download metadata and data from SRA, utilizing the curated metadata database available through the SRAdb project. We demonstrate the utility of pysradb on multiple use cases for searching and downloading SRA datasets. It is available freely at https://github.com/saketkc/pysradb.


2019 ◽  
Author(s):  
Saket Choudhary

AbstractNCBIs Sequence Read Archive (SRA) is the primary archive of next-generation sequencing datasets. SRA makes metadata and raw sequencing data available to the research community to encourage reproducibility, and to provide avenues for testing novel hypotheses on publicly available data. However, existing methods to programmatically access these data are limited. We introduce a Python packagepysradbthat provides a collection of command line methods to query and download metadata and data from SRA utilizing the curated metadata database available through the SRAdb project. We demonstrate the utility ofpysradbon multiple use cases for searching and downloading SRA datasets. It is available freely athttps://github.com/saketkc/pysradb.


2015 ◽  
Vol 407 (7) ◽  
pp. 1841-1848 ◽  
Author(s):  
Marta Pawluczyk ◽  
Julia Weiss ◽  
Matthew G. Links ◽  
Mikel Egaña Aranguren ◽  
Mark D. Wilkinson ◽  
...  

2017 ◽  
Author(s):  
Xiao-Tao Jiang ◽  
Ke Yu ◽  
Li-Guan Li ◽  
Xiao-Le Yin ◽  
An-Dong Li ◽  
...  

AbstractMetatranscriptome has become increasingly important along with the application of next generation sequencing in the studies of microbial functional gene activity in environmental samples. However, the quantification of target active gene is hindered by the current relative quantification methods, especially when tracking the sharp environmental change. Great needs are here for an easy-to-perform method to obtain the absolute quantification. By borrowing information from the parallel metagenome, an absolute quantification method for both metagenomic and metatranscriptomic data to per gene/cell/volume/gram level was developed. The effectiveness of AQMM was validated by simulated experiments and was demonstrated with a real experimental design of comparing activated sludge with and without foaming. Our method provides a novel bioinformatic approach to fast and accurately conduct absolute quantification of metagenome and metatranscriptome in environmental samples. The AQMM can be accessed from https://github.com/biofuture/aqmm.


Blood ◽  
2015 ◽  
Vol 126 (23) ◽  
pp. 5283-5283
Author(s):  
Fiona M Quinn ◽  
Deirdre Waldron ◽  
David O'Brien ◽  
Johanna Kelly ◽  
Kathleen Scott ◽  
...  

Abstract Genetic makers such as TP53, ATM mutations and IgH somatic hypermutation status have been used to predict clinical behaviour and response to therapy in Chronic Lymphocytic Leukaemia (CLL) for the last decade. Recently identified genetic markers including splicing factor 3 B1 (SF3B1), NOTCH and BirC3 are also significant but their role cannot be fully evaluated until a robust assay is available for a routine molecular diagnostic laboratory The cell spliceosome is involved in the precise excision of introns from pre-mRNA and is composed of five subunits of which SF3B1 is a core component. Patient samples with SF3B1 mutations display enhanced intron retention in a small subset of transcripts involved in cancer related regulation such as cell cycle control, apoptosis and angiogenesis. Studies of CLL patient samples has shown that those with mutations in SF3B1 have been associated with shorter treatment free and overall survival. The screening methodology for SF3B1 mutations through deep sequencing, though very sensitive, is not available in routine diagnostic laboratories. Approximately 90% of CLL-associated SF3B1 mutations occur is 336bp region of the cDNA exome amenable to PCR amplification. We designed primers flanking this region and following reverse transcription (RT) PCR amplification, detected the products on standard agarose gels. Following bidirectional Sanger sequencing, we compared the sequences obtained with reference SF3B1 cDNA/amino acid code. We assessed the impact of the mutations by the polyphen database and compared this to the immunophenotype and cytogenetic data as well as response to treatment through minimal residual disease (MRD) tracking. We subsequently designed an assay for detection of sequence variants in the SF3B1 region of interest by High Resolution Melt (HRM) curve analysis. This has enabled us to identify genetic variants of the SF3B1 region prior to sequencing. The application of this HRM pre-sequencing screening assay has resulted in a significant reduction in the number of samples requiring sequencing. The patient cohort consisted of 52 patients enrolled on a Phase II, multi centre study of fludarabine, cyclophosphamide and rituximab (FCR) and included patients < 65 yrs, WHO status of 0, 1 or 2 without a deletion 17p. All samples were collected and tested prior to treatment, at the end of treatment (after 4 or 6 cycles when MRD negative), 6 monthly post treatment for 5 years for minimal residual disease and then annually by 6 colour immunophenotyping. Patients (51/52, no RNA available for 1 patient) were screened prior to treatment for the presence of SF3B1 mutations. Five of 51 (9.8 %) patients had SF3B1 mutations detectable by this methodology shown in the table below. Table 1. Pt. Nucleotide change Amino acid change Polyphen IGHV M=mutatedU=unmutated Cytogenetics Immunophenotyping MRD 1 c.2146A>G p.K700E Damaging U V4-6*01 Normal CD5 -ve Not tracked due to neg CD5 2 c.2179G>C p.A711P Damaging U V1-69*01 Normal Neg @ 3 mts Pos @ 16 mts 3 c.2267G>A p.G740E Damaging M V3-74*01 del 13q CD5 weak +ve CD38 +ve Neg @ 3 mts Pos @ 12 mts 4 c.2146A>G p.K700E Damaging U V1-69*01 del 13q Never MRD neg 5 c.2032C>T p.H662Y Damaging U V3-11*01 del 13q CD38 +ve Withdrawn Patients with mutations in SF3B1 all had favourable cytogenetics (normal or del 13q) and the majority (4/5) had unmutated IGHV genes. We detected two patients with a mutation at c.2146A>G, p.K700E and one at c.2267G>A, pG740E which have been described previously and two novel mutations at c.2179G>C, p.A711P and c.2032C>T, p.H662Y. All mutations detected were damaging as per the polyphen database. Two patients have weak or no expression of CD5. One patient has never achieved MRD negativity, and two patients who became MRD negative at 3 months reverted to MRD positivity 9 and 13 months later respectively. One patient's MRD levels could not be tracked due to negative CD5 expression and was followed by a less sensitive IgH clonality assay (sensitivity approx. 1%) and one patient was withdrawn from the study. In summary, we have developed a simple PCR based technique to detect SF3B1 mutations in CLL patient samples with high disease burden which can be applied in routine molecular diagnostic laboratories without access to next generation sequencing. The use of HRM technology resulted in a > 90% reduction in the number of samples requiring sequencing. Disclosures No relevant conflicts of interest to declare.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9255 ◽  
Author(s):  
Carlos Farkas ◽  
Francisco Fuentes-Villalobos ◽  
Jose Luis Garrido ◽  
Jody Haigh ◽  
María Inés Barría

Here we aim to describe early mutational events across samples from publicly available SARS-CoV-2 sequences from the sequence read archive and GenBank repositories. Up until 27 March 2020, we downloaded 50 illumina datasets, mostly from China, USA (WA State) and Australia (VIC). A total of 30 datasets (60%) contain at least a single founder mutation and most of the variants are missense (over 63%). Five-point mutations with clonal (founder) effect were found in USA next-generation sequencing samples. Sequencing samples from North America in GenBank (22 April 2020) present this signature with up to 39% allele frequencies among samples (n = 1,359). Australian variant signatures were more diverse than USA samples, but still, clonal events were found in these samples. Mutations in the helicase, encoded by the ORF1ab gene in SARS-CoV-2 were predominant, among others, suggesting that these regions are actively evolving. Finally, we firmly urge that primer sets for diagnosis be carefully designed, since rapidly occurring variants would affect the performance of the reverse transcribed quantitative PCR (RT-qPCR) based viral testing.


2018 ◽  
Author(s):  
Mudra Hegde ◽  
Christine Strand ◽  
Ruth E. Hanna ◽  
John G. Doench

ABSTRACTMany implementations of pooled screens in mammalian cells rely on linking an element of interest to a barcode, with the latter subsequently quantitated by next generation sequencing. However, substantial uncoupling between these paired elements during lentiviral production has been reported, especially as the distance between elements increases. We detail that PCR amplification is another major source of uncoupling, and becomes more pronounced with increased amounts of DNA template molecules and PCR cycles. To lessen uncoupling in systems that use paired elements for detection, we recommend minimizing the distance between elements, using low and equal template DNA inputs for plasmid and genomic DNA during PCR, and minimizing the number of PCR cycles. We also present a vector design for conducting combinatorial CRISPR screens that enables accurate barcode-based detection with a single short sequencing read and minimal uncoupling.


Blood ◽  
2015 ◽  
Vol 126 (23) ◽  
pp. 478-478
Author(s):  
Michael W. Schmitt ◽  
Justin R. Pritchard ◽  
Graeme Hodgson ◽  
Victor M. Rivera ◽  
Bella I. Aminov ◽  
...  

Abstract Tyrosine kinase inhibitors (TKIÕs) vastly improve survival in chronic myeloid leukemia (CML). However resistance is a common occurrence, and is most frequently mediated by point mutations in the Abl kinase. There have been conflicting reports regarding the frequency of resistance mutations in the Abl kinase at the time of CML diagnosis, with some studies indicating that a large number of resistance mutations are pre-existing in many patients. However, a high burden of pre-existing resistance mutations is seemingly at odds with the ability of TKIÕs to initially control CML for months to years in the large majority of patients. Nearly all prior studies have used dozens of cycles of PCR amplification to enrich the Abl gene prior to mutation detection. PCR amplification introduces thousands of errors throughout the gene, which makes accurate identification of mutations challenging. Methods of mutation detection such as next-generation sequencing (NGS) are also error prone, with a background error rate near 1%. These high rates of background mutation render accurate detection of minority variants difficult or impossible. We have developed an alternate DNA mutation detection method, termed Duplex Sequencing (Schmitt MW et al, Nature Methods 2015;12:423-425), which improves on the accuracy of DNA sequencing by >100,000 fold. In Duplex Sequencing, the two strands of single DNA molecules are individually tagged and sequenced. True mutations are present at the same position in both strands and are complementary, whereas artifacts arising from amplification or sequencing errors are seen in only one strand and are not scored. We have additionally developed methods for >1,000,000 fold enrichment of the Abl gene from human cells. This enrichment protocol allows for unbiased detection of all possible mutations throughout the Abl gene, with an accuracy exceeding 99.9999%. We applied Duplex Sequencing to 18 patients with newly diagnosed chronic phase CML (CP-CML). We sequenced the active site exons of Abl from tens of thousands of individual CML genomes; no sub-clonal resistance mutations were seen in any patient. We demonstrate that prior reports indicating frequent resistance mutations at the time of CML diagnosis may be attributable to artifactual mutations introduced by reverse transcription and PCR amplification. We are now studying patients from the PACE trial (Cortes JE et al. N Engl J Med. 2013;369:1783-1796); these patients were heavily pre-treated, with the majority having received 3 or more prior TKIÕs. We identified 29 patients from the trial with a treatment-emergent mutation not detectable at baseline by conventional methods (NGS or Sanger), and have performed Duplex Sequencing on 9 of these patients to date. In contrast to treatment-naïve patients, we find that these heavily pre-treated patients commonly harbor multiple co-existing resistant sub-clones, most of which fall below the detection limit of conventional methods but are readily detected by Duplex Sequencing. In addition, in 44% of patients, mutations that emerged during ponatinib treatment were found as rare sub-clones at baseline (average detection limit of 1 in 11,412 cells), indicating that the mutant sub-clones pre-existed prior to ponatinib therapy. We are now sequencing the remainder of the cohort, and are increasing our detection limit through interrogation of a larger number of cells per patient. Our results indicate the feasibility of detecting sub-clonal resistance mutations at the time of initiation of next-line TKI therapy in order to guide treatment, and argue that high-sensitivity methods for mutation detection are required to optimally stratify patients to the most appropriate TKI. Moreover, our approach can detect sub-clonal mutations in any gene, and could be broadly applicable for identification of sub-clonal drug resistance mutations relevant to other targeted therapies. Figure 1. Detection of sub-clonal mutations in the Abl gene. Top panel: Next-generation sequencing of the Abl gene in a CML patient results in thousands of artifactual errors, preventing detection of low-level mutations. Bottom panel: Duplex Sequencing of the same sample eliminates artifactual errors, revealing a single point mutation in Abl. This mutation (E279K) confers resistance to imatinib. Figure 1. Detection of sub-clonal mutations in the Abl gene. Top panel: Next-generation sequencing of the Abl gene in a CML patient results in thousands of artifactual errors, preventing detection of low-level mutations. Bottom panel: Duplex Sequencing of the same sample eliminates artifactual errors, revealing a single point mutation in Abl. This mutation (E279K) confers resistance to imatinib. Disclosures Pritchard: ARIAD: Employment, Equity Ownership. Hodgson:ARIAD Pharmaceuticals: Employment, Equity Ownership. Rivera:ARIAD: Employment, Equity Ownership.


Sign in / Sign up

Export Citation Format

Share Document