scholarly journals BANDITS: Bayesian differential splicing accounting for sample-to-sample variability and mapping uncertainty

2019 ◽  
Author(s):  
Simone Tiberi ◽  
Mark D Robinson

AbstractAlternative splicing is a biological process during gene expression that allows a single gene to code for multiple proteins. However, splicing patterns can be altered in some conditions or diseases. Here, we present BANDITS, a R/Bioconductor package to perform differential splicing, at both gene and transcript-level, based on RNA-seq data. BANDITS uses a Bayesian hierarchical structure to explicitly model the variability between samples, and treats the transcript allocation of reads as latent variables. We perform an extensive benchmark across both simulated and experimental RNA-seq datasets, where BANDITS has extremely favorable performance with respect to the competitors considered.

2019 ◽  
Author(s):  
Hongxu Ding ◽  
Andrew Blair ◽  
Ying Yang ◽  
Joshua M. Stuart

ABSTRACTThe maintenance and transition of cellular states are controlled by biological processes. Here we present a gene set-based transformation of single cell RNA-Seq data into biological process activities that provides a robust description of cellular states. Moreover, as these activities represent species-independent descriptors, they facilitate the alignment of single cell states across different organisms.


2019 ◽  
Author(s):  
Caleb M. Radens ◽  
Davia Blake ◽  
Paul Jewell ◽  
Yoseph Barash ◽  
Kristen W. Lynch

SummaryDistinct T cell subtypes are typically defined by the expression of distinct gene repertoires. However, there is variability between studies regarding the markers used to define each T cell subtype. Moreover, previous analysis of gene expression in T cell subsets has largely focused on gene expression rather than alternative splicing. Here we take a meta-analysis approach, comparing eleven independent RNA-Seq studies of human Th1, Th2, Th17 and/or Treg cells to identify transcriptomic features that correlate consistently with subtype. We find that known master-regulators are consistently enriched in the appropriate subtype, however, cytokines and other genes often used as markers are more variable. Importantly, we also identify previously unknown transcriptomic markers that consistently differentiate between subsets, including a few Treg-specific splicing patterns. Together this work highlights the heterogeneity in gene expression between isolates of the same subtype, but also suggests additional markers that can be used to define functional groupings.


2020 ◽  
Vol 36 (12) ◽  
pp. 3907-3909 ◽  
Author(s):  
Ruijia Wang ◽  
Bin Tian

Abstract Summary Most eukaryotic genes produce alternative polyadenylation (APA) isoforms. APA is dynamically regulated under different growth and differentiation conditions. Here, we present a bioinformatics package, named APAlyzer, for examining 3′UTR APA, intronic APA and gene expression changes using RNA-seq data and annotated polyadenylation sites in the PolyA_DB database. Using APAlyzer and data from the GTEx database, we present APA profiles across human tissues. Availability and implementation APAlyzer is freely available at https://bioconductor.org/packages/release/bioc/html/APAlyzer.html as an R/Bioconductor package. Supplementary information Supplementary data are available at Bioinformatics online.


2018 ◽  
Vol 20 (6) ◽  
pp. 2044-2054 ◽  
Author(s):  
Adam McDermaid ◽  
Brandon Monier ◽  
Jing Zhao ◽  
Bingqiang Liu ◽  
Qin Ma

Abstract Differential gene expression (DGE) analysis is one of the most common applications of RNA-sequencing (RNA-seq) data. This process allows for the elucidation of differentially expressed genes across two or more conditions and is widely used in many applications of RNA-seq data analysis. Interpretation of the DGE results can be nonintuitive and time consuming due to the variety of formats based on the tool of choice and the numerous pieces of information provided in these results files. Here we reviewed DGE results analysis from a functional point of view for various visualizations. We also provide an R/Bioconductor package, Visualization of Differential Gene Expression Results using R, which generates information-rich visualizations for the interpretation of DGE results from three widely used tools, Cuffdiff, DESeq2 and edgeR. The implemented functions are also tested on five real-world data sets, consisting of one human, one Malus domestica and three Vitis riparia data sets.


2018 ◽  
Vol 4 (Supplement 2) ◽  
pp. 95s-95s
Author(s):  
A. Nandan ◽  
V. Sharma ◽  
H. Singh ◽  
A. Chandra ◽  
R. Tripathi ◽  
...  

Background: Alternate splicing (AS) is a regulatory process during gene expression that allows a single gene to code multiple proteins. Sequencing of RNA (RNA-Seq) is a high throughput technology, which has been used in various studies to identify AS mechanisms in head and neck cancer (HNC). Until date, there is no available review that could update us with the major outcomes from these studies. Aim: To perform a comprehensive literature search for AS studies on HNC via RNA-Seq. Methods: A systematic literature search was performed following PRISMA guidelines to give a complete picture of AS in HNC identified through RNA-Seq. In addition, comprehensive search was also performed to identify the bioinformatics softwares that analyses RNA-Seq data for finding AS in cancer. Results: Six studies were found that used RNA-Seq data for identifying AS events in HNC. Five softwares were used by these studies to identify AS events, of which Suppa and AltAnalyze can also categorize all four AS events to subtypes, i.e., cassette exon skipping (ES), intron retention (IR), mutually exclusive exon (MXE), and alternative 5′ and 3′ splice site (ASS). Additionally, SplAdder, ASprofile, JuncBASE, and MATS softwares have been used to identify and categorize AS events in cancers other than HNC. Conclusion: Alternate splicing in HNC is a complex regulatory process of gene expression. It can be studied through RNA-Seq using various bioinformatics softwares. SplAdder, ASprofile, JuncBASE, and MATS have been used to identify and characterize other cancers, but not implemented in HNC, and hence could be used for studying AS in HNC.


Author(s):  
I. Tsers ◽  
V. Gorshkov ◽  
N. Gogoleva ◽  
Y. Gogolev

We propose an algorithm for RNA-Seq data analysis useful for revealing the “master regulators” of gene expression in experimental condition, as well as of cis-elements regulating transcript level of genes from certain groups.


2016 ◽  
Author(s):  
Huijuan Feng ◽  
Tingting Li ◽  
Xuegong Zhang

AbstractBackgroundAlternative splicing is a ubiquitous post-transcriptional process in most eukaryotic genes. Aberrant splicing isoforms and abnormal isoform ratios can contribute to cancer development. Kinase genes are key regulators of various cellular processes. Many kinases are found to be oncogenic and have been intensively investigated in the study of cancer and drugs. RNA-Seq provides a powerful technology for genome-wide study of alternative splicing in cancer besides the conventional gene expression profiling. But this potential has not been fully demonstrated yet.MethodsHere we characterized the transcriptome profile of prostate cancer using RNA-Seq data from viewpoints of both differential expression and differential splicing, with an emphasis on kinase genes and their splicing variations. We built up a pipeline to conduct differential expression and differential splicing analysis. Further functional enrichment analysis was performed to explore functional interpretation of the genes. With focus on kinase genes, we performed kinase domain analysis to identify the functionally important candidate kinase gene in prostate cancer. We further calculated the expression level of isoforms to explore the function of isoform switching of kinase genes in prostate cancer.ResultsWe identified distinct gene groups from differential expression and splicing analysis, which suggested that alternative splicing adds another level to gene expression regulation. Enriched GO terms of differentially expressed and spliced kinase genes were found to play different roles in regulation of cellular metabolism. Function analysis on differentially spliced kinase genes showed that differentially spliced exons of these genes are significantly enriched in protein kinase domains. Among them, we found that gene CDK5 has isoform switching between prostate cancer and benign tissues, which may affect cancer development by changing androgen receptor (AR) phosphorylation. The observation was validated in another RNA-Seq dataset of prostate cancer cell lines.ConclusionsOur work characterized the expression and splicing profile of kinase genes in prostate cancer and proposed a hypothetical model on isoform switching of CDK5 and AR phosphorylation in prostate cancer. These findings bring new understanding to the role of alternatively spliced kinases in prostate cancer and demonstrate the use of RNA-Seq data in studying alternative splicing in cancer.


2013 ◽  
Vol 7 (1) ◽  
pp. 48-67 ◽  
Author(s):  
Juhee Lee ◽  
Yuan Ji ◽  
Shoudan Liang ◽  
Guoshuai Cai ◽  
Peter Müller

2019 ◽  
Vol 12 (1) ◽  
Author(s):  
Nathan D. Pennock ◽  
Sonali Jindal ◽  
Wesley Horton ◽  
Duanchen Sun ◽  
Jayasri Narasimhan ◽  
...  

Abstract Background Formalin-fixed, paraffin-embedded (FFPE) tissues for RNA-seq have advantages over fresh frozen tissue including abundance and availability, connection to rich clinical data, and association with patient outcomes. However, FFPE-derived RNA is highly degraded and chemically modified, which impacts its utility as a faithful source for biological inquiry. Methods True archival FFPE breast cancer cases (n = 58), stored at room temperature for 2–23 years, were utilized to identify key steps in tissue selection, RNA isolation, and library choice. Gene expression fidelity was evaluated by comparing FFPE data to public data obtained from fresh tissues, and by employing single-gene, gene set and transcription network-based regulon analyses. Results We report a single 10 μm section of breast tissue yields sufficient RNA for RNA-seq, and a relationship between RNA quality and block age that was not linear. We find single-gene analysis is limiting with FFPE tissues, while targeted gene set approaches effectively distinguish ER+ from ER- breast cancers. Novel utilization of regulon analysis identified the transcription factor KDM4B to associate with ER+ disease, with KDM4B regulon activity and gene expression having prognostic significance in an independent cohort of ER+ cases. Conclusion Our results, which outline a robust FFPE-RNA-seq pipeline for broad use, support utilizing FFPE tissues to address key questions in the breast cancer field, including the delineation between indolent and life-threatening disease, biological stratification and molecular mechanisms of treatment resistance.


Blood ◽  
2018 ◽  
Vol 132 (Supplement 1) ◽  
pp. 4454-4454
Author(s):  
Michael A Bauer ◽  
Cody Ashby ◽  
Christopher Wardell ◽  
Maria Ortiz ◽  
Erin Flynt ◽  
...  

Abstract Introduction: Mutations in the components of the spliceosome have been shown to occur at relatively high frequency in many cancers such as chronic lymphocytic leukemia, myelodysplastic syndromes and breast cancer. One component in particular, encoded by SF3B1, has hotspot missense mutations that result in a significant increase in alternatively spliced transcripts. RNA splicing in Multiple Myeloma (MM) has not been investigated and in particular the extent of mutations in SF3B1 and its effects on the transcriptome. Methods: Using the MMRF CoMMpass dataset (N=1273) of newly diagnosed MM patients, samples with whole exome sequencing (WES) were analyzed for mutations using Strelka and Mutect, and samples with SF3B1 mutations identified. A range of approaches were used to explore the effect of the SF3B1 mutations on the transcriptome and to determine possible downstream effects. Using RNA-seq with matched WES samples (n=615), the splice junction usage of SF3B1 mutants was compared against non-mutated samples which were matched for key MM molecular sub-types. The RNA-seq data was analyzed using a pipeline that included STAR and Salmon, aligning to human reference genome hg38, gene and transcript differential expression analysis tools DESeq2 and StringTie/Ballgown, differential splicing exon usage tools JunctionSeq/QoRTs, DEXSeq, and SUPPA and for Gene Set Enrichment Analysis (GSEA) the R package FGSEA was used. Results: From the WES data 1.7% (22/1273) of samples had mutations in SF3B1 of which 5 had mutations in the hotpot codons of K666 and K700. Differential isoform analysis of the 22 SF3B1 mutant samples compared to non-mutated samples did not identify any transcripts. However, when the analysis was restricted to the 5 samples with hotspot mutations differential gene expression identified 146 genes that were significantly differentially expressed at an adjusted p-value <0.05. Additionally, many genes that did not show an overall gene expression change between the control and the SF3B1 hotspot mutants did at transcriptional level where we observed isoform switching which included the protein coding genes BCL2L1, SNUR, ACKR3 and CRLF2. Results of differential gene analysis between the control and SF3B1 mutants were used in GSEA and significant normalized enrichment scores (NES) identifying increased protein secretion (p-value =0.009, NES= 1.9) and unfolded protein response (UPR) (p-value = 0.02, NES = 1.52) pathways. Conversely GSEA identified decreased apoptosis (p-value = 0.008, NES = -1.76), KRAS signaling (p-value = 0.008, NES = -1.92), TNFA signaling via NF-κB (p-value = 0.008, NES= 2.12) pathways in SF3B1 mutant samples. Investigation of splicing loci revealed that novel splice loci were significantly more abundant in the SF3B1 mutants versus control samples. Differential splicing analysis detected 474 genes to be significantly differentially spliced and of those 311 were not found to be differentially expressed at the gene level, indicating that alternative splicing is as important alternative mechanism to gene expression differences. 59 novel splice sites were identified, as well as 152 known splice sites and 218 exon significant differential usage with a p-value of < 0.05. The genes with most significant levels of alternative splicing and found by more than one approach were DYNLL1, TMEM14C, CRNDE, BRD4 and BCL2L1, several of which are also seen in other cancers with mutated SF3B1. Conclusions: Hotspot mutations in SF3B1 result in alternative splicing of genes as well as the introduction of novel splice sites. The confirmation that SF3B1 hotspot mutations in MM increases alternative splicing as well as the identification of the genes undergoing alternative splicing may present novel therapeutic targets. Gene expression analysis of these samples identifies key deregulated pathways, perhaps in response to alternative splicing, including the UPR and protein secretion pathways. These analyses indicate that disruption of these pathways are potential avenues of therapeutic intervention in patients with SF3B1 mutations. Disclosures Ortiz: Celgene Corporation: Employment, Equity Ownership. Flynt:Celgene Corporation: Employment, Equity Ownership. Thakurta:Celgene Corporation: Employment, Equity Ownership. Morgan:Celgene: Consultancy, Honoraria, Research Funding; Takeda: Consultancy, Honoraria; Janssen: Research Funding; Bristol-Myers Squibb: Consultancy, Honoraria.


Sign in / Sign up

Export Citation Format

Share Document