scholarly journals Putatively cancer-specific alternative splicing is shared across patients and present in developmental and other non-cancer cells

2019 ◽  
Author(s):  
Julianne K. David ◽  
Sean K. Maden ◽  
Benjamin R. Weeder ◽  
Reid F. Thompson ◽  
Abhinav Nellore

ABSTRACTWe compared cancer and non-cancer RNA sequencing (RNA-seq) data from The Cancer Genome Atlas (TCGA), the Genotype-Tissue Expression (GTEx) Project, and the Sequence Read Archive (SRA). We found that: 1) averaging across cancer types, 80.6% of exon-exon junctions thought to be cancer-specific based on comparison with tissue-matched samples are in fact present in other adult non-cancer tissues throughout the body; 2) 30.8% of junctions not present in any GTEx or TCGA normal tissues are shared by multiple samples within at least one cancer type cohort, and 87.4% of these distinguish between different cancer types; and 3) many of these junctions not found in GTEx or TCGA normal tissues (15.4% on average) are also found in embryological and other developmentally associated cells. This study probes the distribution of putatively cancer-specific junctions across a broad set of publicly available non-cancer human RNA-seq datasets. Overall, we identify a subset of shared cancer-specific junctions that could represent novel sources of cancer neoantigens. We further describe a framework for characterizing possible origins of these junctions, including potential developmental and embryological sources, as well as cell type-specific markers particularly related to cell types of cancer origin. These findings refine the meaning of RNA splicing event novelty, particularly with respect to the human neoepitope repertoire. Ultimately, cancer-specific exon-exon junctions may affect the anti-cancer immune response and may have a substantial causal relationship with the biology of disease.

NAR Cancer ◽  
2020 ◽  
Vol 2 (1) ◽  
Author(s):  
Julianne K David ◽  
Sean K Maden ◽  
Benjamin R Weeder ◽  
Reid F Thompson ◽  
Abhinav Nellore

Abstract This study probes the distribution of putatively cancer-specific junctions across a broad set of publicly available non-cancer human RNA sequencing (RNA-seq) datasets. We compared cancer and non-cancer RNA-seq data from The Cancer Genome Atlas (TCGA), the Genotype-Tissue Expression (GTEx) Project and the Sequence Read Archive. We found that (i) averaging across cancer types, 80.6% of exon–exon junctions thought to be cancer-specific based on comparison with tissue-matched samples (σ = 13.0%) are in fact present in other adult non-cancer tissues throughout the body; (ii) 30.8% of junctions not present in any GTEx or TCGA normal tissues are shared by multiple samples within at least one cancer type cohort, and 87.4% of these distinguish between different cancer types; and (iii) many of these junctions not found in GTEx or TCGA normal tissues (15.4% on average, σ = 2.4%) are also found in embryological and other developmentally associated cells. These findings refine the meaning of RNA splicing event novelty, particularly with respect to the human neoepitope repertoire. Ultimately, cancer-specific exon–exon junctions may have a substantial causal relationship with the biology of disease.


2017 ◽  
Author(s):  
Zhuyi Xue ◽  
René L Warren ◽  
Ewan A Gibb ◽  
Daniel MacMillan ◽  
Johnathan Wong ◽  
...  

AbstractAlternative polyadenylation (APA) of 3’ untranslated regions (3’ UTRs) has been implicated in cancer development. Earlier reports on APA in cancer primarily focused on 3’ UTR length modifications, and the conventional wisdom is that tumor cells preferentially express transcripts with shorter 3’ UTRs. Here, we analyzed the APA patterns of 114 genes, a select list of oncogenes and tumor suppressors, in 9,939 tumor and 729 normal tissue samples across 33 cancer types using RNA-Seq data from The Cancer Genome Atlas, and we found that the APA regulation machinery is much more complicated than what was previously thought. We report 77 cases (gene-cancer type pairs) of differential 3’ UTR cleavage patterns between normal and tumor tissues, involving 33 genes in 13 cancer types. For 15 genes, the tumor-specific cleavage patterns are recurrent across multiple cancer types. While the cleavage patterns in certain genes indicate apparent trends of 3’ UTR shortening in tumor samples, over half of the 77 cases imply 3’ UTR length change trends in cancer that are more complex than simple shortening or lengthening. This work extends the current understanding of APA regulation in cancer, and demonstrates how large volumes of RNA-seq data generated for characterizing cancer cohorts can be mined to investigate this process.


2021 ◽  
Author(s):  
Guangnan Wei ◽  
Yuchen Zhang ◽  
Hongkai Zhuang ◽  
Yingzi Li ◽  
Chongyang Ren ◽  
...  

Abstract Background: A member of histone lysine methyltransferases subfamily, The histone 3 lysine 4 (H3K4) monomethylase KMT2C, has mutations across many cancer types. However, the role of KMT2C in different cancers and its correlation with tumor infiltration and immune therapy indicators remain unknown.Method: Expression and mutation information of KMT2C has been analyzed through the Genotype-Tissue Expression (GTEx), The Cancer Genome Atlas (TCGA) Cancer Cell Line Encyclopedia (CCLE) and International Cancer Genome Consortium (ICGC) database in our study. Prognostic value of KMT2C was evaluated via univariate survival analysisand expression detection in different cancer cells. Result: Survival analysis showed that high expression of KMT2C in some cancer type may be a indication of better outcome, while in other cancer like UVM, patient with high expression of KMT2C suffered from early recurrence. Further, we found there is a strongly link between KMT2C expression and immune cells infiltration, mutation indicators through analysising in the Tumor Immune Evaluation Resource (TIMER) database. Conclusion: The bioinformatics analysis here deliver us a message that KMT2C might be a good molecular biomarker for prognostic and therapeutic evaluation in specific cancer types.


2021 ◽  
Author(s):  
Ioannis Kavakiotis ◽  
Athanasios Alexiou ◽  
Spyros Tastsoglou ◽  
Ioannis S Vlachos ◽  
Artemis G Hatzigeorgiou

Abstract microRNAs (miRNAs) are short (∼23nt) single-stranded non-coding RNAs that act as potent post-transcriptional gene expression regulators. Information about miRNA expression and distribution across cell types and tissues is crucial to the understanding of their function and for their translational use as biomarkers or therapeutic targets. DIANA-miTED is the most comprehensive and systematic collection of miRNA expression values derived from the analysis of 15 183 raw human small RNA-Seq (sRNA-Seq) datasets from the Sequence Read Archive (SRA) and The Cancer Genome Atlas (TCGA). Metadata quality maximizes the utility of expression atlases, therefore we manually curated SRA and TCGA-derived information to deliver a comprehensive and standardized set, incorporating in total 199 tissues, 82 anatomical sublocations, 267 cell lines and 261 diseases. miTED offers rich instant visualizations of the expression and sample distributions of requested data across variables, as well as study-wide diagrams and graphs enabling efficient content exploration. Queries also generate links towards state-of-the-art miRNA functional resources, deeming miTED an ideal starting point for expression retrieval, exploration, comparison, and downstream analysis, without requiring bioinformatics support or expertise. DIANA-miTED is freely available at http://www.microrna.gr/mited.


Cancers ◽  
2020 ◽  
Vol 12 (9) ◽  
pp. 2476 ◽  
Author(s):  
Shaoli Das ◽  
Kevin Camphausen ◽  
Uma Shankavaram

To elucidate the role of immune cell infiltration as a prognostic signature in solid tumors, we analyzed immune-function-related genes from four publicly available single-cell RNA-Seq data sets and twenty bulk tumor RNA-Seq data sets from The Cancer Genome Atlas (TCGA). Unsupervised clustering of pan-cancer transcriptomic signature showed two major immune function types: one related to NK-, T-, and B-cell functions and the other related to monocyte, macrophage, dendritic cell, and Toll-like receptor functions. Kaplan–Meier analysis showed differential prognosis of these two groups, dependent on the cancer type. Our analysis of TCGA solid tumors with an elastic net model identified 155 genes associated with disease-free survival in different tumor types with varied influence across different cancer types. With this gene set, we computed cancer-specific prognostic immune score models for individual cancer types that predicted disease-free and overall survival. Validation of our model on available published data of immune checkpoint blockade therapies on melanoma, kidney renal cell carcinoma, non-small cell lung cancer, gastric cancer and bladder cancer confirmed that cancer-specific higher immune scores are associated with response to immunotherapy. Our analysis provides a comprehensive map of cancer-specific immune-related prognostic gene sets that are associated with immunotherapy response.


2021 ◽  
Author(s):  
H. Robert Frost

AbstractThe genetic alterations that underlie cancer development are highly tissue-specific with the majority of driving alterations occurring in only a few cancer types and with alterations common to multiple cancer types often showing a tissue-specific functional impact. This tissue-specificity means that the biology of normal tissues carries important information regarding the pathophysiology of the associated cancers, information that can be leveraged to improve the power and accuracy of cancer genomic analyses. Research exploring the use of normal tissue data for the analysis of cancer genomics has primarily focused on the paired analysis of tumor and adjacent normal samples. Efforts to leverage the general characteristics of normal tissue for cancer analysis has received less attention with most investigations focusing on understanding the tissue-specific factors that lead to individual genomic alterations or dysregulated pathways within a single cancer type. To address this gap and support scenarios where adjacent normal tissue samples are not available, we explored the genome-wide association between the transcriptomes of 21 solid human cancers and their associated normal tissues as profiled in healthy individuals. While the average gene expression profiles of normal and cancerous tissue may appear distinct, with normal tissues more similar to other normal tissues than to the associated cancer types, when transformed into relative expression values, i.e., the ratio of expression in one tissue or cancer relative to the mean in other tissues or cancers, the close association between gene activity in normal tissues and related cancers is revealed. As we demonstrate through an analysis of tumor data from The Cancer Genome Atlas and normal tissue data from the Human Protein Atlas, this association between tissue-specific and cancer-specific expression values can be leveraged to improve the prognostic modeling of cancer, the comparative analysis of different cancer types, and the analysis of cancer and normal tissue pairs.


2021 ◽  
Vol 11 ◽  
Author(s):  
Wencheng Zhang ◽  
Zhouyong Gao ◽  
Mingxiu Guan ◽  
Ning Liu ◽  
Fanjie Meng ◽  
...  

Anti-silencing function 1B histone chaperone (ASF1B) is known to be an important modulator of oncogenic processes, yet its role in lung adenocarcinoma (LUAD) remains to be defined. In this study, an integrated assessment of The Cancer Genome Atlas (TCGA) and genotype-tissue expression (GTEx) datasets revealed the overexpression of ASF1B in all analyzed cancer types other than LAML. Genetic, epigenetic, microsatellite instability (MSI), and tumor mutational burden (TMB) analysis showed that ASF1B was regulated by single or multiple factors. Kaplan-Meier survival curves suggested that elevated ASF1B expression was associated with better or worse survival in a cancer type-dependent manner. The CIBERSORT algorithm was used to evaluate immune microenvironment composition, and distinct correlations between ASF1B expression and immune cell infiltration were evident when comparing tumor and normal tissue samples. Gene set enrichment analysis (GSEA) indicated that ASF1B was associated with proliferation- and immunity-related pathways. Knocking down ASF1B impaired the proliferation, affected cell cycle distribution, and induced cell apoptosis in LUAD cell lines. In contrast, ASF1B overexpression had no impact on the malignant characteristics of LUAD cells. At the mechanistic level, ASF1B served as an indirect regulator of DNA Polymerase Epsilon 3, Accessory Subunit (POLE3), CDC28 protein kinase regulatory subunit 1(CKS1B), Dihydrofolate reductase (DHFR), as established through proteomic profiling and Immunoprecipitation-Mass Spectrometry (IP-MS) analyses. Overall, these data suggested that ASF1B serves as a tumor promoter and potential target for cancer therapy and provided us with clues to better understand the importance of ASF1B in many types of cancer.


Author(s):  
Pora Kim ◽  
Mengyuan Yang ◽  
Ke Yiya ◽  
Weiling Zhao ◽  
Xiaobo Zhou

AbstractExon skipping (ES) is reported to be the most common alternative splicing event due to loss of functional domains/sites or shifting of the open reading frame (ORF), leading to a variety of human diseases and considered therapeutic targets. To date, systematic and intensive annotations of ES events based on the skipped exon units in cancer and normal tissues are not available. Here, we built ExonSkipDB, the ES annotation database available at https://ccsm.uth.edu/ExonSkipDB/, aiming to provide a resource and reference for functional annotation of ES events in multiple cancer and tissues to identify therapeutically targetable genes in individual exon units. We collected 14 272 genes that have 90 616 and 89 845 ES events across 33 cancer types and 31 normal tissues from The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx). For the ES events, we performed multiple functional annotations. These include ORF assignment of exon skipped transcript, studies of lost protein functional features due to ES events, and studies of exon skipping events associated with mutations and methylations based on multi-omics evidence. ExonSkipDB will be a unique resource for cancer and drug research communities to identify therapeutically targetable exon skipping events.


2019 ◽  
Vol 37 (15_suppl) ◽  
pp. e13032-e13032 ◽  
Author(s):  
Anton Buzdin ◽  
Andrew Garazha ◽  
Maxim Sorokin ◽  
Alex Glusker ◽  
Alexey Aleshin ◽  
...  

e13032 Background: Intracellular molecular pathways (IMPs) control all major events in the living cell. They are considered hotspots in contemporary oncology because knowledge of IMPs activation is essential for understanding mechanisms of molecular pathogenesis in oncology. Profiling IMPs requires RNA-seq data for tumors and for a collection of reference normal tissues. However, there is a shortage now in such profiles for normal tissues from healthy human donors, uniformly profiled in a single series of experiments. Access to the largest dataset of normal profiles GTEx is only partly available through the dbGaP. In TCGA database, norms are adjacent to surgically removed tumors and may be affected by tumor-linked growth factors, inflammation and altered vascularization. ENCODE datasets were for the autopsies of normal tissues, but they can’t form statistically significant reference groups. Methods: Tissue samples representing 20 organs were taken from post-mortal human healthy donors killed in road accidents no later than 36 hours after death, blood samples were taken from healthy volunteers. Gene expression was profiled in RNA-seq experiments using the same reagents, equipment and protocols. Bioinformatic algorithms for IMP analysis were developed and validated using experimental and public gene expression datasets. Results: From original sequencing data we constructed the biggest fully open reference expression database of normal human tissues including 465 profiles termed Oncobox Atlas of Normal Tissue Expression (ANTE, original data: GSE120795). We next developed a method termed Oncobox for interrogating activation of IMPs in human cancers. It includes modules of expression data harmonization and comparison and an algorithm for automatic annotation of molecular pathways. The Oncobox system enables accurate scoring of thousands molecular pathways using RNA-seq data. Oncobox pathway analysis is also applicable for quantitative proteomics and microRNA data in oncology. Conclusions: The Oncobox system can be used for a plethora of applications in cancer research including finding differentially regulated genes and IMPs, and for discovery of new pathway-related diagnostic and prognostic biomarkers.


Author(s):  
Kim M. Summers ◽  
Stephen J. Bush ◽  
David A. Hume

AbstractThe mononuclear phagocyte system (MPS) is a family of cells including progenitors, circulating blood monocytes, resident tissue macrophages and dendritic cells (DC) present in every tissue in the body. To test the relationships between markers and transcriptomic diversity in the MPS, we collected from NCBI-GEO >500 quality RNA-seq datasets generated from mouse MPS cells isolated from multiple tissues. The primary data were randomly down-sized to a depth of 10 million reads and requantified. The resulting dataset was clustered using the network analysis tool Graphia. A sample-to-sample matrix revealed that MPS populations could be separated based upon tissue of origin. Cells identified as classical DC subsets, cDC1 and cDC2, and lacking Fcgr1 (CD64), were centrally-located within the MPS cluster and no more distinct than other MPS cell types. A gene-to-gene correlation matrix identified large generic co-expression clusters associated with MPS maturation and innate immune function. Smaller co-expression gene clusters including the transcription factors that drive them showed higher expression within defined isolated cells, including macrophages and DC from specific tissues. They include a cluster containing Lyve1 that implies a function in endothelial cell homeostasis, a cluster of transcripts enriched in intestinal macrophages and a generic cDC cluster associated with Ccr7. However, transcripts encoding many other putative MPS subset markers including Adgre1, Itgax, Itgam, Clec9a, Cd163, Mertk, Retnla and H2-a/e (class II MHC) clustered idiosyncratically and were not correlated with underlying functions. The data provide no support for the concept of markers of M2 polarization or the specific adaptation of DC to present antigen to T cells. Co-expression of immediate early genes (e.g. Egr1, Fos, Dusp1) and inflammatory cytokines and chemokines (Tnf, Il1b, Ccl3/4) indicated that all tissue disaggregation protocols activate MPS cells. Tissue-specific expression clusters indicated that all cell isolation procedures also co-purify other unrelated cell types that may interact with MPS cells in vivo. Comparative analysis of public RNA-seq and single cell RNA-seq data from the same lung cell populations showed that the extensive heterogeneity implied by the global cluster analysis may be even greater at a single cell level with few markers strongly correlated with each other. This analysis highlights the power of large datasets to identify the diversity of MPS cellular phenotypes, and the limited predictive value of surface markers to define lineages, functions or subpopulations.


Sign in / Sign up

Export Citation Format

Share Document