Uncover Cancer Genomics by Jointly Analysing Aneuploidy and Gene Expression

Analyzing cancer gene expression data through the lens of normal tissue-specificity

10.1101/2021.01.25.428166 ◽

2021 ◽

Author(s):

H. Robert Frost

Keyword(s):

Gene Expression ◽

Normal Tissue ◽

Tissue Specificity ◽

Cancer Genomics ◽

Expression Profiles ◽

Genetic Alterations ◽

Cancer Type ◽

Tissue Specific ◽

Normal Tissues ◽

Cancer Types

AbstractThe genetic alterations that underlie cancer development are highly tissue-specific with the majority of driving alterations occurring in only a few cancer types and with alterations common to multiple cancer types often showing a tissue-specific functional impact. This tissue-specificity means that the biology of normal tissues carries important information regarding the pathophysiology of the associated cancers, information that can be leveraged to improve the power and accuracy of cancer genomic analyses. Research exploring the use of normal tissue data for the analysis of cancer genomics has primarily focused on the paired analysis of tumor and adjacent normal samples. Efforts to leverage the general characteristics of normal tissue for cancer analysis has received less attention with most investigations focusing on understanding the tissue-specific factors that lead to individual genomic alterations or dysregulated pathways within a single cancer type. To address this gap and support scenarios where adjacent normal tissue samples are not available, we explored the genome-wide association between the transcriptomes of 21 solid human cancers and their associated normal tissues as profiled in healthy individuals. While the average gene expression profiles of normal and cancerous tissue may appear distinct, with normal tissues more similar to other normal tissues than to the associated cancer types, when transformed into relative expression values, i.e., the ratio of expression in one tissue or cancer relative to the mean in other tissues or cancers, the close association between gene activity in normal tissues and related cancers is revealed. As we demonstrate through an analysis of tumor data from The Cancer Genome Atlas and normal tissue data from the Human Protein Atlas, this association between tissue-specific and cancer-specific expression values can be leveraged to improve the prognostic modeling of cancer, the comparative analysis of different cancer types, and the analysis of cancer and normal tissue pairs.

Download Full-text

Mutation Load and a Functional T Effector Signature May Distinguish Immunologically Distinct and Clinically Relevant Lymphoma Subsets

Blood ◽

10.1182/blood.v128.22.913.913 ◽

2016 ◽

Vol 128 (22) ◽

pp. 913-913

Author(s):

Chris Bolen ◽

Ronald McCord ◽

Garrett Frampton ◽

Richard Bourgon ◽

Elizabeth A. Punnoose ◽

...

Keyword(s):

Gene Expression ◽

T Cell ◽

Research Funding ◽

Cancer Genomics ◽

International Prognostic Index ◽

Significant Benefit ◽

Newly Diagnosed ◽

Mutation Load ◽

Rituximab Maintenance ◽

Effector Gene

Abstract Introduction: Recent studies linking cancer genomics and immunity have reinforced the concepts that some mutations trigger T cell effector responses and that the likelihood of an immunogenic mutation increases with increasing mutation load. Importantly, these data highlight the potential utility of such markers in identifying patient subsets likely to respond to cancer immunotherapies. This study investigated the clinical impact of mutation load and its association with T cell gene expression in newly diagnosed patients with follicular lymphoma (FL). Methods: We used clinical and genomic data from FL patients (n = 249; 216 with follow-up information) with evaluable pre-treatment tumor tissue who were treated in a randomized study of rituximab maintenance vs observation (PRIMA; ClinicalTrials.gov ID: NCT00140582). We estimated mutation load per megabase (Mb) as a proxy for neoantigen formation using FoundationOne Heme (Foundation Medicine, Inc). We quantified expression of T cytotoxic effector genes (GZMA, GZMB, PRF1, IFNG, EOMES, CD8A) as a surrogate for pre-existing immunity (and the inflammatory state of the tumor) using TruSeq (Illumina, Inc) RNA seq (n = 142). We used Cox regression to examine associations between these markers and progression-free survival (PFS), adjusting for the FL International Prognostic Index, age, sex, treatment arm and response to induction therapy. Pvalues were calculated for exploratory purposes. Results: The mutation load estimate among newly diagnosed patients with FL was highly variable (range, 0-33 mutations/Mb; Q1: 4.2; median: 6.6; Q3: 10.0). Patients with > 15 mutations/Mb (n = 19) were considered to have a high probability of neoantigen formation, and the remaining patients were stratified into mutation-low (< 6.6 mutations/Mb; n = 112) or mutation-mid (≥ 6.6 mutations/Mb and ≤ 15 mutations/Mb; n = 85) groups. The 3-year PFS in patients with high mutation load was 83% compared with 66% for mid-mutation load and 68% for low-mutation load groups, but mutation load was not independently prognostic in either the rituximab (P = .13) or observation (P = .66) arms. Of note, 92% of FL patients with high mutation load (n = 12/13) also had high T-effector gene expression compared with 49% of those with midlevel (n = 24/49) and 44% of those with low mutation load (n = 35/80) (P = .001). Mutation load was also associated with benefit from rituximab maintenance: FL patients with low mutation load experienced a significant benefit from rituximab maintenance (HR, 0.29 [95% CI, 0.15-0.54]; P < .001), whereas no statistically significant benefit was seen among FL patients with medium (HR, 0.81 [95% CI, 0.43-1.5]; P = .51) or high mutation load (HR, 0.29 [95% CI, 0.026-3.3]; P = .32). Importantly, the T/NK gene signature was prognostic as a continuous predictor (P = .008) and clearly separated 2 large groups of FL patients into an "inflamed" subset (T-effector signature high; n = 74) and an "uninflamed" subset (T-effector signature low; n = 75), with longer PFS seen in the "inflamed" FL subset (PFS HR, 0.39 [95% CI, 0.21-0.70]; P = .002). T-effector gene expression may be particularly useful for identifying the immunologically primed FL subset among patients with low/mid mutation load: there was a trend in 3-year PFS in 84.4% vs 56.6% for T-effector-high vs T-effector-low among low-mutation load patients (P = .002) and 76.2% vs 58.3% of mid-mutation load patients (P = .17), respectively. The subset of inflamed (T-effector signature high) FL tumors also demonstrated high expression of IDO1, which similarly correlated with longer PFS (HR, 0.25 [95% CI, 0.14-0.45]; P < .001), and a strong correlation was observed between IDO1 and IFNG expression (R2 = 0.61; P< .001). This is consistent with an interplay of pro- and anti-inflammatory immunity, wherein pro-inflammatory IFNγ drives the clinical outcome. Conclusions: Collectively, our results suggest that mutation load and T-effector gene expression may help identify immunologically distinct lymphoma subsets appropriate for modern immunotherapies. Disclosures Bolen: Genentech, Inc.: Employment. McCord:Genentech, Inc.: Employment. Frampton:Foundation Medicine: Employment, Equity Ownership. Bourgon:Genentech, Inc.: Employment; F Hoffman-La Roche: Other: Shareholder. Punnoose:Genetech, Inc.: Employment. Szafer-Glusman:Genentech, Inc.: Employment. Xerri:Novartis: Honoraria. Salles:Gilead: Honoraria, Research Funding; Janssen: Consultancy, Honoraria; Celgene: Consultancy, Honoraria; Mundipharma: Honoraria; Roche/Genentech: Consultancy, Honoraria, Research Funding; Amgen: Consultancy, Honoraria; Novartis: Consultancy, Honoraria. Venstrom:Genentech: Employment.

Download Full-text

Effect of method of deduplication on estimation of differential gene expression using RNA-seq

PeerJ ◽

10.7717/peerj.3091 ◽

2017 ◽

Vol 5 ◽

pp. e3091 ◽

Cited By ~ 10

Author(s):

Anna V. Klepikova ◽

Artem S. Kasianov ◽

Mikhail S. Chesnokov ◽

Natalia L. Lazarevich ◽

Aleksey A. Penin ◽

...

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Cancer Genomics ◽

False Negative ◽

Read Length ◽

Rna Seq ◽

Negative Results ◽

Paired Samples ◽

Highly Expressed Genes ◽

Differential Gene

BackgroundRNA-seq is a useful tool for analysis of gene expression. However, its robustness is greatly affected by a number of artifacts. One of them is the presence of duplicated reads.ResultsTo infer the influence of different methods of removal of duplicated reads on estimation of gene expression in cancer genomics, we analyzed paired samples of hepatocellular carcinoma (HCC) and non-tumor liver tissue. Four protocols of data analysis were applied to each sample: processing without deduplication, deduplication using a method implemented in samtools, and deduplication based on one or two molecular indices (MI). We also analyzed the influence of sequencing layout (single read or paired end) and read length. We found that deduplication without MI greatly affects estimated expression values; this effect is the most pronounced for highly expressed genes.ConclusionThe use of unique molecular identifiers greatly improves accuracy of RNA-seq analysis, especially for highly expressed genes. We developed a set of scripts that enable handling of MI and their incorporation into RNA-seq analysis pipelines. Deduplication without MI affects results of differential gene expression analysis, producing a high proportion of false negative results. The absence of duplicate read removal is biased towards false positives. In those cases where using MI is not possible, we recommend using paired-end sequencing layout.

Download Full-text

Screening and identification of key biomarkers of papillary renal cell carcinoma by bioinformatic analysis

PLoS ONE ◽

10.1371/journal.pone.0254868 ◽

2021 ◽

Vol 16 (8) ◽

pp. e0254868

Author(s):

Yingying Xu ◽

Deyang Kong ◽

Zhongtang Li ◽

Lingling Qian ◽

Junchao Li ◽

...

Keyword(s):

Gene Expression ◽

Renal Cell Carcinoma ◽

Cell Carcinoma ◽

Renal Cell ◽

Molecular Mechanisms ◽

Kidney Development ◽

Cancer Genomics ◽

Papillary Renal Cell Carcinoma ◽

Tissue Type ◽

Hub Genes

Background Papillary renal cell carcinoma (PRCC) is the most common type of renal cell carcinoma after clear cell renal cell carcinoma (ccRCC). Its pathological classification is controversial, and its molecular mechanism is poorly understood. Therefore, the identification of key genes and their biological pathways is of great significance to elucidate the molecular mechanisms of PRCC occurrence and progression. Methods The PRCC-related datasets GSE7023, GSE48352 and GSE15641 were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) were identified, and gene ontology (GO) term enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed. Cytoscape and STRING were used to construct the protein-protein interaction network (PPI) and perform module analysis to identify hub genes and key pathways. A heatmap of hub genes was constructed using the UCSC cancer genomics browser. Overall survival and recurrence-free survival of patients stratified by the expression levels of hub genes were analysed using Kaplan-Meier Plotter. The online database UALCAN was applied to analyse gene expression based on tissue type, stage, subtype and race. Results A total of 214 DEGs, specifically, 205 downregulated genes and 9 upregulated genes, were identified. The DEGs were mainly enriched in angiogenesis, kidney development, oxidation-reduction process, metabolic pathways, etc. The 17 hub genes identified were mainly enriched in the biological processes of angiogenesis, cell adhesion, platelet degranulation, and leukocyte transendothelial migration. Survival analysis showed that EGF, KDR, CXCL12, REN, PECAM1, CDH5, THY1, WT1, PLAU and DCN might be related to the carcinogenesis, metastasis or recurrence of PRCC. UALCAN analysis showed that low expression of PECAM1 and PLAU in PRCC tissues was related to stage, subtype and race. Conclusions The DEGs and hub genes identified in the present study provide insight into the specific molecular mechanisms of PRCC occurrence and development and may be potential molecular markers and therapeutic targets for the accurate classification and efficient diagnosis and treatment of PRCC.

Download Full-text

Discovering novel driver mutations from pan-cancer analysis of mutational and gene expression profiles

PLoS ONE ◽

10.1371/journal.pone.0242780 ◽

2020 ◽

Vol 15 (11) ◽

pp. e0242780

Author(s):

Houriiyah Tegally ◽

Kevin H. Kensler ◽

Zahra Mungloo-Dilmohamud ◽

Anisah W. Ghoorah ◽

Timothy R. Rebbeck ◽

...

Keyword(s):

Gene Expression ◽

Large Scale ◽

Cancer Genomics ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Driver Mutations ◽

Literature Mining ◽

Cancer Genes ◽

Multiple Cancer ◽

Driver Genes

As the genomic profile across cancers varies from person to person, patient prognosis and treatment may differ based on the mutational signature of each tumour. Thus, it is critical to understand genomic drivers of cancer and identify potential mutational commonalities across tumors originating at diverse anatomical sites. Large-scale cancer genomics initiatives, such as TCGA, ICGC and GENIE have enabled the analysis of thousands of tumour genomes. Our goal was to identify new cancer-causing mutations that may be common across tumour sites using mutational and gene expression profiles. Genomic and transcriptomic data from breast, ovarian, and prostate cancers were aggregated and analysed using differential gene expression methods to identify the effect of specific mutations on the expression of multiple genes. Mutated genes associated with the most differentially expressed genes were considered to be novel candidates for driver mutations, and were validated through literature mining, pathway analysis and clinical data investigation. Our driver selection method successfully identified 116 probable novel cancer-causing genes, with 4 discovered in patients having no alterations in any known driver genes: MXRA5, OBSCN, RYR1, and TG. The candidate genes previously not officially classified as cancer-causing showed enrichment in cancer pathways and in cancer diseases. They also matched expectations pertaining to properties of cancer genes, for instance, showing larger gene and protein lengths, and having mutation patterns suggesting oncogenic or tumor suppressor properties. Our approach allows for the identification of novel putative driver genes that are common across cancer sites using an unbiased approach without any a priori knowledge on pathways or gene interactions and is therefore an agnostic approach to the identification of putative common driver genes acting at multiple cancer sites.

Download Full-text

Role of Epigenetics in Cancer Genomics

Software Innovations in Clinical Drug Development and Safety - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-4666-8726-4.ch011 ◽

2015 ◽

pp. 234-246

Author(s):

Amit Nagal

Keyword(s):

Gene Expression ◽

Protein Interactions ◽

Cancer Genomics ◽

Epigenetic Mechanisms ◽

Parallel Sequencing ◽

Analysis Technique ◽

Epigenetic Analysis ◽

Indispensable Tool ◽

Day By Day

Epigenetics is the study of changes in organisms caused by modification of gene expression rather than alteration of the genetic code itself. ChIP-seq, is a method used to analyze protein interactions with DNA. It is a type of epigenetic analysis technique. Chromatin immunoprecipitation coupled with massive parallel sequencing (ChIP-seq) is gaining popularity day by day because of its clinical significance. It is a very effective tool in diagnosis of disease such as cancer. ChIP-seq is found to be very effective tool in understanding basic regulatory mechanism, cell differentiation study and studying disease processes with the decreasing cost of sequencing, ChIP-seq has become an indispensable tool for studying gene regulation and epigenetic mechanisms. The Present review explores epigenetic methods, pipeline and its role in cancer.

Download Full-text

Identification of Gene Expression Pattern Related to Breast Cancer Survival Using Integrated TCGA Datasets and Genomic Tools

BioMed Research International ◽

10.1155/2015/878546 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10 ◽

Cited By ~ 15

Author(s):

Zhenzhen Huang ◽

Huilong Duan ◽

Haomin Li

Keyword(s):

Breast Cancer ◽

Gene Expression ◽

Cancer Patients ◽

Expression Pattern ◽

Clinical Data ◽

Cancer Genomics ◽

Expression Patterns ◽

Gene List ◽

Gene Expression Pattern ◽

Cancer Development

Several large-scale human cancer genomics projects such as TCGA offered huge genomic and clinical data for researchers to obtain meaningful genomics alterations which intervene in the development and metastasis of the tumor. A web-based TCGA data analysis platform called TCGA4U was developed in this study. TCGA4U provides a visualization solution for this study to illustrate the relationship of these genomics alternations with clinical data. A whole genome screening of the survival related gene expression patterns in breast cancer was studied. The gene list that impacts the breast cancer patient survival was divided into two patterns. Gene list of each of these patterns was separately analyzed on DAVID. The result showed that mitochondrial ribosomes play a more crucial role in the cancer development. We also reported that breast cancer patients with low HSPA2 expression level had shorter overall survival time. This is widely different to findings of HSPA2 expression pattern in other cancer types. TCGA4U provided a new perspective for the TCGA datasets. We believe it can inspire more biomedical researchers to study and explain the genomic alterations in cancer development and discover more targeted therapies to help more cancer patients.

Download Full-text

A Critical Evaluation of Analytic Aspects of Gene Expression Profiling in Lymphoid Leukemias with Broad Applications to Cancer Genomics

AIMS Medical Science ◽

10.3934/medsci.2016.3.248 ◽

2016 ◽

Vol 3 (3) ◽

pp. 248-271 ◽

Cited By ~ 1

Author(s):

Giuliano Crispatzu ◽

◽

Alexandra Schrader ◽

Michael Nothnagel ◽

Marco Herling ◽

...

Keyword(s):

Gene Expression ◽

Gene Expression Profiling ◽

Expression Profiling ◽

Cancer Genomics ◽

Critical Evaluation

Download Full-text

A male-specific association between AGTR1 hypermethylation and coronary heart disease

Bosnian Journal of Basic Medical Sciences ◽

10.17305/bjbms.2019.4321 ◽

2019 ◽

Cited By ~ 1

Author(s):

Xiaojing Li ◽

Nan Wu ◽

Huihui Ji ◽

Yi Huang ◽

Haochang Hu ◽

...

Keyword(s):

Gene Expression ◽

Data Mining ◽

Coronary Heart Disease ◽

Heart Disease ◽

Cancer Genomics ◽

Gene Expression Omnibus ◽

Inverse Association ◽

Angiotensin Ii Receptor ◽

Specific Pcr ◽

Data Mining Analysis

The AGTR1 gene encodes angiotensin II receptor type 1, which is involved in cardiovascular diseases such as coronary heart disease (CHD). In the current study, we analyzed AGTR1 methylation level in a Han Chinese population by SYBR green-based quantitative methylation-specific PCR (qMSP). We collected blood samples from 761 CHD patients and 398 non-CHD controls at the Ningbo First Hospital. A data mining analysis was also performed to explore the association between AGTR1 methylation and AGTR1 gene expression, using datasets from the cBioPortal for Cancer Genomics and the Gene Expression Omnibus (GEO) database. Our results showed a significantly higher percentage of methylated reference (PMR) of AGTR1 in male CHD patients compared with male non-CHD controls (median PMR: 2.12% vs. 0.59%, p = 0.037). The data mining analysis showed that AGTR1 expression was significantly increased in human hepatoma HepG2 cells treated with the demethylation agent 5-aza-2'-deoxycytidine (fold = 3.12, p = 0.009). Further data mining analysis using the cholangiocarcinoma (TCGA, PanCancer Atlas) data indicated an inverse association between AGTR1 methylation and AGTR1 expression (r = -0.595, p = 1.29E-04). Overall, our results suggest that AGTR1 methylation is involved in the regulation of AGTR1 gene expression and that AGTR1 hypermethylation is associated with CHD in males. These findings may provide new clues about the pathogenesis of CHD.

Download Full-text

Crosstalk between the IGF1 and CXCL12 Pathways Defines Objective Responses to the Farnesyl Transferase Inhibitor Tipifarnib in AML and PTCL Patients

Blood ◽

10.1182/blood-2018-99-116969 ◽

2018 ◽

Vol 132 (Supplement 1) ◽

pp. 5349-5349

Author(s):

Antonio Gualberto ◽

Catherine Scholz ◽

Vishnu Mishra ◽

Matthew R Janes ◽

Linda Kessler

Keyword(s):

Gene Expression ◽

Mechanism Of Action ◽

Cancer Genomics ◽

Lymphoid Cells ◽

Rna Seq ◽

Employment Equity ◽

Cxcl12 Expression ◽

Best Response ◽

Equity Ownership ◽

Pre Treatment

Abstract Background CXCL12 is a chemokine that is essential for the maturation of myeloid and lymphoid cells. Tipifarnib is a potent and selective inhibitor of the enzyme farnesyltransferase (FT). Treatment with this agent may translate to durable responses in subsets of patients (pts) with acute myeloid leukemia (AML), chronic myelomonocytic leukemia (CMML) and peripheral T-cell lymphoma (PTCL) but its mechanism of action in these indications is poorly understood. We have previously reported that tipifarnib interferes with the CXCL12 pathway. Here we show that an interplay between the CXCL12 and IGF1 pathways may define those pts who may experience objective responses with tipifarnib monotherapy. Methods Gene expression profile (GEP) data generated using RNA-Seq and the Affymetrix U133A gene-chips of tumor samples from 129 pts enrolled into tipifarnib trials (CTEP-20, KO-TIP-002,KO-TIP-004, INT-17) were analyzed with respect to study outcomes and complemented with analyses of mRNA expression in data sets from the cBioportal for Cancer Genomics. Gene expression was further validated using RT-PCR assays. RNA-Seq and Whole Exome Sequencing were conducted using standard methodologies. Clinical trial information: NCT00027872, NCT02464228. NCT02807272, NCT00354146. Results In order to improve our understanding of the molecular pathology of tumor CXCL12 overexpression, we investigated GEPs from 8,401 cancer pts in 25 studies available at cBioportal (TCGA, Provisional). Notably, we found a highly significant correlation in the expression of the IGF1 and CXCL12 genes in 19 of those studies. Intriguingly, the highest IGF1/CXCL12 correlations were observed in indications, including AML (ρ=0.698, p<0.001), in which activity of tipifarnib as monotherapy has been previously reported (AML, breast and urothelial cancer). Based on these results, we investigated the effect of IGF1/CXCL12 co-expression on pt outcome in tipifarnib studies. In previously untreated AML, 3 subsets of pts were identified with respect to bone marrow (BM) IGF1/CXCL12 expression: (1) high IGF1, high CXCL12 with predominantly hematological improvement or stable disease (SD) as best response, (2) intermediate IGF1, low CXCL12, with predominantly disease progression (PD), and (3) low IGF1, variable CXCL12, with 6 complete responses in 15 pts that were associated with CXCL12 expression (p=0.013), supporting the notion that CXCL12 pathway activation determines objective responses with tipifarnib while IGF1 mediates drug resistance. Barely detectable levels of IGF1 (and IGF2) were observed in the BMs of CMML pts in whom only 1 best response of PD was reported. In contrast, elevated levels of both CXCL12 and IGF1 were observed in PTCL pts responding to tipifarnib. Further investigation revealed that tumors of PTCL pts experiencing a partial response (PR) with tipifarnib expressed high levels of IGFBP7 (p=0.03), a natural inhibitor of the IGF1 receptor. Sequencing of the CXCL12 and IGF1 genes in PTCL samples revealed the presence of polymorphisms in non-responding pts: 8 pts, 7 carrying CXCL12 rs2839695 and 1 with a novel 3UTR variant, experienced a best response of PD. No pts with a best response of PR or SD carried 3UTR variants in CXCL12 (0% vs 80%, p=0.007). No pt with a best response of PR, 1 of 4 pts with SD and 6 of 10 pts with PD carried the IGFBP7 variant L11F (rs11573021) (16% PR/SD vs 60% PD, p=0.15) Conclusions Pre-treatment tumor CXCL12, IGF1 and IGFBP7 expression may enable the identification of pts susceptible to experience objective responses with tipifarnib monotherapy. These data may contribute to the understanding of the mechanism of action of FT inhibitors. Disclosures Gualberto: Kura Oncology: Employment, Equity Ownership. Mishra:Kura Oncology: Employment, Equity Ownership. Janes:Wellspring Biosciences: Employment, Equity Ownership. Kessler:Kura Oncology: Employment, Equity Ownership.

Download Full-text