scholarly journals Bayesian inference of cancer driver genes using signatures of positive selection

2017 ◽  
Author(s):  
Luis Zapata ◽  
Hana Susak ◽  
Oliver Drechsel ◽  
Marc R. Friedländer ◽  
Xavier Estivill ◽  
...  

AbstractTumors are composed of an evolving population of cells subjected to tissue-specific selection, which fuels tumor heterogeneity and ultimately complicates cancer driver gene identification. Here, we integrate cancer cell fraction, population recurrence, and functional impact of somatic mutations as signatures of selection into a Bayesian inference model for driver prediction. In an in-depth benchmark, we demonstrate that our model, cDriver, outperforms competing methods when analyzing solid tumors, hematological malignancies, and pan-cancer datasets. Applying cDriver to exome sequencing data of 21 cancer types from 6,870 individuals revealed 98 unreported tumor type-driver gene connections. These novel connections are highly enriched for chromatin-modifying proteins, hinting at a universal role of chromatin regulation in cancer etiology. Although infrequently mutated as single genes, we show that chromatin modifiers are altered in a large fraction of cancer patients. In summary, we demonstrate that integration of evolutionary signatures is key for identifying mutational driver genes, thereby facilitating the discovery of novel therapeutic targets for cancer treatment.

2019 ◽  
Author(s):  
Pramod Chandrashekar ◽  
Navid Ahmadinejad ◽  
Junwen Wang ◽  
Aleksandar Sekulic ◽  
Jan B. Egan ◽  
...  

ABSTRACTFunctions of cancer driver genes depend on cellular contexts that vary substantially across tissues and organs. Distinguishing oncogenes (OGs) and tumor suppressor genes (TSGs) for each cancer type is critical to identifying clinically actionable targets. However, current resources for context-aware classifications of cancer drivers are limited. In this study, we show that the direction and magnitude of somatic selection of missense and truncating mutations of a gene are suggestive of its contextual activities. By integrating these features with ratiometric and conservation measures, we developed a computational method to categorize OGs and TSGs using exome sequencing data. This new method, named genes under selection in tumors (GUST) shows an overall accuracy of 0.94 when tested on manually curated benchmarks. Application of GUST to 10,172 tumor exomes of 33 cancer types identified 98 OGs and 179 TSGs, >70% of which promote tumorigenesis in only one cancer type. In broad-spectrum drivers shared across multiple cancer types, we found heterogeneous mutational hotspots modifying distinct functional domains, implicating the synchrony of convergent and divergent disease mechanisms. We further discovered two novel OGs and 28 novel TSGs with high confidence. The GUST program is available at https://github.com/liliulab/gust. A database with pre-computed classifications is available at https://liliulab.shinyapps.io/gust


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Ege Ülgen ◽  
O. Uğur Sezerman

Abstract Background Cancer develops due to “driver” alterations. Numerous approaches exist for predicting cancer drivers from cohort-scale genomics data. However, methods for personalized analysis of driver genes are underdeveloped. In this study, we developed a novel personalized/batch analysis approach for driver gene prioritization utilizing somatic genomics data, called driveR. Results Combining genomics information and prior biological knowledge, driveR accurately prioritizes cancer driver genes via a multi-task learning model. Testing on 28 different datasets, this study demonstrates that driveR performs adequately, achieving a median AUC of 0.684 (range 0.651–0.861) on the 28 batch analysis test datasets, and a median AUC of 0.773 (range 0–1) on the 5157 personalized analysis test samples. Moreover, it outperforms existing approaches, achieving a significantly higher median AUC than all of MutSigCV (Wilcoxon rank-sum test p < 0.001), DriverNet (p < 0.001), OncodriveFML (p < 0.001) and MutPanning (p < 0.001) on batch analysis test datasets, and a significantly higher median AUC than DawnRank (p < 0.001) and PRODIGY (p < 0.001) on personalized analysis datasets. Conclusions This study demonstrates that the proposed method is an accurate and easy-to-utilize approach for prioritizing driver genes in cancer genomes in personalized or batch analyses. driveR is available on CRAN: https://cran.r-project.org/package=driveR.


2019 ◽  
Author(s):  
Jie Xu ◽  
Fan Song ◽  
Emily Schleicher ◽  
Christopher Pool ◽  
Darrin Bann ◽  
...  

AbstractWhile genomic analysis of tumors has stimulated major advances in cancer diagnosis, prognosis and treatment, current methods fail to identify a large fraction of somatic structural variants in tumors. We have applied a combination of whole genome sequencing and optical genome mapping to a number of adult and pediatric leukemia samples, which revealed in each of these samples a large number of structural variants not recognizable by current tools of genomic analyses. We developed computational methods to determine which of those variants likely arose as somatic mutations. The method identified 97% of the structural variants previously reported by karyotype analysis of these samples and revealed an additional fivefold more such somatic rearrangements. The method identified on average tens of previously unrecognizable inversions and duplications and hundreds of previously unrecognizable insertions and deletions. These structural variants recurrently affected a number of leukemia associated genes as well as cancer driver genes not previously associated with leukemia and genes not previously associated with cancer. A number of variants only affected intergenic regions but caused cis-acting alterations in expression of neighboring genes. Analysis of TCGA data indicates that the status of several of the recurrently mutated genes identified in this study significantly affect survival of AML patients. Our results suggest that current genomic analysis methods fail to identify a majority of structural variants in leukemia samples and this lacunae may hamper diagnostic and prognostic efforts.


2020 ◽  
Vol 49 (D1) ◽  
pp. D1289-D1301 ◽  
Author(s):  
Tao Wang ◽  
Shasha Ruan ◽  
Xiaolu Zhao ◽  
Xiaohui Shi ◽  
Huajing Teng ◽  
...  

Abstract The prevalence of neutral mutations in cancer cell population impedes the distinguishing of cancer-causing driver mutations from passenger mutations. To systematically prioritize the oncogenic ability of somatic mutations and cancer genes, we constructed a useful platform, OncoVar (https://oncovar.org/), which employed published bioinformatics algorithms and incorporated known driver events to identify driver mutations and driver genes. We identified 20 162 cancer driver mutations, 814 driver genes and 2360 pathogenic pathways with high-confidence by reanalyzing 10 769 exomes from 33 cancer types in The Cancer Genome Atlas (TCGA) and 1942 genomes from 18 cancer types in International Cancer Genome Consortium (ICGC). OncoVar provides four points of view, ‘Mutation’, ‘Gene’, ‘Pathway’ and ‘Cancer’, to help researchers to visualize the relationships between cancers and driver variants. Importantly, identification of actionable driver alterations provides promising druggable targets and repurposing opportunities of combinational therapies. OncoVar provides a user-friendly interface for browsing, searching and downloading somatic driver mutations, driver genes and pathogenic pathways in various cancer types. This platform will facilitate the identification of cancer drivers across individual cancer cohorts and helps to rank mutations or genes for better decision-making among clinical oncologists, cancer researchers and the broad scientific community interested in cancer precision medicine.


Author(s):  
Shu-Hsuan Liu ◽  
Pei-Chun Shen ◽  
Chen-Yang Chen ◽  
An-Ni Hsu ◽  
Yi-Chun Cho ◽  
...  

Abstract An integrative multi-omics database is needed urgently, because focusing only on analysis of one-dimensional data falls far short of providing an understanding of cancer. Previously, we presented DriverDB, a cancer driver gene database that applies published bioinformatics algorithms to identify driver genes/mutations. The updated DriverDBv3 database (http://ngs.ym.edu.tw/driverdb) is designed to interpret cancer omics’ sophisticated information with concise data visualization. To offer diverse insights into molecular dysregulation/dysfunction events, we incorporated computational tools to define CNV and methylation drivers. Further, four new features, CNV, Methylation, Survival, and miRNA, allow users to explore the relations from two perspectives in the ‘Cancer’ and ‘Gene’ sections. The ‘Survival’ panel offers not only significant survival genes, but gene pairs synergistic effects determine. A fresh function, ‘Survival Analysis’ in ‘Customized-analysis,’ allows users to investigate the co-occurring events in user-defined gene(s) by mutation status or by expression in a specific patient group. Moreover, we redesigned the web interface and provided interactive figures to interpret cancer omics’ sophisticated information, and also constructed a Summary panel in the ‘Cancer’ and ‘Gene’ sections to visualize the features on multi-omics levels concisely. DriverDBv3 seeks to improve the study of integrative cancer omics data by identifying driver genes and contributes to cancer biology.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Antonio Colaprico ◽  
Catharina Olsen ◽  
Matthew H. Bailey ◽  
Gabriel J. Odom ◽  
Thilde Terkelsen ◽  
...  

AbstractCancer driver gene alterations influence cancer development, occurring in oncogenes, tumor suppressors, and dual role genes. Discovering dual role cancer genes is difficult because of their elusive context-dependent behavior. We define oncogenic mediators as genes controlling biological processes. With them, we classify cancer driver genes, unveiling their roles in cancer mechanisms. To this end, we present Moonlight, a tool that incorporates multiple -omics data to identify critical cancer driver genes. With Moonlight, we analyze 8000+ tumor samples from 18 cancer types, discovering 3310 oncogenic mediators, 151 having dual roles. By incorporating additional data (amplification, mutation, DNA methylation, chromatin accessibility), we reveal 1000+ cancer driver genes, corroborating known molecular mechanisms. Additionally, we confirm critical cancer driver genes by analysing cell-line datasets. We discover inactivation of tumor suppressors in intron regions and that tissue type and subtype indicate dual role status. These findings help explain tumor heterogeneity and could guide therapeutic decisions.


2020 ◽  
Vol 3 (1) ◽  
Author(s):  
Xiaobao Dong ◽  
Dandan Huang ◽  
Xianfu Yi ◽  
Shijie Zhang ◽  
Zhao Wang ◽  
...  

AbstractMutation-specific effects of cancer driver genes influence drug responses and the success of clinical trials. We reasoned that these effects could unbalance the distribution of each mutation across different cancer types, as a result, the cancer preference can be used to distinguish the effects of the causal mutation. Here, we developed a network-based framework to systematically measure cancer diversity for each driver mutation. We found that half of the driver genes harbor cancer type-specific and pancancer mutations simultaneously, suggesting that the pervasive functional heterogeneity of the mutations from even the same driver gene. We further demonstrated that the specificity of the mutations could influence patient drug responses. Moreover, we observed that diversity was generally increased in advanced tumors. Finally, we scanned potentially novel cancer driver genes based on the diversity spectrum. Diversity spectrum analysis provides a new approach to define driver mutations and optimize off-label clinical trials.


2020 ◽  
Author(s):  
Constance H. Li ◽  
Syed Haider ◽  
Paul C. Boutros

AbstractEpidemiological studies have identified innumerable ways in which cancer presentation and behaviour is associated with patient ancestry. The molecular bases for these relationships remain largely unknown. We analyzed ancestry associations in the somatic mutational landscape of 12,774 tumours across 33 tumour-types, including 2,562 with whole-genome sequencing. Ancestry influences both the number of mutations in a tumour and the evolutionary timing of when they occur. Specific mutational signatures are associated with ancestry, reflecting potential differences in exogenous and endogenous oncogenic processes. A subset of known cancer driver genes was mutated in ancestry-associated patterns, with transcriptomic consequences. Cancer genome sequencing data is not well-balanced in epidemiologic factors; these data suggest ancestry strongly shapes the somatic mutational landscape of cancer, with potential functional implications.


Sign in / Sign up

Export Citation Format

Share Document