scholarly journals Moonlight: a tool for biological interpretation and driver genes discovery

2018 ◽  
Author(s):  
Antonio Colaprico ◽  
Catharina Olsen ◽  
Claudia Cava ◽  
Thilde Terkelsen ◽  
Tiago C. Silva ◽  
...  

AbstractCancer is a complex and heterogeneous disease. It is crucial to identify the key driver genes and their role in cancer mechanisms with attention to different cancer stages, types or subtypes. Cancer driver genes are elusive and their discovery is complicated by the fact that the same gene can play a diverse role in different contexts. Key biological processes, such as cell proliferation and cell death, have been linked to cancer progression. Thus, in principle, they can be exploited to classify the cancer genes and unveil their role. Here, we present a new method, Moonlight, that exploit expression data to classify cancer genes. Moonlight relies on the integration of functional enrichment analysis, gene regulatory networks and upstream regulator analysis from expression data to score the importance of biological cancer-related processes taking into account either the inter- or intra-tumor heterogeneity. We then employed these scores to predict if each gene acts as a tumor suppressor gene (TSG) or as an oncogene (OCG). Our methodology also allow to predict genes with dual role, i.e. the moonlight genes (TSG in one cancer type or stage and OCG in another), as well as to elucidate the underlying biological processes. Availability: https://bioconductor.org/packages/MoonlightR & https://github.com/ibsquare/MoonlightR/

2021 ◽  
Author(s):  
Hong Lin ◽  
Qizheng Han ◽  
Junhao Wang ◽  
Zhaoqian Zhong ◽  
Haihua Luo ◽  
...  

Abstract Purpose Retinoid-binding protein (RBP7) is a member of the cellular retinol-binding protein (CRBP) family, which is involved in the pathogenesis of breast cancer (BRCA). The study aims to illustrate the prognostic value and the potential regulatory mechanisms of RBP7 expression in BRCA. Methods We utilized a series of bioinformatics tools, including HPA, GEPIA, UALCAN, ONCOMINE, Kaplan–Meier plotter, PROGeneV2, TISCH, LinkedOmics, UCSC Xena, MethSurv, SMART APP, bc-GenExMiner4.7, OSbrca, STRING, CARE, SwissDock and R software packages, to investigate the expression, prognostic value and functional regulatory networks of RBP7 in BRCA. Results Bioinformatics analysis with the TCGA and CPTAC databases revealed that the mRNA and protein expression levels of RBP7 in normal were higher compared to BRCA tissues. Survival analysis displayed that the lower expression of RBP7, the worse the prognosis in ER-positive (ER+) BRCA patients. Genomic analysis showed that promoter methylation result in transcriptional silencing of RBP7 in BRCA. Functional enrichment analysis demonstrated that downregulation of RBP7 expression may exert its biological influence on BRCA through the PPAR pathway and the PI3K/AKT pathway. Conclusions In summary, we identified RBP7 as a novel biomarker that is helpful for the prognosis of ER+ BRCA patients. Promoter methylation of RBP7 is involved in its gene silencing in BRCA, thus regulating the occurrence and development of ER+ BRCA through the PPAR and PI3K/AKT pathways.


2017 ◽  
Author(s):  
Joana Carlevaro-Fita ◽  
Andrés Lanzós ◽  
Lars Feuerbach ◽  
Chen Hong ◽  
David Mas-Ponte ◽  
...  

AbstractLong non-coding RNAs (lncRNAs) that drive tumorigenesis are a growing focus of cancer genomics studies. To facilitate further discovery, we have created the “Cancer LncRNA Census” (CLC), a manually-curated and strictly-defined compilation of lncRNAs with causative roles in cancer. CLC has two principle applications: first, as a resource for training and benchmarking de novo identification methods; and second, as a dataset for studying the fundamental properties of these genes.CLC Version 1 comprises 122 lncRNAs implicated in 29 distinct cancers. LncRNAs are included based on functional or genetic evidence for causative roles in cancer progression. All belong to the GENCODE reference annotation, to enable integration across projects and datasets. For each entry, the evidence type, biological activity (oncogene or tumour suppressor), source reference and cancer type are recorded. Supporting its usefulness, CLC genes are significantly enriched amongst de novo predicted driver genes from PCAWG. CLC genes are distinguished from other lncRNAs by a series of features consistent with biological function, including gene length, high expression and sequence conservation of both exons and promoters. We identify a trend for CLC genes to be co-localised with known protein-coding cancer genes along the human genome. Finally, by integrating data from transposon-mutagenesis functional screens, we show that mouse orthologues of CLC genes tend also to be cancer genes.Thus CLC represents a valuable resource for research into long non-coding RNAs in cancer. Their evolutionary and genomic properties have implications for understanding disease mechanisms and point to conserved functions across ~80 million years of evolution.


2020 ◽  
Author(s):  
Gulden Olgun ◽  
Oznur Tastan

AbstractThe dysregulation of long non-coding RNAs’ (lncRNAs) expressions has been implicated in cancer. Since most of the lncRNAs’ are not functionally characterized well, investigating the set of perturbed lncRNAs are is challenging. Existing methods that inspect lncRNAs functionally rely on the co-expressed coding genes, which are far better characterized functionally. LncRNAs can be known to act as transcriptional regulators; they may activate or repress the neighborhood’s coding genes on the genome. Based on this, in this work, we aim to analyze the deregulated lncRNAs in cancer by taking into account their ability to regulate nearby loci on the genome. We perform functional analysis on differentially expressed lncRNAs for 28 different cancers considering their adjacent coding genes. We identify that some deregulated lncRNAs are cancer-specific, but a substantial number of lncRNAs are shared across cancers. Next, we assess the similarities of the cancer types based on the functional enrichment of the deregulated lncRNA sets. We find some cancers are very similar in the functions and biological processes related to the deregulated lncRNAs. We observe that some of the cancers for which we find similarity can be linked through primary, metastatic site relations. We investigate the similarity of enriched functional terms for the deregulated lncRNAs and the mRNAs. We further assess the enriched functions’ similarity to the functions and processes that the known cancer driver genes take place. We believe that our methodology help to understand the impact of the lncRNAs in cancer functionally.


2015 ◽  
Vol 2015 ◽  
pp. 1-9 ◽  
Author(s):  
Meng Liu ◽  
Xia Li ◽  
Rui Fan ◽  
Xinhua Liu ◽  
Ju Wang

Nicotine, as the major psychoactive component of tobacco, has broad physiological effects within the central nervous system, but our understanding of the molecular mechanism underlying its neuronal effects remains incomplete. In this study, we performed a systematic analysis on a set of nicotine addiction-related genes to explore their characteristics at network levels. We found that NAGenes tended to have a more moderate degree and weaker clustering coefficient and to be less central in the network compared to alcohol addiction-related genes or cancer genes. Further, clustering of these genes resulted in six clusters with themes in synaptic transmission, signal transduction, metabolic process, and apoptosis, which provided an intuitional view on the major molecular functions of the genes. Moreover, functional enrichment analysis revealed that neurodevelopment, neurotransmission activity, and metabolism related biological processes were involved in nicotine addiction. In summary, by analyzing the overall characteristics of the nicotine addiction related genes, this study provided valuable information for understanding the molecular mechanisms underlying nicotine addiction.


2022 ◽  
Author(s):  
Malvika Sudhakar ◽  
Raghunathan Rengaswamy ◽  
Karthik Raman

The progression of tumorigenesis starts with a few mutational and structural driver events in the cell. Various cohort-based computational tools exist to identify driver genes but require a large number of samples to produce reliable results. Many studies use different methods to identify driver mutations/genes from mutations that have no impact on tumour progression; however, a small fraction of patients show no mutational events in any known driver genes. Current unsupervised methods map somatic and expression data onto a network to identify the perturbation in the network. Our method is the first machine learning model to classify genes as tumour suppressor gene (TSG), oncogene (OG) or neutral, thus assigning the functional impact of the gene in the patient. In this study, we develop a multi-omic approach, PIVOT (Personalised Identification of driVer OGs and TSGs), to train on experimentally or computationally validated mutational and structural driver events. Given the lack of any gold standards for the identification of personalised driver genes, we label the data using four strategies and, based on classification metrics, show gene-based labelling strategies perform best. We build different models using SNV, RNA, and multi-omic features to be used based on the data available. Our models trained on multi-omic data improved predictions compared to mutation and expression data, achieving an accuracy >0.99 for BRCA, LUAD and COAD datasets. We show network and expression-based features contribute the most to PIVOT. Our predictions on BRCA, COAD and LUAD cancer types reveal commonly altered genes such as TP53, and PIK3CA, which are predicted drivers for multiple cancer types. Along with known driver genes, our models also identify new driver genes such as PRKCA, SOX9 and PSMD4. Our multi-omic model labels both CNV and mutations with a more considerable contribution by CNV alterations. While predicting labels for genes mutated in multiple samples, we also label rare driver events occurring in as few as one sample. We also identify genes with dual roles within the same cancer type. Overall, PIVOT labels personalised driver genes as TSGs and OGs and also identifies rare driver genes. PIVOT is available at https://github.com/RamanLab/PIVOT.


2021 ◽  
Vol 12 ◽  
Author(s):  
Maryam Heidari ◽  
Abbas Pakdel ◽  
Mohammad Reza Bakhtiarizadeh ◽  
Fariba Dehghanian

Johne’s disease is a chronic infection of ruminants that burdens dairy herds with a significant economic loss. The pathogenesis of the disease has not been revealed clearly due to its complex nature. In order to achieve deeper biological insights into molecular mechanisms involved in MAP infection resulting in Johne’s disease, a system biology approach was used. As far as is known, this is the first study that considers lncRNAs, TFs, and mRNAs, simultaneously, to construct an integrated gene regulatory network involved in MAP infection. Weighted gene coexpression network analysis (WGCNA) and functional enrichment analysis were conducted to explore coexpression modules from which nonpreserved modules had altered connectivity patterns. After identification of hub and hub-hub genes as well as TFs and lncRNAs in the nonpreserved modules, integrated networks of lncRNA-mRNA-TF were constructed, and cis and trans targets of lncRNAs were identified. Both cis and trans targets of lncRNAs were found in eight nonpreserved modules. Twenty-one of 47 nonpreserved modules showed significant biological processes related to the immune system and MAP infection. Some of the MAP infection’s related pathways in the most important nonpreserved modules comprise “positive regulation of cytokine-mediated signaling pathway,” “negative regulation of leukocyte migration,” “T-cell differentiation,” “neutrophil activation,” and “defense response.” Furthermore, several genes were identified in these modules, including SLC11A1, MAPK8IP1, HMGCR, IFNGR1, CMPK2, CORO1A, IRF1, LDLR, BOLA-DMB, and BOLA-DMA, which are potentially associated with MAP pathogenesis. This study not only enhanced our knowledge of molecular mechanisms behind MAP infection but also highlighted several promising hub and hub-hub genes involved in macrophage-pathogen interaction.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8831 ◽  
Author(s):  
Xiaojiao Guan ◽  
Yao Yao ◽  
Guangyao Bao ◽  
Yue Wang ◽  
Aimeng Zhang ◽  
...  

Esophageal cancer is a common malignant tumor in the world, and the aim of this study was to screen key genes related to the development of esophageal cancer using a variety of bioinformatics analysis tools and analyze their biological functions. The data of esophageal squamous cell carcinoma from the Gene Expression Omnibus (GEO) were selected as the research object, processed and analyzed to screen differentially expressed microRNAs (miRNAs) and differential methylation genes. The competing endogenous RNAs (ceRNAs) interaction network of differentially expressed genes was constructed by bioinformatics tools DAVID, String, and Cytoscape. Biofunctional enrichment analysis was performed using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). The expression of the screened genes and the survival of the patients were verified. By analyzing GSE59973 and GSE114110, we found three down-regulated and nine up-regulated miRNAs. The gene expression matrix of GSE120356 was calculated by Pearson correlation coefficient, and the 11696 pairs of ceRNA relation were determined. In the ceRNA network, 643 lncRNAs and 147 mRNAs showed methylation difference. Functional enrichment analysis showed that these differentially expressed genes were mainly concentrated in the FoxO signaling pathway and were involved in the corresponding cascade of calcineurin. By analyzing the clinical data in The Cancer Genome Atlas (TCGA) database, it was found that four lncRNAs had an important impact on the survival and prognosis of esophageal carcinoma patients. QRT-PCR was also conducted to identify the expression of the key lncRNAs (RNF217-AS1, HCP5, ZFPM2-AS1 and HCG22) in ESCC samples. The selected key genes can provide theoretical guidance for further research on the molecular mechanism of esophageal carcinoma and the screening of molecular markers.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 709 ◽  
Author(s):  
Liis Kolberg ◽  
Uku Raudvere ◽  
Ivan Kuzmin ◽  
Jaak Vilo ◽  
Hedi Peterson

g:Profiler (https://biit.cs.ut.ee/gprofiler) is a widely used gene list functional profiling and namespace conversion toolset that has been contributing to reproducible biological data analysis already since 2007. Here we introduce the accompanying R package, gprofiler2, developed to facilitate programmatic access to g:Profiler computations and databases via REST API. The gprofiler2 package provides an easy-to-use functionality that enables researchers to incorporate functional enrichment analysis into automated analysis pipelines written in R. The package also implements interactive visualisation methods to help to interpret the enrichment results and to illustrate them for publications. In addition, gprofiler2 gives access to the versatile gene/protein identifier conversion functionality in g:Profiler enabling to map between hundreds of different identifier types or orthologous species. The gprofiler2 package is freely available at the CRAN repository.


2021 ◽  
Vol 12 ◽  
Author(s):  
Yani Dong ◽  
Likang Lyu ◽  
Daiqiang Zhang ◽  
Jing Li ◽  
Haishen Wen ◽  
...  

Long non-coding RNAs (lncRNAs) have been reported to be involved in multiple biological processes. However, the roles of lncRNAs in the reproduction of half-smooth tongue sole (Cynoglossus semilaevis) are unclear, especially in the molecular regulatory mechanism driving ovarian development and ovulation. Thus, to explore the mRNA and lncRNA mechanisms regulating reproduction, we collected tongue sole ovaries in three stages for RNA sequencing. In stage IV vs. V, we identified 312 differentially expressed (DE) mRNAs and 58 DE lncRNAs. In stage V vs. VI, we identified 1,059 DE mRNAs and 187 DE lncRNAs. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses showed that DE mRNAs were enriched in ECM-receptor interaction, oocyte meiosis and steroid hormone biosynthesis pathways. Furthermore, we carried out gene set enrichment analysis (GSEA) to identify potential reproduction related-pathways additionally, such as fatty metabolism and retinol metabolism. Based on enrichment analysis, DE mRNAs with a potential role in reproduction were selected and classified into six categories, including signal transduction, cell growth and death, immune response, metabolism, transport and catabolism, and cell junction. The interactions of DE lncRNAs and mRNAs were predicted according to antisense, cis-, and trans-regulatory mechanisms. We constructed a competing endogenous RNA (ceRNA) network. Several lncRNAs were predicted to regulate genes related to reproduction including cyp17a1, cyp19a1, mmp14, pgr, and hsd17b1. The functional enrichment analysis of these target genes of lncRNAs revealed that they were involved in several signaling pathways, such as the TGF-beta, Wnt signaling, and MAPK signaling pathways and reproduction related-pathways such as the progesterone-mediated oocyte maturation, oocyte meiosis, and GnRH signaling pathway. RT-qPCR analysis showed that two lncRNAs (XR_522278.2 and XR_522171.2) were mainly expressed in the ovary. Dual-fluorescence in situ hybridization experiments showed that both XR_522278.2 and XR_522171.2 colocalized with their target genes cyp17a1 and cyp19a1, respectively, in the follicular cell layer. The results further demonstrated that lncRNAs might be involved in the biological processes by modulating gene expression. Taken together, this study provides lncRNA profiles in the ovary of tongue sole and further insight into the role of lncRNA involvement in regulating reproduction in tongue sole.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11321
Author(s):  
Di Zhang ◽  
Pengguang Yan ◽  
Taotao Han ◽  
Xiaoyun Cheng ◽  
Jingnan Li

Background Ulcerative colitis-associated colorectal cancer (UC-CRC) is a life-threatening complication of ulcerative colitis (UC). The mechanisms underlying UC-CRC remain to be elucidated. The purpose of this study was to explore the key genes and biological processes contributing to colitis-associated dysplasia (CAD) or carcinogenesis in UC via database mining, thus offering opportunities for early prediction and intervention of UC-CRC. Methods Microarray datasets (GSE47908 and GSE87466) were downloaded from Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) between groups of GSE47908 were identified using the “limma” R package. Weighted gene co-expression network analysis (WGCNA) based on DEGs between the CAD and control groups was conducted subsequently. Functional enrichment analysis was performed, and hub genes of selected modules were identified using the “clusterProfiler” R package. Single-gene gene set enrichment analysis (GSEA) was conducted to predict significant biological processes and pathways associated with the specified gene. Results Six functional modules were identified based on 4929 DEGs. Green and blue modules were selected because of their consistent correlation with UC and CAD, and the highest correlation coefficient with the progress of UC-associated carcinogenesis. Functional enrichment analysis revealed that genes of these two modules were significantly enriched in biological processes, including mitochondrial dysfunction, cell-cell junction, and immune responses. However, GSEA based on differential expression analysis between sporadic colorectal cancer (CRC) and normal controls from The Cancer Genome Atlas (TCGA) indicated that mitochondrial dysfunction may not be the major carcinogenic mechanism underlying sporadic CRC. Thirteen hub genes (SLC25A3, ACO2, AIFM1, ATP5A1, DLD, TFE3, UQCRC1, ADIPOR2, SLC35D1, TOR1AIP1, PRR5L, ATOX1, and DTX3) were identified. Their expression trends were validated in UC patients of GSE87466, and their potential carcinogenic effects in UC were supported by their known functions and other relevant studies reported in the literature. Single-gene GSEA indicated that biological processes and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways related to angiogenesis and immune response were positively correlated with the upregulation of TFE3, whereas those related to mitochondrial function and energy metabolism were negatively correlated with the upregulation of TFE3. Conclusions Using WGCNA, this study found two gene modules that were significantly correlated with CAD, of which 13 hub genes were identified as the potential key genes. The critical biological processes in which the genes of these two modules were significantly enriched include mitochondrial dysfunction, cell-cell junction, and immune responses. TFE3, a transcription factor related to mitochondrial function and cancers, may play a central role in UC-associated carcinogenesis.


Sign in / Sign up

Export Citation Format

Share Document