scholarly journals PAN: Personalized Annotation-based Networks for the Prediction of Breast Cancer Relapse

2019 ◽  
Author(s):  
Thin Nguyen ◽  
Samuel C. Lee ◽  
Thomas P. Quinn ◽  
Buu Truong ◽  
Xiaomei Li ◽  
...  

AbstractThe classification of clinical samples based on gene expression data is an important part of precision medicine. However, it has proved difficult to accurately predict survival outcomes and treatment responses for cancer patients. In this manuscript, we show how transforming gene expression data into a set of personalized (sample-specific) networks can allow us to harness existing graph-based methods to improve classifier performance. Existing approaches to personalized gene networks all have the limitation that they depend on other samples in the data and must get re-computed whenever a new sample is introduced. Here, we propose a novel method, called Personalized Annotation-based Networks (PAN), that avoids this limitation by using curated annotation databases to transform gene expression data into a graph. These databases organize genes into overlapping gene sets, called annotations, that we use to build a network where nodes represent functional terms and edges represent the similarity between them. Unlike competing methods, PANs are calculated for each sample independent of the population, making it a more efficient way to obtain single-sample networks. Using three breast cancer datasets as a case study (METABRIC and a super-set of GEO studies), we show that PAN classifiers not only predict cancer relapse better than gene features alone, but also outperform PPI and population-level graph-based classifiers. This work demonstrates the practical advantages of graph-based classification for high-dimensional genomic data, while offering a new approach to making sample-specific networks.Supplementary informationThe codes and data are available at https://github.com/thinng/[email protected]

2020 ◽  
Vol 36 (9) ◽  
pp. 2755-2762
Author(s):  
Jia-Juan Tu ◽  
Le Ou-Yang ◽  
Hong Yan ◽  
Xiao-Fei Zhang ◽  
Hong Qin

Abstract Motivation Reconstruction of cancer gene networks from gene expression data is important for understanding the mechanisms underlying human cancer. Due to heterogeneity, the tumor tissue samples for a single cancer type can be divided into multiple distinct subtypes (inter-tumor heterogeneity) and are composed of non-cancerous and cancerous cells (intra-tumor heterogeneity). If tumor heterogeneity is ignored when inferring gene networks, the edges specific to individual cancer subtypes and cell types cannot be characterized. However, most existing network reconstruction methods do not simultaneously take inter-tumor and intra-tumor heterogeneity into account. Results In this article, we propose a new Gaussian graphical model-based method for jointly estimating multiple cancer gene networks by simultaneously capturing inter-tumor and intra-tumor heterogeneity. Given gene expression data of heterogeneous samples for different cancer subtypes, a non-cancerous network shared across different cancer subtypes and multiple subtype-specific cancerous networks are estimated jointly. Tumor heterogeneity can be revealed by the difference in the estimated networks. The performance of our method is first evaluated using simulated data, and the results indicate that our method outperforms other state-of-the-art methods. We also apply our method to The Cancer Genome Atlas breast cancer data to reconstruct non-cancerous and subtype-specific cancerous gene networks. Hub nodes in the networks estimated by our method perform important biological functions associated with breast cancer development and subtype classification. Availability and implementation The source code is available at https://github.com/Zhangxf-ccnu/NETI2. Supplementary information Supplementary data are available at Bioinformatics online.


2005 ◽  
Vol 03 (06) ◽  
pp. 1295-1313 ◽  
Author(s):  
YOSHINORI TAMADA ◽  
HIDEO BANNAI ◽  
SEIYA IMOTO ◽  
TOSHIAKI KATAYAMA ◽  
MINORU KANEHISA ◽  
...  

Since microarray gene expression data do not contain sufficient information for estimating accurate gene networks, other biological information has been considered to improve the estimated networks. Recent studies have revealed that highly conserved proteins that exhibit similar expression patterns in different organisms, have almost the same function in each organism. Such conserved proteins are also known to play similar roles in terms of the regulation of genes. Therefore, this evolutionary information can be used to refine regulatory relationships among genes, which are estimated from gene expression data. We propose a statistical method for estimating gene networks from gene expression data by utilizing evolutionarily conserved relationships between genes. Our method simultaneously estimates two gene networks of two distinct organisms, with a Bayesian network model utilizing the evolutionary information so that gene expression data of one organism helps to estimate the gene network of the other. We show the effectiveness of the method through the analysis on Saccharomyces cerevisiae and Homo sapiens cell cycle gene expression data. Our method was successful in estimating gene networks that capture many known relationships as well as several unknown relationships which are likely to be novel. Supplementary information is available at .


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
Gaia Griguolo ◽  
Maria Vittoria Dieci ◽  
Laia Paré ◽  
Federica Miglietta ◽  
Daniele Giulio Generali ◽  
...  

AbstractLittle is known regarding the interaction between immune microenvironment and tumor biology in hormone receptor (HR)+/HER2− breast cancer (BC). We here assess pretreatment gene-expression data from 66 HR+/HER2− early BCs from the LETLOB trial and show that non-luminal tumors (HER2-enriched, Basal-like) present higher tumor-infiltrating lymphocyte levels than luminal tumors. Moreover, significant differences in immune infiltrate composition, assessed by CIBERSORT, were observed: non-luminal tumors showed a more proinflammatory antitumor immune infiltrate composition than luminal ones.


Author(s):  
Crescenzio Gallo

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic paths to exploring genetic variability. Experimental results carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.


Methods ◽  
2020 ◽  
Vol 179 ◽  
pp. 89-100 ◽  
Author(s):  
Kyuri Jo ◽  
Beatriz Santos-Buitrago ◽  
Minsu Kim ◽  
Sungmin Rhee ◽  
Carolyn Talcott ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document