A functional landscape of chronic kidney disease entities from public transcriptomic data

Mapping Intimacies ◽

10.1101/265447 ◽

2018 ◽

Author(s):

Ferenc Tajti ◽

Christoph Kuppe ◽

Asier Antoranz ◽

Mahmoud M. Ibrahim ◽

Hyojin Kim ◽

...

Keyword(s):

Gene Expression ◽

Chronic Kidney Disease ◽

Kidney Disease ◽

Gene Expression Data ◽

Molecular Mechanisms ◽

Expression Profiles ◽

Kidney Tissue ◽

Therapeutic Targets ◽

Expression Data ◽

Disease Entities

AbstractTo develop efficient therapies and identify novel early biomarkers for chronic kidney disease an understanding of the molecular mechanisms orchestrating it is essential. We here set out to understand how differences in CKD origin are reflected in gene expression. To this end, we integrated publicly available human glomerular microarray gene expression data for nine kidney disease entities that account for a majority of CKD worldwide. We included data from five distinct studies and compared glomerular gene expression profiles to that of non-tumor parts of kidney cancer nephrectomy tissues. A major challenge was the integration of the data from different sources, platforms and conditions, that we mitigated with a bespoke stringent procedure. This allowed us to perform a global transcriptome-based delineation of different kidney disease entities, obtaining a landscape of their similarities and differences based on the genes that acquire a consistent differential expression between each kidney disease entity and nephrectomy tissue. Furthermore, we derived functional insights by inferring activity of signaling pathways and transcription factors from the collected gene expression data, and identified potential drug candidates based on expression signature matching. We validated representative findings by immunostaining in human kidney biopsies indicating e.g. that the transcription factor FOXM1 is significantly and specifically expressed in parietal epithelial cells in RPGN whereas not expressed in control kidney tissue. These results provide a foundation to comprehend the specific molecular mechanisms underlying different kidney disease entities, that can pave the way to identify biomarkers and potential therapeutic targets. To facilitate this, we provide our results as a free interactive web application: https://saezlab.shinyapps.io/ckd_landscape/.Translational StatementChronic kidney disease is a combination of entities with different etiologies. We integrate and analyse transcriptomics analysis of glomerular from different entities to dissect their different pathophysiology, what might help to identify novel entity-specific therapeutic targets.

Download Full-text

Random-Forest (RF) and Support Vector Machine (SVM) Implementation for Analysis of Gene Expression Data in Chronic Kidney Disease (CKD)

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/546/5/052066 ◽

2019 ◽

Vol 546 ◽

pp. 052066 ◽

Cited By ~ 1

Author(s):

Zuherman Rustam ◽

Ely Sudarsono ◽

Devvi Sarwinda

Keyword(s):

Gene Expression ◽

Chronic Kidney Disease ◽

Support Vector Machine ◽

Random Forest ◽

Kidney Disease ◽

Gene Expression Data ◽

Support Vector ◽

Expression Data

Download Full-text

Predicting Host Immune Cell Dynamics and Key Disease-Associated Genes Using Tissue Transcriptional Profiles

Processes ◽

10.3390/pr7050301 ◽

2019 ◽

Vol 7 (5) ◽

pp. 301

Author(s):

Muying Wang ◽

Satoshi Fukuyama ◽

Yoshihiro Kawaoka ◽

Jason E. Shoemaker

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Mean Squared Error ◽

Expression Profiles ◽

Statistical Tests ◽

Critical Factor ◽

Expression Data ◽

Cell Dynamics ◽

Cell Counts

Motivation: Immune cell dynamics is a critical factor of disease-associated pathology (immunopathology) that also impacts the levels of mRNAs in diseased tissue. Deconvolution algorithms attempt to infer cell quantities in a tissue/organ sample based on gene expression profiles and are often evaluated using artificial, non-complex samples. Their accuracy on estimating cell counts given temporal tissue gene expression data remains not well characterized and has never been characterized when using diseased lung. Further, how to remove the effects of cell migration on transcript counts to improve discovery of disease factors is an open question. Results: Four cell count inference (i.e., deconvolution) tools are evaluated using microarray data from influenza-infected lung sampled at several time points post-infection. The analysis finds that inferred cell quantities are accurate only for select cell types and there is a tendency for algorithms to have a good relative fit (R 2 ) but a poor absolute fit (normalized mean squared error; NMSE), which suggests systemic biases exist. Nonetheless, using cell fraction estimates to adjust gene expression data, we show that genes associated with influenza virus replication and increased infection pathology are more likely to be identified as significant than when applying traditional statistical tests.

Download Full-text

Uncovering Potential Therapeutic Targets in Colorectal Cancer by Deciphering Mutational Status and Expression of Druggable Oncogenes

Cancers ◽

10.3390/cancers11070983 ◽

2019 ◽

Vol 11 (7) ◽

pp. 983 ◽

Cited By ~ 3

Author(s):

Otília Menyhart ◽

Tatsuhiko Kakisaka ◽

Lőrinc Sándor Pongor ◽

Hiroyuki Uetake ◽

Ajay Goel ◽

...

Keyword(s):

Gene Expression ◽

Colorectal Cancer ◽

Gene Expression Data ◽

Drug Targets ◽

Therapeutic Targets ◽

Independent Set ◽

Driver Mutations ◽

Expression Data ◽

Expression Levels ◽

Mutational Status

Background: Numerous driver mutations have been identified in colorectal cancer (CRC), but their relevance to the development of targeted therapies remains elusive. The secondary effects of pathogenic driver mutations on downstream signaling pathways offer a potential approach for the identification of therapeutic targets. We aimed to identify differentially expressed genes as potential drug targets linked to driver mutations. Methods: Somatic mutations and the gene expression data of 582 CRC patients were utilized, incorporating the mutational status of 39,916 and the expression levels of 20,500 genes. To uncover candidate targets, the expression levels of various genes in wild-type and mutant cases for the most frequent disruptive mutations were compared with a Mann–Whitney test. A survival analysis was performed in 2100 patients with transcriptomic gene expression data. Up-regulated genes associated with worse survival were filtered for potentially actionable targets. The most significant hits were validated in an independent set of 171 CRC patients. Results: Altogether, 426 disruptive mutation-associated upregulated genes were identified. Among these, 95 were linked to worse recurrence-free survival (RFS). Based on the druggability filter, 37 potentially actionable targets were revealed. We selected seven genes and validated their expression in 171 patient specimens. The best independently validated combinations were DUSP4 (p = 2.6 × 10−12) in ACVR2A mutated (7.7%) patients; BMP4 (p = 1.6 × 10−04) in SOX9 mutated (8.1%) patients; TRIB2 (p = 1.35 × 10−14) in ACVR2A mutated patients; VSIG4 (p = 2.6 × 10−05) in ANK3 mutated (7.6%) patients, and DUSP4 (p = 7.1 × 10−04) in AMER1 mutated (8.2%) patients. Conclusions: The results uncovered potentially druggable genes in colorectal cancer. The identified mutations could enable future patient stratification for targeted therapy.

Download Full-text

Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis

Molecular BioSystems ◽

10.1039/c4mb00328d ◽

2015 ◽

Vol 11 (1) ◽

pp. 86-96 ◽

Cited By ~ 17

Author(s):

Aakash Chavan Ravindranath ◽

Nolen Perualila-Tan ◽

Adetayo Kasim ◽

Georgios Drakakis ◽

Sonia Liggi ◽

...

Keyword(s):

Gene Expression ◽

Ligand Binding ◽

Gene Expression Data ◽

Mechanism Of Action ◽

In Silico ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Expression Data ◽

Connectivity Map ◽

Action Analysis

Integrating gene expression profiles with certain proteins can improve our understanding of the fundamental mechanisms in protein–ligand binding.

Download Full-text

Building Gene Networks by Analyzing Gene Expression Profiles

Advanced Methodologies and Technologies in Medicine and Healthcare - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-5225-7489-7.ch003 ◽

2019 ◽

pp. 27-44

Author(s):

Crescenzio Gallo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Networks ◽

Dna Microarrays ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Expression Data ◽

Gene Expressions ◽

Over Time

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic paths to exploring genetic variability. Experimental results carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.

Download Full-text

Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data

eLife ◽

10.7554/elife.26476 ◽

2017 ◽

Vol 6 ◽

Cited By ~ 107

Author(s):

Julien Racle ◽

Kaat de Jonge ◽

Petra Baumgaertner ◽

Daniel E Speiser ◽

David Gfeller

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Expression Profiles ◽

Cell Types ◽

Response To Therapy ◽

Expression Data ◽

Cell Type ◽

Tumor Gene Expression ◽

Tumor Gene

Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).

Download Full-text

ExAtlas: An interactive online tool for meta-analysis of gene expression data

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720015500195 ◽

2015 ◽

Vol 13 (06) ◽

pp. 1550019 ◽

Cited By ~ 37

Author(s):

Alexei A. Sharov ◽

David Schlessinger ◽

Minoru S. H. Ko

Keyword(s):

Gene Expression ◽

Gene Ontology ◽

Gene Expression Data ◽

Fixed Effects ◽

Expression Profiles ◽

Meta Analysis ◽

Data Sets ◽

Expression Data ◽

Gene Set ◽

Public Data

We have developed ExAtlas, an on-line software tool for meta-analysis and visualization of gene expression data. In contrast to existing software tools, ExAtlas compares multi-component data sets and generates results for all combinations (e.g. all gene expression profiles versus all Gene Ontology annotations). ExAtlas handles both users’ own data and data extracted semi-automatically from the public repository (GEO/NCBI database). ExAtlas provides a variety of tools for meta-analyses: (1) standard meta-analysis (fixed effects, random effects, z-score, and Fisher’s methods); (2) analyses of global correlations between gene expression data sets; (3) gene set enrichment; (4) gene set overlap; (5) gene association by expression profile; (6) gene specificity; and (7) statistical analysis (ANOVA, pairwise comparison, and PCA). ExAtlas produces graphical outputs, including heatmaps, scatter-plots, bar-charts, and three-dimensional images. Some of the most widely used public data sets (e.g. GNF/BioGPS, Gene Ontology, KEGG, GAD phenotypes, BrainScan, ENCODE ChIP-seq, and protein–protein interaction) are pre-loaded and can be used for functional annotations.

Download Full-text

Machine learning analysis of gene expression data reveals novel diagnostic and prognostic biomarkers and identifies therapeutic targets for soft tissue sarcomas

PLoS Computational Biology ◽

10.1371/journal.pcbi.1006826 ◽

2019 ◽

Vol 15 (2) ◽

pp. e1006826 ◽

Cited By ~ 14

Author(s):

David G. P. van IJzendoorn ◽

Karoly Szuhai ◽

Inge H. Briaire-de Bruijn ◽

Marie Kostine ◽

Marieke L. Kuijjer ◽

...

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Soft Tissue ◽

Gene Expression Data ◽

Therapeutic Targets ◽

Soft Tissue Sarcomas ◽

Prognostic Biomarkers ◽

Expression Data ◽

Learning Analysis

Download Full-text

GEDS: A Gene Expression Display Server for mRNAs, miRNAs and Proteins

Cells ◽

10.3390/cells8070675 ◽

2019 ◽

Vol 8 (7) ◽

pp. 675 ◽

Cited By ~ 5

Author(s):

Xia ◽

Liu ◽

Zhang ◽

Guo

Keyword(s):

Gene Expression ◽

Cell Lines ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Cancer Cell Line ◽

Tissue Expression ◽

The Cancer Genome Atlas ◽

Expression Data ◽

Protein Levels

High-throughput technologies generate a tremendous amount of expression data on mRNA, miRNA and protein levels. Mining and visualizing the large amount of expression data requires sophisticated computational skills. An easy to use and user-friendly web-server for the visualization of gene expression profiles could greatly facilitate data exploration and hypothesis generation for biologists. Here, we curated and normalized the gene expression data on mRNA, miRNA and protein levels in 23315, 9009 and 9244 samples, respectively, from 40 tissues (The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GETx)) and 1594 cell lines (Cancer Cell Line Encyclopedia (CCLE) and MD Anderson Cell Lines Project (MCLP)). Then, we constructed the Gene Expression Display Server (GEDS), a web-based tool for quantification, comparison and visualization of gene expression data. GEDS integrates multiscale expression data and provides multiple types of figures and tables to satisfy several kinds of user requirements. The comprehensive expression profiles plotted in the one-stop GEDS platform greatly facilitate experimental biologists utilizing big data for better experimental design and analysis. GEDS is freely available on http://bioinfo.life.hust.edu.cn/web/GEDS/.

Download Full-text

Association Between Gene Expression Profiles and Commonly Mutated Genes In The Hematopoietic Stem Cells Of Patients With Myelodysplastic Syndromes

Blood ◽

10.1182/blood.v122.21.2779.2779 ◽

2013 ◽

Vol 122 (21) ◽

pp. 2779-2779 ◽

Cited By ~ 1

Author(s):

Andrea Pellagatti ◽

Moritz Gerstung ◽

Elli Papaemmanuil ◽

Luca Malcovati ◽

Aristoteles Giagounidis ◽

...

Keyword(s):

Gene Expression ◽

Differentially Expressed Genes ◽

Gene Expression Data ◽

Expression Profiles ◽

Gene Mutations ◽

Gene Expression Profiles ◽

Differentially Expressed ◽

Expression Data ◽

Common Gene ◽

Mutational Status

Abstract A particular profile of gene expression can reflect an underlying molecular abnormality in malignancy. Distinct gene expression profiles and deregulated gene pathways can be driven by specific gene mutations and may shed light on the biology of the disease and lead to the identification of new therapeutic targets. We selected 143 cases from our large-scale gene expression profiling (GEP) dataset on bone marrow CD34+ cells from patients with myelodysplastic syndromes (MDS), for which matching genotyping data were obtained using next-generation sequencing of a comprehensive list of 111 genes involved in myeloid malignancies (including the spliceosomal genes SF3B1, SRSF2, U2AF1 and ZRSR2, as well as TET2, ASXL1and many other). The GEP data were then correlated with the mutational status to identify significantly differentially expressed genes associated with each of the most common gene mutations found in MDS. The expression levels of the mutated genes analyzed were generally lower in patients carrying a mutation than in patients wild-type for that gene (e.g. SF3B1, ASXL1 and TP53), with the exception of RUNX1 for which patients carrying a mutation showed higher expression levels than patients without mutation. Principal components analysis showed that the main directions of gene expression changes (principal components) tend to coincide with some of the common gene mutations, including SF3B1, SRSF2 and TP53. SF3B1 and STAG2 were the mutated genes showing the highest number of associated significantly differentially expressed genes, including ABCB7 as differentially expressed in association with SF3B1 mutation and SULT2A1 in association with STAG2 mutation. We found distinct differentially expressed genes associated with the four most common splicing gene mutations (SF3B1, SRSF2, U2AF1 and ZRSR2) in MDS, suggesting that different phenotypes associated with these mutations may be driven by different effects on gene expression and that the target gene may be different. We have also evaluated the prognostic impact of the GEP data in comparison with that of the genotype data and importantly we have found a larger contribution of gene expression data in predicting progression free survival compared to mutation-based multivariate survival models. In summary, this analysis correlating gene expression data with genotype data has revealed that the mutational status shapes the gene expression landscape. We have identified deregulated genes associated with the most common gene mutations in MDS and found that the prognostic power of gene expression data is greater than the prognostic power provided by mutation data. AP and MG contributed equally to this work. JB and PJC are co-senior authors. Disclosures: No relevant conflicts of interest to declare.

Download Full-text