iTOP: Inferring the Topology of Omics Data

FORESEE: a tool for the systematic comparison of translational drug response modeling pipelines

10.7287/peerj.preprints.27256v1 ◽

2018 ◽

Author(s):

Lisa-Katrin Turnhoff ◽

Ali Hadizadeh Esfahani ◽

Maryam Montazeri ◽

Nina Kusch ◽

Andreas Schuppert

Keyword(s):

Drug Response ◽

Drug Efficacy ◽

Response Prediction ◽

R Package ◽

Supplementary Information ◽

Supplementary File ◽

Data Sets ◽

Training Algorithms ◽

Model Training

Translational models that utilize omics data generated in in vitro studies to predict the drug efficacy of anti-cancer compounds in patients are highly distinct, which complicates the benchmarking process for new computational approaches. In reaction to this, we introduce the uniFied translatiOnal dRug rESponsE prEdiction platform FORESEE, an open-source R-package. FORESEE not only provides a uniform data format for public cell line and patient data sets, but also establishes a standardized environment for drug response prediction pipelines, incorporating various state-of-the-art preprocessing methods, model training algorithms and validation techniques. The modular implementation of individual elements of the pipeline facilitates a straightforward development of combinatorial models, which can be used to re-evaluate and improve already existing pipelines as well as to develop new ones. Availability and Implementation: FORESEE is licensed under GNU General Public License v3.0 and available at https://github.com/JRC-COMBINE/FORESEE. Supplementary Information: Supplementary Files 1 and 2 provide detailed descriptions of the pipeline and the data preparation process, while Supplementary File 3 presents basic use cases of the package.

MOLI: multi-omics late integration with deep neural networks for drug response prediction

Bioinformatics ◽

10.1093/bioinformatics/btz318 ◽

2019 ◽

Vol 35 (14) ◽

pp. i501-i509 ◽

Cited By ~ 28

Author(s):

Hossein Sharifi-Noghabi ◽

Olga Zolotareva ◽

Colin C Collins ◽

Martin Ester

Keyword(s):

Gene Expression ◽

Neural Networks ◽

Prediction Accuracy ◽

Drug Response ◽

Deep Neural Networks ◽

Response Prediction ◽

Supplementary Information ◽

Precision Oncology

Motivation: Historically, gene expression has been shown to be the most informative data for drug response prediction. Recent evidence suggests that integrating additional omics can improve the prediction accuracy, which raises the question of how to integrate the additional omics. Regardless of the integration strategy, clinical utility and translatability are crucial. Thus, we reasoned a multi-omics approach combined with clinical datasets would improve drug response prediction and clinical relevance. Results: We propose MOLI, a multi-omics late integration method based on deep neural networks. MOLI takes somatic mutation, copy number aberration and gene expression data as input, and integrates them for drug response prediction. MOLI uses type-specific encoding sub-networks to learn features for each omics type, concatenates them into one representation and optimizes this representation via a combined cost function consisting of a triplet loss and a binary cross-entropy loss. The former makes the representations of responder samples more similar to each other and different from the non-responders, and the latter makes this representation predictive of the response values. We validate MOLI on in vitro and in vivo datasets for five chemotherapy agents and two targeted therapeutics. Compared to state-of-the-art single-omics and early integration multi-omics methods, MOLI achieves higher prediction accuracy in external validations. Moreover, a significant improvement in MOLI’s performance is observed for targeted drugs when training on a pan-drug input, i.e. using all the drugs with the same target compared to training only on drug-specific inputs. MOLI’s high predictive power suggests it may have utility in precision oncology. Availability and implementation: https://github.com/hosseinshn/MOLI. Supplementary information: Supplementary data are available at Bioinformatics online.
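The combined cost described in this abstract (a triplet loss plus a binary cross-entropy loss on a concatenated multi-omics representation) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation: the function names, the squared Euclidean distance, the margin, and the weighting factor `gamma` are all assumptions made for the example, and the encoders that would produce the per-omics features are omitted.

```python
import math

def concatenate(*omics_features):
    """Late integration: join per-omics feature vectors into one representation."""
    joined = []
    for features in omics_features:
        joined.extend(features)
    return joined

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge-style triplet loss: pull anchor toward positive, push it from negative.

    The margin value is an illustrative assumption.
    """
    return max(0.0, squared_distance(anchor, positive)
               - squared_distance(anchor, negative) + margin)

def binary_cross_entropy(label, prob, eps=1e-12):
    """Binary cross-entropy for one sample; prob is clipped to avoid log(0)."""
    p = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))

def combined_cost(anchor, positive, negative, label, prob, gamma=0.5, margin=1.0):
    """Classification loss plus a weighted triplet term (gamma is hypothetical)."""
    return (binary_cross_entropy(label, prob)
            + gamma * triplet_loss(anchor, positive, negative, margin))

# Usage: two responders (anchor, positive) and one non-responder (negative),
# each represented by concatenated toy features from three omics types.
anchor = concatenate([0.1], [0.2], [0.3])
positive = concatenate([0.1], [0.2], [0.4])
negative = concatenate([0.9], [0.8], [0.7])
cost = combined_cost(anchor, positive, negative, label=1, prob=0.8)
```

In this sketch the triplet term becomes zero once the non-responder is farther from the anchor than the responder by at least the margin, so only the cross-entropy term continues to drive the fit.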

FORESEE: a tool for the systematic comparison of translational drug response modeling pipelines

Bioinformatics ◽

10.1093/bioinformatics/btz145 ◽

2019 ◽

Vol 35 (19) ◽

pp. 3846-3848 ◽

Cited By ~ 6

Author(s):

Lisa-Katrin Turnhoff ◽

Ali Hadizadeh Esfahani ◽

Maryam Montazeri ◽

Nina Kusch ◽

Andreas Schuppert

Keyword(s):

Drug Response ◽

Drug Efficacy ◽

Response Prediction ◽

R Package ◽

Supplementary Information ◽

Training Algorithms ◽

Anti Cancer ◽

Model Training ◽

Response Modeling

Summary: Translational models that utilize omics data generated in in vitro studies to predict the drug efficacy of anti-cancer compounds in patients are highly distinct, which complicates the benchmarking process for new computational approaches. In reaction to this, we introduce the uniFied translatiOnal dRug rESponsE prEdiction platform FORESEE, an open-source R-package. FORESEE not only provides a uniform data format for public cell line and patient datasets, but also establishes a standardized environment for drug response prediction pipelines, incorporating various state-of-the-art pre-processing methods, model training algorithms and validation techniques. The modular implementation of individual elements of the pipeline facilitates a straightforward development of combinatorial models, which can be used to re-evaluate and improve already existing pipelines as well as to develop new ones. Availability and implementation: FORESEE is licensed under GNU General Public License v3.0 and available at https://github.com/JRC-COMBINE/FORESEE and https://doi.org/10.17605/OSF.IO/RF6QK, and provides vignettes for documentation and application both online and in the Supplementary Files 2 and 3. Supplementary information: Supplementary data are available at Bioinformatics online.

Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data

BMC Bioinformatics ◽

10.1186/s12859-021-04146-z ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Sejin Park ◽

Jihee Soh ◽

Hyunju Lee

Keyword(s):

Gene Expression ◽

Cell Line ◽

Drug Response ◽

Cross Validation ◽

External Validation ◽

Response Prediction ◽

Omics Data ◽

Three Stages ◽

Triplet Loss ◽

Supervised Feature Extraction

Background: Predicting the drug response of a patient is important for precision oncology. In recent studies, multi-omics data have been used to improve the prediction accuracy of drug response. Although multi-omics data are good resources for drug response prediction, the large dimension of data tends to hinder performance improvement. In this study, we aimed to develop a new method, which can effectively reduce the large dimension of data, based on the supervised deep learning model for predicting drug response. Results: We proposed a novel method called Supervised Feature Extraction Learning using Triplet loss (Super.FELT) for drug response prediction. Super.FELT consists of three stages, namely, feature selection, feature encoding using a supervised method, and binary classification of drug response (sensitive or resistant). We used multi-omics data including mutation, copy number aberration, and gene expression, and these were obtained from cell lines [Genomics of Drug Sensitivity in Cancer (GDSC), Cancer Cell Line Encyclopedia (CCLE), and Cancer Therapeutics Response Portal (CTRP)], patient-derived tumor xenografts (PDX), and The Cancer Genome Atlas (TCGA). GDSC was used for training and cross-validation tests, and CCLE, CTRP, PDX, and TCGA were used for external validation. We performed ablation studies for the three stages and verified that the use of multi-omics data guarantees better performance of drug response prediction. Our results verified that Super.FELT outperformed the other methods at external validation on PDX and TCGA and was good at cross-validation on GDSC and external validation on CCLE and CTRP. In addition, through our experiments, we confirmed that using multi-omics data is useful for external non-cell line data. Conclusion: By separating the three stages, Super.FELT achieved better performance than the other methods. Through our results, we found that it is important to train encoders and a classifier independently, especially for external test on PDX and TCGA. Moreover, although gene expression is the most powerful data on cell line data, multi-omics promises better performance for external validation on non-cell line data than gene expression data. Source codes of Super.FELT are available at https://github.com/DMCB-GIST/Super.FELT.

FORESEE: a tool for the systematic comparison of translational drug response modeling pipelines

10.7287/peerj.preprints.27256 ◽

2019 ◽

Author(s):

Lisa-Katrin Turnhoff ◽

Ali Hadizadeh Esfahani ◽

Maryam Montazeri ◽

Nina Kusch ◽

Andreas Schuppert

Keyword(s):

Drug Response ◽

Drug Efficacy ◽

Response Prediction ◽

R Package ◽

Supplementary Information ◽

Supplementary File ◽

Data Sets ◽

Training Algorithms ◽

Model Training

Translational models that utilize omics data generated in in vitro studies to predict the drug efficacy of anti-cancer compounds in patients are highly distinct, which complicates the benchmarking process for new computational approaches. In reaction to this, we introduce the uniFied translatiOnal dRug rESponsE prEdiction platform FORESEE, an open-source R-package. FORESEE not only provides a uniform data format for public cell line and patient data sets, but also establishes a standardized environment for drug response prediction pipelines, incorporating various state-of-the-art preprocessing methods, model training algorithms and validation techniques. The modular implementation of individual elements of the pipeline facilitates a straightforward development of combinatorial models, which can be used to re-evaluate and improve already existing pipelines as well as to develop new ones. Availability and Implementation: FORESEE is licensed under GNU General Public License v3.0 and available at https://github.com/JRC-COMBINE/FORESEE. Supplementary Information: Supplementary Files 1 and 2 provide detailed descriptions of the pipeline and the data preparation process, while Supplementary File 3 presents basic use cases of the package.

FORESEE: a tool for the systematic comparison of translational drug response modeling pipelines

10.7287/peerj.preprints.27256v2 ◽

2019 ◽

Author(s):

Lisa-Katrin Turnhoff ◽

Ali Hadizadeh Esfahani ◽

Maryam Montazeri ◽

Nina Kusch ◽

Andreas Schuppert

Keyword(s):

Drug Response ◽

Drug Efficacy ◽

Response Prediction ◽

R Package ◽

Supplementary Information ◽

Supplementary File ◽

Data Sets ◽

Training Algorithms ◽

Model Training

Translational models that utilize omics data generated in in vitro studies to predict the drug efficacy of anti-cancer compounds in patients are highly distinct, which complicates the benchmarking process for new computational approaches. In reaction to this, we introduce the uniFied translatiOnal dRug rESponsE prEdiction platform FORESEE, an open-source R-package. FORESEE not only provides a uniform data format for public cell line and patient data sets, but also establishes a standardized environment for drug response prediction pipelines, incorporating various state-of-the-art preprocessing methods, model training algorithms and validation techniques. The modular implementation of individual elements of the pipeline facilitates a straightforward development of combinatorial models, which can be used to re-evaluate and improve already existing pipelines as well as to develop new ones. Availability and Implementation: FORESEE is licensed under GNU General Public License v3.0 and available at https://github.com/JRC-COMBINE/FORESEE. Supplementary Information: Supplementary Files 1 and 2 provide detailed descriptions of the pipeline and the data preparation process, while Supplementary File 3 presents basic use cases of the package.

Graph Convolutional Network for Drug Response Prediction Using Gene Expression Data

Mathematics ◽

10.3390/math9070772 ◽

2021 ◽

Vol 9 (7) ◽

pp. 772

Author(s):

Seonghun Kim ◽

Seockhun Bae ◽

Yinhua Piao ◽

Kyuri Jo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Large Scale ◽

Drug Response ◽

Response Prediction ◽

Biological Data ◽

Expression Data ◽

Convolutional Network ◽

Essential Information ◽

Protein Protein Interaction

Genomic profiles of cancer patients such as gene expression have become a major source to predict responses to drugs in the era of personalized medicine. As large-scale drug screening data with cancer cell lines are available, a number of computational methods have been developed for drug response prediction. However, few methods incorporate both gene expression data and the biological network, which can harbor essential information about the underlying process of the drug response. We proposed an analysis framework called DrugGCN for prediction of Drug response using a Graph Convolutional Network (GCN). DrugGCN first generates a gene graph by combining a Protein-Protein Interaction (PPI) network and gene expression data with feature selection of drug-related genes, and the GCN model detects the local features such as subnetworks of genes that contribute to the drug response by localized filtering. We demonstrated the effectiveness of DrugGCN using biological data showing its high prediction accuracy among the competing methods.
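The idea of localized filtering on a gene graph can be illustrated with a single neighborhood-averaging step. The sketch below is not the DrugGCN model: it swaps in the widely used symmetric normalization with self-loops as a simple stand-in, since the abstract does not specify the filter, and the toy three-gene graph, the function names, and the expression values are assumptions made for the example.

```python
import math

def normalized_adjacency(adj):
    """Return D^(-1/2) (A + I) D^(-1/2) for an undirected gene graph.

    adj is a square 0/1 matrix (list of lists); self-loops are added so that
    each gene keeps part of its own signal and every degree is at least 1.
    """
    n = len(adj)
    with_loops = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
                  for i in range(n)]
    degree = [sum(row) for row in with_loops]
    return [[with_loops[i][j] / math.sqrt(degree[i] * degree[j])
             for j in range(n)] for i in range(n)]

def graph_filter(adj, expression):
    """One localized filtering step: mix each gene's value with its neighbors'."""
    norm = normalized_adjacency(adj)
    n = len(expression)
    return [sum(norm[i][j] * expression[j] for j in range(n)) for i in range(n)]

# Usage: a toy path graph gene0 - gene1 - gene2 (for example, edges taken from
# a PPI network) with expression measured only on gene0.
ppi = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
filtered = graph_filter(ppi, [1.0, 0.0, 0.0])
```

After one step the signal on gene0 has spread to its direct neighbor gene1 but not to gene2, which is what makes the filter local; stacking such steps widens the neighborhood each gene can draw on.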

iMIRAGE: an R package to impute microRNA expression using protein-coding genes

Bioinformatics ◽

10.1093/bioinformatics/btz939 ◽

2019 ◽

Vol 36 (8) ◽

pp. 2608-2610

Author(s):

Aritro Nath ◽

Jeremy Chang ◽

R Stephanie Huang

Keyword(s):

Gene Expression ◽

Small Rnas ◽

Transcriptional Regulators ◽

R Package ◽

Machine Learning Algorithms ◽

Supplementary Information ◽

Expression Data ◽

Protein Coding ◽

Altered Protein ◽

Independent Test

Summary: MicroRNAs (miRNAs) are critical post-transcriptional regulators of gene expression. Due to challenges in accurate profiling of small RNAs, a vast majority of public transcriptome datasets lack reliable miRNA profiles. However, the biological consequence of miRNA activity in the form of altered protein-coding gene (PCG) expression can be captured using machine-learning algorithms. Here, we present iMIRAGE (imputed miRNA activity from gene expression), a convenient tool to predict miRNA expression using PCG expression of the test datasets. The iMIRAGE package provides an integrated workflow for normalization and transformation of miRNA and PCG expression data, along with the option to utilize predicted miRNA targets to impute miRNA activity from independent test PCG datasets. Availability and implementation: The iMIRAGE package for R, along with package documentation and vignette, is available at https://aritronath.github.io/iMIRAGE/index.html. Supplementary information: Supplementary data are available at Bioinformatics online.

Pathway-Based Drug Response Prediction Using Similarity Identification in Gene Expression

Frontiers in Genetics ◽

10.3389/fgene.2020.01016 ◽

2020 ◽

Vol 11 ◽

Author(s):

Seyed Ali Madani Tonekaboni ◽

Gangesh Beri ◽

Benjamin Haibe-Kains

Keyword(s):

Gene Expression ◽

Drug Response ◽

Response Prediction

Drug response prediction by ensemble learning and drug-induced gene expression signatures

Genomics ◽

10.1016/j.ygeno.2018.07.002 ◽

2019 ◽

Vol 111 (5) ◽

pp. 1078-1088 ◽

Cited By ~ 3

Author(s):

Mehmet Tan ◽

Ozan Fırat Özgül ◽

Batuhan Bardak ◽

Işıksu Ekşioğlu ◽

Suna Sabuncuoğlu

Keyword(s):

Gene Expression ◽

Ensemble Learning ◽

Drug Response ◽

Response Prediction ◽

Drug Induced ◽

Gene Expression Signatures
