CCA based multi-view feature selection for multi-omics data integration

Mapping Intimacies ◽

10.1101/243733 ◽

2018 ◽

Author(s):

Yasser El-Manzalawy

Keyword(s):

Feature Selection ◽

Feature Fusion ◽

Individualized Medicine ◽

Genomic Medicine ◽

Renal Clear Cell Carcinoma ◽

Omics Data ◽

Single View ◽

Technological Advances ◽

Using Data ◽

Omics Data Integration

AbstractRecent technological advances in high-throughput omics technologies and their applications in genomic medicine have opened up outstanding opportunities for individualized medicine. However, several challenges arise in the integrative analysis of such data including heterogeneity and high dimensionality of the omics data. In this study, we present a novel multi-view feature selection algorithm based on the well-known canonical correlation analysis (CCA) statistical method for jointly selecting discriminative features from multi-omics data sources (multi-views). Our results demonstrate that models for predicting kidney renal clear cell carcinoma (KIRC) survival using our proposed method for jointly selecting discriminative features from copy number alteration (CNA), gene expression RNA-Seq, and reverse-phase protein arrays (RPPA) views outperform models trained using single-view data as well as three integrated models developed using data fusion approaches including CCA-based feature fusion.

Download Full-text

CCA based multi-view feature selection for multi-omics data integration

2018 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) ◽

10.1109/cibcb.2018.8404968 ◽

2018 ◽

Cited By ~ 2

Author(s):

Yasser El-Manzalawy

Keyword(s):

Feature Selection ◽

Data Integration ◽

Omics Data ◽

Selection For ◽

Omics Data Integration

Download Full-text

0311 - Effect of ammonia on the dynamics of anaerobic digestion microbiome: omics data integration in a time-course context

10.26226/morressier.5b5199beb1b87b000ecee94b ◽

2018 ◽

Author(s):

Olivier Chapleur

Keyword(s):

Anaerobic Digestion ◽

Data Integration ◽

Time Course ◽

Omics Data ◽

Omics Data Integration

Download Full-text

Multi-omics data integration reveals correlated regulatory features of triple negative breast cancer

Molecular Omics ◽

10.1039/d1mo00117e ◽

2021 ◽

Author(s):

Kevin Chappell ◽

Kanishka Manna ◽

Charity L. Washam ◽

Stefan Graw ◽

Duah Alkam ◽

...

Keyword(s):

Breast Cancer ◽

Data Integration ◽

Triple Negative Breast Cancer ◽

Triple Negative ◽

Biological Pathways ◽

Omics Data ◽

Insight Into ◽

Omics Data Integration

Multi-omics data integration of triple negative breast cancer (TNBC) provides insight into biological pathways.

Download Full-text

A Detailed Catalogue of Multi-Omics Methodologies for Identification of Putative Biomarkers and Causal Molecular Networks in Translational Cancer Research

International Journal of Molecular Sciences ◽

10.3390/ijms22062822 ◽

2021 ◽

Vol 22 (6) ◽

pp. 2822

Author(s):

Efstathios Iason Vlachavas ◽

Jonas Bohn ◽

Frank Ückert ◽

Sylvia Nürnberg

Keyword(s):

Cancer Research ◽

Clinical Information ◽

Disease Diagnosis ◽

Molecular Data ◽

Molecular Networks ◽

Biological Knowledge ◽

Omics Data ◽

Translational Cancer Research ◽

Using Data ◽

Biological Entities

Recent advances in sequencing and biotechnological methodologies have led to the generation of large volumes of molecular data of different omics layers, such as genomics, transcriptomics, proteomics and metabolomics. Integration of these data with clinical information provides new opportunities to discover how perturbations in biological processes lead to disease. Using data-driven approaches for the integration and interpretation of multi-omics data could stably identify links between structural and functional information and propose causal molecular networks with potential impact on cancer pathophysiology. This knowledge can then be used to improve disease diagnosis, prognosis, prevention, and therapy. This review will summarize and categorize the most current computational methodologies and tools for integration of distinct molecular layers in the context of translational cancer research and personalized therapy. Additionally, the bioinformatics tools Multi-Omics Factor Analysis (MOFA) and netDX will be tested using omics data from public cancer resources, to assess their overall robustness, provide reproducible workflows for gaining biological knowledge from multi-omics data, and to comprehensively understand the significantly perturbed biological entities in distinct cancer types. We show that the performed supervised and unsupervised analyses result in meaningful and novel findings.

Download Full-text

MuSA: a graphical user interface for multi-OMICs data integration in radiogenomic studies

Scientific Reports ◽

10.1038/s41598-021-81200-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Mario Zanfardino ◽

Rossana Castaldo ◽

Katia Pane ◽

Ornella Affinito ◽

Marco Aiello ◽

...

Keyword(s):

User Interface ◽

Data Integration ◽

Graphical User Interface ◽

Data Science ◽

Heterogeneous Data ◽

Biological Information ◽

Omics Data ◽

Correlation Clustering ◽

Downstream Analysis ◽

Omics Data Integration

AbstractAnalysis of large-scale omics data along with biomedical images has gaining a huge interest in predicting phenotypic conditions towards personalized medicine. Multiple layers of investigations such as genomics, transcriptomics and proteomics, have led to high dimensionality and heterogeneity of data. Multi-omics data integration can provide meaningful contribution to early diagnosis and an accurate estimate of prognosis and treatment in cancer. Some multi-layer data structures have been developed to integrate multi-omics biological information, but none of these has been developed and evaluated to include radiomic data. We proposed to use MultiAssayExperiment (MAE) as an integrated data structure to combine multi-omics data facilitating the exploration of heterogeneous data. We improved the usability of the MAE, developing a Multi-omics Statistical Approaches (MuSA) tool that uses a Shiny graphical user interface, able to simplify the management and the analysis of radiogenomic datasets. The capabilities of MuSA were shown using public breast cancer datasets from TCGA-TCIA databases. MuSA architecture is modular and can be divided in Pre-processing and Downstream analysis. The pre-processing section allows data filtering and normalization. The downstream analysis section contains modules for data science such as correlation, clustering (i.e., heatmap) and feature selection methods. The results are dynamically shown in MuSA. MuSA tool provides an easy-to-use way to create, manage and analyze radiogenomic data. The application is specifically designed to guide no-programmer researchers through different computational steps. Integration analysis is implemented in a modular structure, making MuSA an easily expansible open-source software.

Download Full-text