Vertical and horizontal integration of multi-omics data with miodin

10.1101/431429 ◽

2018 ◽

Cited By ~ 1

Author(s):

Benjamin Ulfenborg

Keyword(s):

Data Analysis ◽

R Package ◽

Heterogeneous Data ◽

Omics Data ◽

Technical Expertise ◽

Multiple Modalities ◽

Level Data ◽

Genomics And Proteomics ◽

Health And Disease ◽

Molecular Layers

Abstract Background Studies on multiple modalities of omics data such as transcriptomics, genomics and proteomics are growing in popularity, since they allow us to investigate complex mechanisms across molecular layers. It is widely recognized that integrative omics analysis holds the promise to unlock novel and actionable biological insights into health and disease. Integration of multi-omics data remains challenging, however, and requires combination of several software tools and extensive technical expertise to account for the properties of heterogeneous data. Results This paper presents the miodin R package, which provides a streamlined workflow-based syntax for multi-omics data analysis. The package allows users to perform analysis and integration of omics data either across experiments on the same samples, or across studies on the same variables. Workflows have been designed to promote transparent data analysis and reduce the technical expertise required to perform low-level data import and processing. Conclusions The miodin package is implemented in R and is freely available for use and extension under the GPL-3 license. Package source, reference documentation and user manual are available at https://gitlab.com/algoromics/miodin.

Vertical and horizontal integration of multi-omics data with miodin

BMC Bioinformatics ◽

10.1186/s12859-019-3224-4 ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 13

Author(s):

Benjamin Ulfenborg

Keyword(s):

Data Analysis ◽

R Package ◽

Heterogeneous Data ◽

Omics Data ◽

Technical Expertise ◽

Horizontal Integration ◽

Level Data ◽

Genomics And Proteomics ◽

Health And Disease ◽

Molecular Layers

Abstract Background Studies on multiple modalities of omics data such as transcriptomics, genomics and proteomics are growing in popularity, since they allow us to investigate complex mechanisms across molecular layers. It is widely recognized that integrative omics analysis holds the promise to unlock novel and actionable biological insights into health and disease. Integration of multi-omics data remains challenging, however, and requires combination of several software tools and extensive technical expertise to account for the properties of heterogeneous data. Results This paper presents the miodin R package, which provides a streamlined workflow-based syntax for multi-omics data analysis. The package allows users to perform analysis of omics data either across experiments on the same samples (vertical integration), or across studies on the same variables (horizontal integration). Workflows have been designed to promote transparent data analysis and reduce the technical expertise required to perform low-level data import and processing. Conclusions The miodin package is implemented in R and is freely available for use and extension under the GPL-3 license. Package source, reference documentation and user manual are available at https://gitlab.com/algoromics/miodin.
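
The distinction between vertical and horizontal integration drawn in this abstract can be illustrated with a small sketch. This is not the miodin API (miodin is an R package); the function names `integrate_vertical` and `integrate_horizontal`, the toy tables and the sample and variable names are all hypothetical, and the sketch assumes sample IDs are unique across studies.

```python
# Toy data (hypothetical): each table maps sample ID -> {variable name -> value}.
rna_study_a = {"s1": {"geneA": 5.1, "geneB": 2.0},
               "s2": {"geneA": 4.7, "geneB": 2.4}}
protein_study_a = {"s1": {"protX": 0.9},
                   "s2": {"protX": 1.3},
                   "s3": {"protX": 1.1}}
rna_study_b = {"t1": {"geneA": 6.0, "geneC": 1.0}}


def integrate_vertical(*tables):
    """Combine variables from several experiments over the samples they share."""
    shared_samples = set(tables[0])
    for table in tables[1:]:
        shared_samples &= set(table)
    merged = {}
    for sample in sorted(shared_samples):
        merged[sample] = {}
        for table in tables:
            merged[sample].update(table[sample])
    return merged


def integrate_horizontal(*tables):
    """Pool samples from several studies over the variables they share."""
    shared_vars = None
    for table in tables:
        for row in table.values():
            shared_vars = set(row) if shared_vars is None else shared_vars & set(row)
    pooled = {}
    for table in tables:
        for sample, row in table.items():
            # Assumes sample IDs do not collide between studies.
            pooled[sample] = {var: row[var] for var in sorted(shared_vars)}
    return pooled


vertical = integrate_vertical(rna_study_a, protein_study_a)
horizontal = integrate_horizontal(rna_study_a, rna_study_b)
```

Here `vertical` keeps only samples `s1` and `s2`, each with both transcript and protein variables, while `horizontal` keeps all three samples but only the shared variable `geneA`.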

multiomics: A user-friendly multi-omics data harmonisation R pipeline

F1000Research ◽

10.12688/f1000research.53453.1 ◽

2021 ◽

Vol 10 ◽

pp. 538

Author(s):

Tyrone Chen ◽

Al J Abadi ◽

Kim-Anh Lê Cao ◽

Sonika Tyagi

Keyword(s):

Data Integration ◽

Case Studies ◽

R Package ◽

Heterogeneous Data ◽

General Purpose ◽

Omics Data ◽

Experimental Conditions ◽

Seamless Integration ◽

Data Object ◽

Omics Data Integration

Data from multiple omics layers of a biological system is growing in quantity, heterogeneity and dimensionality. Simultaneous multi-omics data integration is a growing field of research as it has strong potential to unlock information on previously hidden biological relationships leading to early diagnosis, prognosis and expedited treatments. Many tools for multi-omics data integration are being developed. However, these tools are often restricted to highly specific experimental designs and types of omics data. While some general methods do exist, they require specific data formats and experimental conditions. A major limitation in the field is the lack of a single-omics or multi-omics pipeline which can accept data in an unrefined, information-rich form pre-integration and subsequently generate output for further investigation. There is an increasing demand for a generic multi-omics pipeline to facilitate general-purpose data exploration and analysis of heterogeneous data. Therefore, we present our R multiomics pipeline as an easy-to-use and flexible pipeline that takes unrefined multi-omics data, sample information and user-specified parameters as input to generate a list of output plots and data tables for quality control and downstream analysis. We have demonstrated application of the pipeline on two separate COVID-19 case studies. We enabled limited checkpointing, where intermediate output is staged to allow continuation after errors or interruptions in the pipeline, and the pipeline generates a script for reproducing the analysis to improve reproducibility. A seamless integration with the mixOmics R package is achieved, as the R data object can be loaded and manipulated with mixOmics functions. Our pipeline can be installed as an R package or from the git repository, and is accompanied by detailed documentation with walkthroughs on two case studies. The pipeline is also available as Docker and Singularity containers.

PaintOmics 3: a web resource for the pathway analysis and visualization of multi-omics data

10.1101/281295 ◽

2018 ◽

Author(s):

Rafael Hernández-de-Diego ◽

Sonia Tarazona ◽

Carlos Martínez-Mira ◽

Leandro Balzano-Nogueira ◽

Pedro Furió-Tarí ◽

...

Keyword(s):

Data Analysis ◽

Pathway Analysis ◽

Feature Matching ◽

Omics Data ◽

Data Types ◽

Web Based ◽

Web Resource ◽

Analysis Workflow ◽

Pathway Diagrams ◽

Molecular Layers

Abstract The increasing availability of multi-omic platforms poses new challenges to data analysis. Joint visualization of multi-omics data is instrumental to understanding interconnections across molecular layers and to fully leveraging the biological discovery power offered by the multi-omics approach. We present here PaintOmics 3, a web-based resource for the integrated visualization of multiple omic data types onto KEGG pathway diagrams. PaintOmics 3 combines server-end capabilities for data analysis with the potential of modern web resources for data visualization, providing researchers with a powerful framework for interactive exploration of their multi-omics information. Unlike other visualization tools, PaintOmics 3 covers a complete pathway analysis workflow, including automatic feature name/identifier conversion, multi-layered feature matching, pathway enrichment, network analysis, interactive heatmaps, trend charts, etc. It accepts a wide variety of omic types, including transcriptomics, proteomics and metabolomics, as well as region-based approaches such as ATAC-seq or ChIP-seq data. The tool is freely available at http://bioinfo.cipf.es/paintomics/.

Advancing Biopharmaceutical Process Development by System-Level Data Analysis and Integration of Omics Data

Genomics and Systems Biology of Mammalian Cell Culture ◽

10.1007/10_2010_98 ◽

2011 ◽

pp. 133-163 ◽

Cited By ~ 5

Author(s):

Jochen Schaub ◽

Christoph Clemens ◽

Hitto Kaufmann ◽

Torsten W. Schulz

Keyword(s):

Data Analysis ◽

Process Development ◽

System Level ◽

Omics Data ◽

Level Data

Knowledge guided multi-level network inference

10.1101/2020.02.19.953679 ◽

2020 ◽

Author(s):

Christoph Ogris ◽

Yue Hu ◽

Janine Arloth ◽

Nikola S. Müller

Keyword(s):

Data Analysis ◽

Quantitative Trait ◽

Network Inference ◽

Integrated Analysis ◽

Full Potential ◽

Omics Data ◽

Lasso Regression ◽

Data Set ◽

Level Data ◽

Multi Level

Abstract Constantly decreasing costs of high-throughput profiling on many molecular levels generate vast amounts of so-called multi-omics data. Studying one biomedical question on two or more omic levels provides deeper insights into underlying molecular processes or disease pathophysiology. For the majority of multi-omics data projects, the data analysis is performed level-wise, followed by a combined interpretation of results. A few exceptions exist, for example the pairwise integration for quantitative trait analysis. However, the full potential of integrated data analysis is not leveraged yet, presumably due to the complexity of the data and the lack of toolsets. Here we propose a versatile approach to multi-level integrated analysis: the Knowledge guIded Multi-Omics Network inference approach, KiMONo. KiMONo performs network inference using statistical modeling on top of a powerful knowledge-guided strategy exploiting prior information from biological sources. Within the resulting network, nodes represent features of all input types and edges refer to associations between them, e.g. underlying a disease. Our method infers the network by combining sparse grouped-LASSO regression with a genomic position-confined Biogrid protein-protein interaction prior. In a comprehensive evaluation, we demonstrate that our method is robust to noise and still performs on low-sample-size data. Applied to the five-level data set of the publicly available Pan-cancer collection, KiMONo integrated mutation, epigenetics, transcriptomics, proteomics and clinical information, detecting cancer-specific omic features. Moreover, we analysed a four-level data set from a major depressive disorder cohort, including genetic, epigenetic, transcriptional and clinical data. Here we demonstrated KiMONo's analytical power to identify expression quantitative trait methylation sites and loci and showed its advantage over state-of-the-art methods. Our results show the method's general applicability to the full spectrum of multi-omics data and demonstrate that KiMONo is a powerful approach towards leveraging the full potential of data sets. The method is freely available as an R package (https://github.com/cellmapslab/kimono).

Meta-analysis integrated with multi-omics data analysis to elucidate pathogenic mechanisms of age-related knee osteoarthritis in mice

The Journals of Gerontology Series A ◽

10.1093/gerona/glab386 ◽

2022 ◽

Author(s):

Hirotaka Iijima ◽

Gabrielle Gilmer ◽

Kai Wang ◽

Sruthi Sivakumar ◽

Christopher Evans ◽

...

Keyword(s):

Data Analysis ◽

Knee Osteoarthritis ◽

Meta Analysis ◽

Integrated Approach ◽

Mass Spectrometry Data ◽

Omics Data ◽

Pathogenic Mechanisms ◽

Age Related ◽

Level Data ◽

Omics Data Analysis

Abstract Increased mechanistic insight into the pathogenesis of knee osteoarthritis (KOA) is needed to develop efficacious disease-modifying treatments. Though age-related pathogenic mechanisms are most relevant to the majority of clinically-presenting KOA, the bulk of our mechanistic understanding of KOA has been derived using surgically induced post-traumatic OA (PTOA) models. Here, we took an integrated approach of meta-analysis and multi-omics data analysis to elucidate pathogenic mechanisms of age-related KOA in mice. Protein-level data were integrated with transcriptomic profiling to reveal inflammation, autophagy, and cellular senescence as primary hallmarks of age-related KOA. Importantly, the molecular profiles of cartilage aging were unique from those observed following PTOA, with less than 3% overlap between the two models. At the nexus of the three aging hallmarks, Advanced Glycation End-Product (AGE)/Receptor for AGE emerged as the most statistically robust pathway associated with age-related KOA. This pathway was further supported by analysis of mass spectrometry data. Notably, the change in AGE-RAGE signaling over time was exclusively observed in male mice, suggesting sexual dimorphism in the pathogenesis of age-induced KOA in murine models. Collectively, these findings implicate dysregulation of AGE-RAGE signaling as a sex-dependent driver of age-related KOA.

Architecture of a distributed system for processing heterogeneous data from social networks

Informatization and communication ◽

10.34219/2078-8320-2020-11-4-97-100 ◽

2020 ◽

Vol 4 ◽

pp. 97-100

Author(s):

A.P. Pronichev ◽

Keyword(s):

Social Networks ◽

Data Analysis ◽

Distributed System ◽

Heterogeneous Data ◽

Analysis Process ◽

Support Costs

The article discusses the architecture of a system for collecting and analyzing heterogeneous data from social networks. This architecture is a distributed system of subsystem modules, each of which is responsible for a separate task. The system also allows you to use external systems for data analysis, providing the necessary interface abstraction for connection. This allows for more flexible customization of the data analysis process and reduces development, implementation and support costs.

Integrative Data Analysis from a Unifying Research Synthesis Perspective

10.1093/oso/9780190676001.003.0020 ◽

2018 ◽

Author(s):

Eun-Young Mun ◽

Anne E. Ray

Keyword(s):

Data Analysis ◽

Large Scale ◽

Research Synthesis ◽

Alcohol Intervention ◽

Data Set ◽

Integrative Data Analysis ◽

Level Data ◽

Model Complex ◽

Wide Range ◽

Individual Participant

Integrative data analysis (IDA) is a promising new approach in psychological research and has been well received in the field of alcohol research. This chapter provides a larger unifying research synthesis framework for IDA. Major advantages of IDA of individual participant-level data include better and more flexible ways to examine subgroups, model complex relationships, deal with methodological and clinical heterogeneity, and examine infrequently occurring behaviors. However, between-study heterogeneity in measures, designs, and samples and systematic study-level missing data are significant barriers to IDA and, more broadly, to large-scale research synthesis. Based on the authors’ experience working on the Project INTEGRATE data set, which combined individual participant-level data from 24 independent college brief alcohol intervention studies, it is also recognized that IDA investigations require a wide range of expertise and considerable resources and that some minimum standards for reporting IDA studies may be needed to improve transparency and quality of evidence.

MuSA: a graphical user interface for multi-OMICs data integration in radiogenomic studies

Scientific Reports ◽

10.1038/s41598-021-81200-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Mario Zanfardino ◽

Rossana Castaldo ◽

Katia Pane ◽

Ornella Affinito ◽

Marco Aiello ◽

...

Keyword(s):

User Interface ◽

Data Integration ◽

Graphical User Interface ◽

Data Science ◽

Heterogeneous Data ◽

Biological Information ◽

Omics Data ◽

Correlation Clustering ◽

Downstream Analysis ◽

Omics Data Integration

Abstract Analysis of large-scale omics data along with biomedical images has gained huge interest for predicting phenotypic conditions towards personalized medicine. Multiple layers of investigation, such as genomics, transcriptomics and proteomics, have led to high dimensionality and heterogeneity of data. Multi-omics data integration can provide a meaningful contribution to early diagnosis and an accurate estimate of prognosis and treatment in cancer. Some multi-layer data structures have been developed to integrate multi-omics biological information, but none of these has been developed and evaluated to include radiomic data. We proposed to use MultiAssayExperiment (MAE) as an integrated data structure to combine multi-omics data, facilitating the exploration of heterogeneous data. We improved the usability of the MAE by developing a Multi-omics Statistical Approaches (MuSA) tool that uses a Shiny graphical user interface and simplifies the management and analysis of radiogenomic datasets. The capabilities of MuSA were shown using public breast cancer datasets from TCGA-TCIA databases. The MuSA architecture is modular and can be divided into Pre-processing and Downstream analysis. The pre-processing section allows data filtering and normalization. The downstream analysis section contains modules for data science such as correlation, clustering (i.e., heatmap) and feature selection methods. The results are dynamically shown in MuSA. The MuSA tool provides an easy-to-use way to create, manage and analyze radiogenomic data. The application is specifically designed to guide non-programmer researchers through different computational steps. Integration analysis is implemented in a modular structure, making MuSA easily extensible open-source software.

Using machine learning approaches for multi-omics data analysis: A review

Biotechnology Advances ◽

10.1016/j.biotechadv.2021.107739 ◽

2021 ◽

Vol 49 ◽

pp. 107739

Author(s):

Parminder S. Reel ◽

Smarti Reel ◽

Ewan Pearson ◽

Emanuele Trucco ◽

Emily Jefferson

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Omics Data ◽

Learning Approaches ◽

Omics Data Analysis
