scholarly journals Protein Bioinformatics Infrastructure for the Integration and Analysis of Multiple High-Throughput “omics” Data

2010 ◽  
Vol 2010 ◽  
pp. 1-19 ◽  
Author(s):  
Chuming Chen ◽  
Peter B. McGarvey ◽  
Hongzhan Huang ◽  
Cathy H. Wu

High-throughput “omics” technologies bring new opportunities for biological and biomedical researchers to ask complex questions and gain new scientific insights. However, the voluminous, complex, and context-dependent data being maintained in heterogeneous and distributed environments plus the lack of well-defined data standard and standardized nomenclature imposes a major challenge which requires advanced computational methods and bioinformatics infrastructures for integration, mining, visualization, and comparative analysis to facilitate data-driven hypothesis generation and biological knowledge discovery. In this paper, we present the challenges in high-throughput “omics” data integration and analysis, introduce a protein-centric approach for systems integration of large and heterogeneous high-throughput “omics” data including microarray, mass spectrometry, protein sequence, protein structure, and protein interaction data, and use scientific case study to illustrate how one can use varied “omics” data from different laboratories to make useful connections that could lead to new biological knowledge.

2017 ◽  
Vol 2017 ◽  
pp. 1-10 ◽  
Author(s):  
Kalpana Raja ◽  
Matthew Patrick ◽  
Yilin Gao ◽  
Desmond Madu ◽  
Yuyang Yang ◽  
...  

In the past decade, the volume of “omics” data generated by the different high-throughput technologies has expanded exponentially. The managing, storing, and analyzing of this big data have been a great challenge for the researchers, especially when moving towards the goal of generating testable data-driven hypotheses, which has been the promise of the high-throughput experimental techniques. Different bioinformatics approaches have been developed to streamline the downstream analyzes by providing independent information to interpret and provide biological inference. Text mining (also known as literature mining) is one of the commonly used approaches for automated generation of biological knowledge from the huge number of published articles. In this review paper, we discuss the recent advancement in approaches that integrate results from omics data and information generated from text mining approaches to uncover novel biomedical information.


2021 ◽  
Vol 17 (8) ◽  
pp. e1009224
Author(s):  
Ran Duan ◽  
Lin Gao ◽  
Yong Gao ◽  
Yuxuan Hu ◽  
Han Xu ◽  
...  

Computational integrative analysis has become a significant approach in the data-driven exploration of biological problems. Many integration methods for cancer subtyping have been proposed, but evaluating these methods has become a complicated problem due to the lack of gold standards. Moreover, questions of practical importance remain to be addressed regarding the impact of selecting appropriate data types and combinations on the performance of integrative studies. Here, we constructed three classes of benchmarking datasets of nine cancers in TCGA by considering all the eleven combinations of four multi-omics data types. Using these datasets, we conducted a comprehensive evaluation of ten representative integration methods for cancer subtyping in terms of accuracy measured by combining both clustering accuracy and clinical significance, robustness, and computational efficiency. We subsequently investigated the influence of different omics data on cancer subtyping and the effectiveness of their combinations. Refuting the widely held intuition that incorporating more types of omics data always produces better results, our analyses showed that there are situations where integrating more omics data negatively impacts the performance of integration methods. Our analyses also suggested several effective combinations for most cancers under our studies, which may be of particular interest to researchers in omics data analysis.


Author(s):  
August John ◽  
Bo Qin ◽  
Krishna R Kalari ◽  
Liewei Wang ◽  
Jia Yu

The rapid advancement of high-throughput technologies and sharp decrease in cost have opened up the possibility to generate large amount of multi-omics data on an individual basis. The development of high-throughput -omics, including genomics, epigenomics, transcriptomics, proteomics, metabolomics and microbiomics, enables the application of multi-omics technologies in the clinical settings. Combination therapy, defined as disease treatment with two or more drugs to achieve efficacy with lower doses or lower drug toxicity, is the basis for the care of diseases like cancer. Patient-specific multi-omics data integration can help the identification and development of combination therapies. In this review, we provide an overview of different -omics platforms, and discuss the methods for multi-omics, high-throughput, data integration, personalized combination therapy.


2007 ◽  
Vol 8 (10) ◽  
pp. 112 ◽  
Author(s):  
Robert Gentleman ◽  
Wolfgang Huber

F1000Research ◽  
2015 ◽  
Vol 4 ◽  
pp. 1522
Author(s):  
Angela U. Makolo ◽  
Temitayo A. Olagunju

The knowledge of signaling pathways is central to understanding the biological mechanisms of organisms since it has been identified that in eukaryotic organisms, the number of signaling pathways determines the number of ways the organism will react to external stimuli. Signaling pathways are studied using protein interaction networks constructed from protein-protein interaction data obtained from high-throughput experiments. However, these high-throughput methods are known to produce very high rates of false positive and negative interactions. To construct a useful protein interaction network from this noisy data, computational methods are applied to validate the protein-protein interactions. In this study, a computational technique to identify signaling pathways from a protein interaction network constructed using validated protein-protein interaction data was designed.A weighted interaction graph of Saccharomyces Cerevisiae was constructed. The weights were obtained using a Bayesian probabilistic network to estimate the posterior probability of interaction between two proteins given the gene expression measurement as biological evidence. Only interactions above a threshold were accepted for the network model.We were able to identify some pathway segments, one of which is a segment of the pathway that signals the start of the process of meiosis in S. Cerevisiae.


2017 ◽  
Author(s):  
Genevieve L. Stein-O’Brien ◽  
Raman Arora ◽  
Aedin C. Culhane ◽  
Alexander V. Favorov ◽  
Lana X. Garmire ◽  
...  

AbstractOmics data contains signal from the molecular, physical, and kinetic inter- and intra-cellular interactions that control biological systems. Matrix factorization techniques can reveal low-dimensional structure from high-dimensional data that reflect these interactions. These techniques can uncover new biological knowledge from diverse high-throughput omics data in topics ranging from pathway discovery to time course analysis. We review exemplary applications of matrix factorization for systems-level analyses. We discuss appropriate application of these methods, their limitations, and focus on analysis of results to facilitate optimal biological interpretation. The inference of biologically relevant features with matrix factorization enables discovery from high-throughput data beyond the limits of current biological knowledge—answering questions from high-dimensional data that we have not yet thought to ask.


2004 ◽  
Vol 1 (1) ◽  
pp. 111-121 ◽  
Author(s):  
Bjorn Titz ◽  
Matthias Schlesner ◽  
Peter Uetz

Sign in / Sign up

Export Citation Format

Share Document