Reconstruction of Large-Scale Gene Regulatory Networks Using Bayesian Model Averaging

Author(s):  
Haseong Kim ◽  
Erol Gelenbe
2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Luis F. Iglesias-Martinez ◽  
Barbara De Kegel ◽  
Walter Kolch

AbstractReconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, many of the existing methods predict numerous false positives and have limited capabilities to integrate other sources of information, such as previously known interactions. Here we introduce KBoost, an algorithm that uses kernel PCA regression, boosting and Bayesian model averaging for fast and accurate reconstruction of gene regulatory networks. We have benchmarked KBoost against other high performing algorithms using three different datasets. The results show that our method compares favorably to other methods across datasets. We have also applied KBoost to a large cohort of close to 2000 breast cancer patients and 24,000 genes in less than 2 h on standard hardware. Our results show that molecularly defined breast cancer subtypes also feature differences in their GRNs. An implementation of KBoost in the form of an R package is available at: https://github.com/Luisiglm/KBoost and as a Bioconductor software package.


2021 ◽  
Author(s):  
Luis F. Iglesias-Martinez ◽  
Barbara De Kegel ◽  
Walter Kolch

AbstractReconstructing gene regulatory networks is crucial to understand biological processes and holds potential for developing personalized treatment. Yet, it is still an open problem as state-of-the-art algorithms are often not able to process large amounts of data within reasonable time. Furthermore, many of the existing methods predict numerous false positives and have limited capabilities to integrate other sources of information, such as previously known interactions. Here we introduce KBoost, an algorithm that uses kernel PCA regression, boosting and Bayesian model averaging for fast and accurate reconstruction of gene regulatory networks. We have benchmarked KBoost against other high performing algorithms using three different datasets. The results show that our method compares favorably to other methods across datasets. We have also applied KBoost to a large cohort of close to 2000 breast cancer patients and 24000 genes in less than 2 hours on standard hardware. Our results show that molecularly defined breast cancer subtypes also feature differences in their GRNs. An implementation of KBoost in the form of an R package is available at: https://github.com/Luisiglm/KBoost.


2014 ◽  
Vol 10 (8) ◽  
pp. 2023-2030 ◽  
Author(s):  
Xun Huang ◽  
Zhike Zi

A new method that uses Bayesian model averaging for linear regression to infer molecular interactions in biological systems with high prediction accuracy and high computational efficiency.


Cell Reports ◽  
2014 ◽  
Vol 9 (6) ◽  
pp. 2290-2303 ◽  
Author(s):  
Delphine Potier ◽  
Kristofer Davie ◽  
Gert Hulselmans ◽  
Marina Naval Sanchez ◽  
Lotte Haagen ◽  
...  

2018 ◽  
Vol 42 (4) ◽  
pp. 423-457 ◽  
Author(s):  
David Kaplan ◽  
Chansoon Lee

This article provides a review of Bayesian model averaging as a means of optimizing the predictive performance of common statistical models applied to large-scale educational assessments. The Bayesian framework recognizes that in addition to parameter uncertainty, there is uncertainty in the choice of models themselves. A Bayesian approach to addressing the problem of model uncertainty is the method of Bayesian model averaging. Bayesian model averaging searches the space of possible models for a set of submodels that satisfy certain scientific principles and then averages the coefficients across these submodels weighted by each model’s posterior model probability (PMP). Using the weighted coefficients for prediction has been shown to yield optimal predictive performance according to certain scoring rules. We demonstrate the utility of Bayesian model averaging for prediction in education research with three examples: Bayesian regression analysis, Bayesian logistic regression, and a recently developed approach for Bayesian structural equation modeling. In each case, the model-averaged estimates are shown to yield better prediction of the outcome of interest than any submodel based on predictive coverage and the log-score rule. Implications for the design of large-scale assessments when the goal is optimal prediction in a policy context are discussed.


Sign in / Sign up

Export Citation Format

Share Document