A Multiple-trait Bayesian Variable Selection Regression Method for Integrating Phenotypic Causal Networks in Genome-Wide Association Studies

Mapping Intimacies ◽

10.1101/847285 ◽

2019 ◽

Author(s):

Zigui Wang ◽

Deborah Chapman ◽

Gota Morota ◽

Hao Cheng

Keyword(s):

Association Studies ◽

Bayesian Variable Selection ◽

Regression Method ◽

Bayesian Regression ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Multiple Trait ◽

Genome Wide ◽

Highest Posterior Density ◽

Causal Structures

ABSTRACTBayesian regression methods that incorporate different mixture priors for marker effects are used in multi-trait genomic prediction. These methods can also be extended to genome-wide association studies (GWAS). In multiple-trait GWAS, incorporating the underlying causal structures among traits is essential for comprehensively understanding the relationship between genotypes and traits of interest. Therefore, we develop a GWAS methodology, SEM-BayesCΠ, which, by applying the structural equation model (SEM), can be used to incorporate causal structures into a multi-trait Bayesian regression method using mixture priors. The performance of SEM-BayesCΠ was demonstrated by comparing its GWAS results with those from multi-trait BayesCΠ. Through the inductive causation (IC) algorithm, three potential causal structures were inferred of 0.9 highest posterior density (HPD) interval. SEM-BayesCΠ provides a more comprehensive understanding of the genotype-phenotype mapping than multi-trait BayesCΠ by performing GWAS based on indirect, direct and overall marker effects. The software tool JWAS offers open-source routines to perform these analyses.

Download Full-text

A Multiple-Trait Bayesian Variable Selection Regression Method for Integrating Phenotypic Causal Networks in Genome-Wide Association Studies

G3 Genes|Genome|Genetics ◽

10.1534/g3.120.401618 ◽

2020 ◽

Vol 10 (12) ◽

pp. 4439-4448

Author(s):

Zigui Wang ◽

Deborah Chapman ◽

Gota Morota ◽

Hao Cheng

Keyword(s):

Association Studies ◽

Bayesian Variable Selection ◽

Bayesian Regression ◽

Genome Wide Association ◽

Superior Performance ◽

Genome Wide Association Studies ◽

Multiple Trait ◽

Regression Methods ◽

Genome Wide ◽

Causal Structures

Bayesian regression methods that incorporate different mixture priors for marker effects are used in multi-trait genomic prediction. These methods can also be extended to genome-wide association studies (GWAS). In multiple-trait GWAS, incorporating the underlying causal structures among traits is essential for comprehensively understanding the relationship between genotypes and traits of interest. Therefore, we develop a GWAS methodology, SEM-Bayesian alphabet, which, by applying the structural equation model (SEM), can be used to incorporate causal structures into multi-trait Bayesian regression methods. SEM-Bayesian alphabet provides a more comprehensive understanding of the genotype-phenotype mapping than multi-trait GWAS by performing GWAS based on indirect, direct and overall marker effects. The superior performance of SEM-Bayesian alphabet was demonstrated by comparing its GWAS results with other similar multi-trait GWAS methods on real and simulated data. The software tool JWAS offers open-source routines to perform these analyses.

Download Full-text

A Bayesian Regression Model with Variable Selection for Genome-Wide Association Studies

Case Studies in Bayesian Statistical Modelling and Analysis - Wiley Series in Probability and Statistics ◽

10.1002/9781118394472.ch6 ◽

2012 ◽

pp. 103-117

Author(s):

Carla Chen ◽

Kerrie L. Mengersen ◽

Katja Ickstadt ◽

Jonathan M. Keith

Keyword(s):

Variable Selection ◽

Regression Model ◽

Association Studies ◽

Bayesian Regression ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Selection For

Download Full-text

Genome-Wide Association Studies Reveal Susceptibility Loci for Digital Dermatitis in Holstein Cattle

Animals ◽

10.3390/ani10112009 ◽

2020 ◽

Vol 10 (11) ◽

pp. 2009

Author(s):

Ellen Lai ◽

Alexa L. Danner ◽

Thomas R. Famula ◽

Anita M. Oberbauer

Keyword(s):

Predictive Value ◽

Mixed Model ◽

Linear Mixed Model ◽

Bos Taurus ◽

Association Studies ◽

Bayesian Regression ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Digital Dermatitis ◽

Genome Wide

Digital dermatitis (DD) causes lameness in dairy cattle. To detect the quantitative trait loci (QTL) associated with DD, genome-wide association studies (GWAS) were performed using high-density single nucleotide polymorphism (SNP) genotypes and binary case/control, quantitative (average number of FW per hoof trimming record) and recurrent (cases with ≥2 DD episodes vs. controls) phenotypes from cows across four dairies (controls n = 129 vs. FW n = 85). Linear mixed model (LMM) and random forest (RF) approaches identified the top SNPs, which were used as predictors in Bayesian regression models to assess the SNP predictive value. The LMM and RF analyses identified QTL regions containing candidate genes on Bos taurus autosome (BTA) 2 for the binary and recurrent phenotypes and BTA7 and 20 for the quantitative phenotype that related to epidermal integrity, immune function, and wound healing. Although larger sample sizes are necessary to reaffirm these small effect loci amidst a strong environmental effect, the sample cohort used in this study was sufficient for estimating SNP effects with a high predictive value.

Download Full-text

Biological and practical implications of genome-wide association study of schizophrenia using Bayesian variable selection

npj Schizophrenia ◽

10.1038/s41537-019-0088-6 ◽

2019 ◽

Vol 5 (1) ◽

Author(s):

Benazir Rowe ◽

Xiangning Chen ◽

Zuoheng Wang ◽

Jingchun Chen ◽

Amei Amei

Keyword(s):

Variable Selection ◽

Sample Size ◽

Permutation Test ◽

Association Studies ◽

Statistical Significance ◽

Univariate Analysis ◽

Bayesian Variable Selection ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

AbstractGenome-wide association studies (GWAS) have identified over 100 loci associated with schizophrenia. Most of these studies test genetic variants for association one at a time. In this study, we performed GWAS of the molecular genetics of schizophrenia (MGS) dataset with 5334 subjects using multivariate Bayesian variable selection (BVS) method Posterior Inference via Model Averaging and Subset Selection (piMASS) and compared our results with the previous univariate analysis of the MGS dataset. We showed that piMASS can improve the power of detecting schizophrenia-associated SNPs, potentially leading to new discoveries from existing data without increasing the sample size. We tested SNPs in groups to allow for local additive effects and used permutation test to determine statistical significance in order to compare our results with univariate method. The previous univariate analysis of the MGS dataset revealed no genome-wide significant loci. Using the same dataset, we identified a single region that exceeded the genome-wide significance. The result was replicated using an independent Swedish Schizophrenia Case–Control Study (SSCCS) dataset. Based on the SZGR 2.0 database we found 63 SNPs from the best performing regions that are mapped to 27 genes known to be associated with schizophrenia. Overall, we demonstrated that piMASS could discover association signals that otherwise would need a much larger sample size. Our study has important implication that reanalyzing published datasets with BVS methods like piMASS might have more power to discover new risk variants for many diseases without new sample collection, ascertainment, and genotyping.

Download Full-text

Bayesian variable selection regression for genome-wide association studies and other large-scale problems

The Annals of Applied Statistics ◽

10.1214/11-aoas455 ◽

2011 ◽

Vol 5 (3) ◽

pp. 1780-1815 ◽

Cited By ~ 194

Author(s):

Yongtao Guan ◽

Matthew Stephens

Keyword(s):

Variable Selection ◽

Large Scale ◽

Association Studies ◽

Bayesian Variable Selection ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Large Scale Problems ◽

Genome Wide

Download Full-text

High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software

F1000Research ◽

10.12688/f1000research.4867.1 ◽

2014 ◽

Vol 3 ◽

pp. 200 ◽

Cited By ~ 11

Author(s):

Diego Fabregat-Traver ◽

Sodbo Zh. Sharapov ◽

Caroline Hayward ◽

Igor Rudan ◽

Harry Campbell ◽

...

Keyword(s):

High Performance ◽

Mixed Model ◽

Association Studies ◽

Structured Populations ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Multiple Traits ◽

Multiple Trait ◽

Model Based ◽

Genome Wide

To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the ’omics’ context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations,increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNUGPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL.

Download Full-text

Including phenotypic causal networks in genome-wide association studies using mixed effects structural equation models

10.1101/251421 ◽

2018 ◽

Cited By ~ 1

Author(s):

Mehdi Momen ◽

Ahmad Ayatollahi Mehrgardi ◽

Mahmoud Amiri Roudbar ◽

Andreas Kranis ◽

Renan Mercuri Pinto ◽

...

Keyword(s):

Structural Equation ◽

Structural Equation Models ◽

Association Studies ◽

Genome Wide Association ◽

Path Coefficient ◽

Genome Wide Association Studies ◽

Causal Relationships ◽

Multiple Phenotypes ◽

Genome Wide ◽

Highest Posterior Density

AbstractBackgroundPhenotypic networks describing putative causal relationships among multiple phenotypes can be used to infer single-nucleotide polymorphism (SNP) effects in genome-wide association studies (GWAS). In GWAS with multiple phenotypes, reconstructing underlying causal structures among traits and SNPs using a single statistical framework is essential for understanding the entirety of genotype-phenotype maps. A structural equation model (SEM) can be used for such purposes.MethodsWe applied SEM to GWAS (SEM-GWAS) in chickens, taking into account putative causal relationships among body weight (BW), breast meat (BM), hen-house production (HHP), and SNPs. We assessed the performance of SEM-GWAS by comparing the model results with those obtained from traditional multi-trait association analyses (MTM-GWAS).ResultsThree different putative causal path diagrams were inferred from highest posterior density (HPD) intervals of 0.75, 0.85, and 0.95 using the inductive causation algorithm. A positive path coefficient was estimated for BM→BW, and negative values were obtained for BM→HHP and BW→HHP in all implemented scenarios. Further, the application of SEM-GWAS enabled the decomposition of SNP effects into direct, indirect, and total effects, identifying whether a SNP effect is acting directly or indirectly on a given trait. In contrast, MTM-GWAS only captured overall genetic effects on traits, which is equivalent to combining the direct and indirect SNP effects from SEMGWAS.ConclusionsAlthough MTM-GWAS and SEM-GWAS use the same probabilistic models, we provide evidence that SEM-GWAS captures complex relationships and delivers a more comprehensive understanding of SNP effects compared to MTM-GWAS. Our results showed that SEM-GWAS provides important insight regarding the mechanism by which identified SNPs control traits by partitioning them into direct, indirect, and total SNP effects.

Download Full-text

Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies

BMC Genetics ◽

10.1186/s12863-014-0130-7 ◽

2014 ◽

Vol 15 (1) ◽

Cited By ~ 4

Author(s):

Xiaoshuai Zhang ◽

Fuzhong Xue ◽

Hong Liu ◽

Dianwen Zhu ◽

Bin Peng ◽

...

Keyword(s):

Variable Selection ◽

Association Studies ◽

Bayesian Variable Selection ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Informative Priors ◽

Genome Wide

Download Full-text

Regularized regression method for genome-wide association studies

BMC Proceedings ◽

10.1186/1753-6561-5-s9-s67 ◽

2011 ◽

Vol 5 (S9) ◽

Cited By ~ 2

Author(s):

Jin Liu ◽

Kai Wang ◽

Shuangge Ma ◽

Jian Huang

Keyword(s):

Association Studies ◽

Regression Method ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Regularized Regression ◽

Genome Wide

Download Full-text

Accounting for linkage disequilibrium in genome-wide association studies: a penalized regression method

Statistics and Its Interface ◽

10.4310/sii.2013.v6.n1.a10 ◽

2013 ◽

Vol 6 (1) ◽

pp. 99-115 ◽

Cited By ~ 10

Author(s):

Jian Huang ◽

Jin Liu ◽

Shuangge Ma ◽

Kai Wang

Keyword(s):

Linkage Disequilibrium ◽

Association Studies ◽

Penalized Regression ◽

Regression Method ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text