Fast and Accurate Genome-Wide Association Test of Multiple Quantitative Traits

Computational and Mathematical Methods in Medicine ◽

10.1155/2018/2564531 ◽

2018 ◽

Vol 2018 ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Baolin Wu ◽

James S. Pankow

Keyword(s):

Quantitative Traits ◽

Association Studies ◽

R Package ◽

Genome Wide Association ◽

Type I ◽

Genome Wide Association Studies ◽

Competitive Performance ◽

Multiple Traits ◽

Atherosclerosis Risk In Communities ◽

Genome Wide

Multiple correlated traits are often collected in genetic studies. By jointly analyzing multiple traits, we can increase power by aggregating multiple weak effects and reveal additional insights into the genetic architecture of complex human diseases. In this article, we propose a multivariate linear regression-based method to test the joint association of multiple quantitative traits. It is flexible to accommodate any covariates, has very accurate control of type I errors, and offers very competitive performance. We also discuss fast and accurate significance p value computation especially for genome-wide association studies with small-to-medium sample sizes. We demonstrate through extensive numerical studies that the proposed method has competitive performance. Its usefulness is further illustrated with application to genome-wide association analysis of diabetes-related traits in the Atherosclerosis Risk in Communities (ARIC) study. We found some very interesting associations with diabetes traits which have not been reported before. We implemented the proposed methods in a publicly available R package.

Download Full-text

bGWAS: an R package to perform Bayesian genome wide association studies

Bioinformatics ◽

10.1093/bioinformatics/btaa549 ◽

2020 ◽

Vol 36 (15) ◽

pp. 4374-4376

Author(s):

Ninon Mounier ◽

Zoltán Kutalik

Keyword(s):

Mendelian Randomization ◽

Causal Effect ◽

Association Studies ◽

R Package ◽

Genome Wide Association ◽

Supplementary Information ◽

Genome Wide Association Studies ◽

Biological Mechanisms ◽

Genome Wide ◽

Related Risk

Abstract Summary Increasing sample size is not the only strategy to improve discovery in Genome Wide Association Studies (GWASs) and we propose here an approach that leverages published studies of related traits to improve inference. Our Bayesian GWAS method derives informative prior effects by leveraging GWASs of related risk factors and their causal effect estimates on the focal trait using multivariable Mendelian randomization. These prior effects are combined with the observed effects to yield Bayes Factors, posterior and direct effects. The approach not only increases power, but also has the potential to dissect direct and indirect biological mechanisms. Availability and implementation bGWAS package is freely available under a GPL-2 License, and can be accessed, alongside with user guides and tutorials, from https://github.com/n-mounier/bGWAS. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

The effect of different sets of critical values on type I error rates in tiled regression for genome-wide association studies

International Journal of Data Mining and Bioinformatics ◽

10.1504/ijdmb.2016.080030 ◽

2016 ◽

Vol 16 (2) ◽

pp. 111

Author(s):

Heejong Sung ◽

Jeremy A. Sabourin ◽

Alexa J.M. Sorant ◽

Alexander F. Wilson

Keyword(s):

Type I Error ◽

Association Studies ◽

Error Rates ◽

Critical Values ◽

Genome Wide Association ◽

Type I ◽

Genome Wide Association Studies ◽

Type I Error Rates ◽

Genome Wide

Download Full-text

Parallel and serial computing tools for testing single-locus and epistatic SNP effects of quantitative traits in genome-wide association studies

BMC Bioinformatics ◽

10.1186/1471-2105-9-315 ◽

2008 ◽

Vol 9 (1) ◽

pp. 315 ◽

Cited By ~ 65

Author(s):

Li Ma ◽

H Birali Runesha ◽

Daniel Dvorkin ◽

John R Garbe ◽

Yang Da

Keyword(s):

Quantitative Traits ◽

Association Studies ◽

Single Locus ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide

Download Full-text

Finding associated variants in genome-wide association studies on multiple traits

Bioinformatics ◽

10.1093/bioinformatics/bty249 ◽

2018 ◽

Vol 34 (13) ◽

pp. i467-i474 ◽

Cited By ~ 4

Author(s):

Lisa Gai ◽

Eleazar Eskin

Keyword(s):

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Multiple Traits ◽

Genome Wide

Download Full-text

GWAPower: a statistical power calculation software for genome-wide association studies with quantitative traits

BMC Genetics ◽

10.1186/1471-2156-12-12 ◽

2011 ◽

Vol 12 (1) ◽

pp. 12 ◽

Cited By ~ 41

Author(s):

Sheng Feng ◽

Shengchu Wang ◽

Chia-Cheng Chen ◽

Lan Lan

Keyword(s):

Statistical Power ◽

Quantitative Traits ◽

Association Studies ◽

Genome Wide Association ◽

Power Calculation ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Statistical Power Calculation ◽

Calculation Software

Download Full-text

What can genome-wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits?

New Phytologist ◽

10.1111/nph.14410 ◽

2017 ◽

Vol 214 (1) ◽

pp. 21-33 ◽

Cited By ~ 38

Author(s):

Emily B. Josephs ◽

John R. Stinchcombe ◽

Stephen I. Wright

Keyword(s):

Genetic Variation ◽

Quantitative Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Evolutionary Forces ◽

Genome Wide

Download Full-text

SumVg: Total heritability explained by all variants in genome-wide association studies based on summary statistics with standard error estimates

10.1101/016857 ◽

2015 ◽

Author(s):

Hon-Cheong SO ◽

Pak C. SHAM

Keyword(s):

Error Estimates ◽

Standard Error ◽

Association Studies ◽

Parametric Bootstrap ◽

R Package ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Summary Statistics ◽

Genome Wide ◽

Key Questions

Genome-wide association studies (GWAS) have become increasingly popular these days and one of the key questions is how much heritability could be explained by all variants in GWAS. We have previously proposed an approach to answer this question, based on recovering the "true" z-statistics from a set of observed z-statistics. Only summary statistics are required. However, methods for standard error (SE) estimation are not available yet, thereby limiting the interpretation of the results. In this study we developed resampling-based approaches to estimate the SE and the methods are implemented in an R package. We found that delete-d-jackknife and parametric bootstrap approaches provide good estimates of the SE. Methods to compute the sum of heritability explained and the corresponding SE are implemented in the R package SumVg, available at https://sites.google.com/site/honcheongso/software/var-totalvg

Download Full-text

CVRMS: Cross-validated Rank-based Marker Selection for Genome-wide Prediction of Low Heritability

10.1101/756130 ◽

2019 ◽

Author(s):

Seongmun Jeong ◽

Jae-Yoon Kim ◽

Namshin Kim

Keyword(s):

Genetic Information ◽

Association Studies ◽

R Package ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Marker Selection ◽

Genome Wide ◽

Precise Prediction ◽

Human Animal ◽

Selection For

AbstractCVRMS is an R package designed to extract marker subsets from repeated rank-based marker datasets generated from genome-wide association studies or marker effects for genome-wide prediction (https://github.com/lovemun/CVRMS). CVRMS provides an optimized genome-wide biomarker set with the best predictability of phenotype by implemented ridge regression using genetic information. Applying our method to human, animal, and plant datasets with wide heritability (zero to one), we selected hundreds to thousands of biomarkers for precise prediction.

Download Full-text

Bayesian multivariate reanalysis of large genetic studies identifies many new associations

10.1101/638882 ◽

2019 ◽

Author(s):

Michael C. Turchin ◽

Matthew Stephens

Keyword(s):

Genetic Variation ◽

Multivariate Analyses ◽

Association Studies ◽

Plasma Lipid ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Multiple Traits ◽

Association Analyses ◽

New Associations ◽

Genome Wide

AbstractGenome-wide association studies (GWAS) have now been conducted for hundreds of phenotypes of relevance to human health. Many such GWAS involve multiple closely-related phenotypes collected on the same samples. However, the vast majority of these GWAS have been analyzed using simple univariate analyses, which consider one phenotype at a time. This is de-spite the fact that, at least in simulation experiments, multivariate analyses have been shown to be more powerful at detecting associations. Here, we conduct multivariate association analyses on 13 different publicly-available GWAS datasets that involve multiple closely-related phenotypes. These data include large studies of anthropometric traits (GIANT), plasma lipid traits (GlobalLipids), and red blood cell traits (HaemgenRBC). Our analyses identify many new associations (433 in total across the 13 studies), many of which replicate when follow-up samples are available. Overall, our results demonstrate that multivariate analyses can help make more effective use of data from both existing and future GWAS.1Author SummaryGenome-wide association studies (GWAS) have become a common and powerful tool for identifying significant correlations between markers of genetic variation and physical traits of interest. Often these studies are conducted by comparing genetic variation against single traits one at a time (‘univariate’); however, it has previously been shown that it is possible to increase your power to detect significant associations by comparing genetic variation against multiple traits simultaneously (‘multivariate’). Despite this apparent increase in power though, researchers still rarely conduct multivariate GWAS, even when studies have multiple traits readily available. Here, we reanalyze 13 previously published GWAS using a multivariate method and find >400 additional associations. Our method makes use of univariate GWAS summary statistics and is available as a software package, thus making it accessible to other researchers interested in conducting the same analyses. We also show, using studies that have multiple releases, that our new associations have high rates of replication. Overall, we argue multivariate approaches in GWAS should no longer be overlooked and how, often, there is low-hanging fruit in the form of new associations by running these methods on data already collected.

Download Full-text

Pleiotropic Mapping and Annotation Selection in Genome-wide Association Studies with Penalized Gaussian Mixture Models

10.1101/256461 ◽

2018 ◽

Author(s):

Ping Zeng ◽

Xinjie Hao ◽

Xiang Zhou

Keyword(s):

Association Mapping ◽

Complex Traits ◽

Association Studies ◽

Penalized Regression ◽

Genome Wide Association ◽

Accurate Estimation ◽

Genome Wide Association Studies ◽

Multiple Traits ◽

Snp Association ◽

Genome Wide

AbstractMotivationGenome-wide association studies (GWASs) have identified many genetic loci associated with complex traits. A substantial fraction of these identified loci are associated with multiple traits – a phenomena known as pleiotropy. Identification of pleiotropic associations can help characterize the genetic relationship among complex traits and can facilitate our understanding of disease etiology. Effective pleiotropic association mapping requires the development of statistical methods that can jointly model multiple traits with genome-wide SNPs together.ResultsWe develop a joint modeling method, which we refer to as the integrative MApping of Pleiotropic association (iMAP). iMAP models summary statistics from GWASs, uses a multivariate Gaussian distribution to account for phenotypic correlation, simultaneously infers genome-wide SNP association pattern using mixture modeling, and has the potential to reveal causal relationship between traits. Importantly, iMAP integrates a large number of SNP functional annotations to substantially improve association mapping power, and, with a sparsity-inducing penalty, is capable of selecting informative annotations from a large, potentially noninformative set. To enable scalable inference of iMAP to association studies with hundreds of thousands of individuals and millions of SNPs, we develop an efficient expectation maximization algorithm based on an approximate penalized regression algorithm. With simulations and comparisons to existing methods, we illustrate the benefits of iMAP both in terms of high association mapping power and in terms of accurate estimation of genome-wide SNP association patterns. Finally, we apply iMAP to perform a joint analysis of 48 traits from 31 GWAS consortia together with 40 tissue-specific SNP annotations generated from the Roadmap Project. iMAP is freely available at www.xzlab.org/software.html.

Download Full-text