A power comparison of the association tests for genome-wide association studies

Abstract

Genome-wide association studies (GWASs) have identified many disease-associated loci, the majority of which have unknown biological functions. Understanding the mechanism underlying trait associations requires identifying trait-relevant tissues and investigating associations in a trait-specific fashion. Here, we extend the widely used linear mixed model to incorporate multiple SNP functional annotations from omics studies with GWAS summary statistics to facilitate the identification of trait-relevant tissues, with which to further construct powerful association tests. Specifically, we rely on a generalized estimating equation based algorithm for parameter inference, a mixture modeling framework for trait-tissue relevance classification, and a weighted sequence kernel association test constructed based on the identified trait-relevant tissues for powerful association analysis. We refer to our analytic procedure as the Scalable Multiple Annotation integration for trait-Relevant Tissue identification and usage (SMART). With extensive simulations, we show how our method can make use of multiple complementary annotations to improve the accuracy of identifying trait-relevant tissues. In addition, our procedure allows us to make use of the inferred trait-relevant tissues, for the first time, to construct more powerful SNP set tests. We apply our method to an in-depth analysis of 43 traits from 28 GWASs using tissue-specific annotations in 105 tissues derived from ENCODE and Roadmap. Our results reveal new trait-tissue relevance, pinpoint important annotations that are informative of trait-tissue relationship, and illustrate how we can use the inferred trait-relevant tissues to construct more powerful association tests in the Wellcome Trust Case Control Consortium study.

Author Summary

Identifying trait-relevant tissues is an important step towards understanding disease etiology. Computational methods have been recently developed to integrate SNP functional annotations generated from omics studies into genome-wide association studies (GWASs) to infer trait-relevant tissues. However, two important questions remain to be answered. First, with the increasing number and types of functional annotations, how do we integrate multiple annotations jointly into GWASs in a trait-specific fashion, so that the complementary information contained in these annotations improves trait-relevant tissue inference? Second, what should we do with the inferred trait-relevant tissues? Here, we develop a new statistical method and software to make progress on both fronts. For the first question, we extend the commonly used linear mixed model, with new algorithms and inference strategies, to incorporate multiple annotations in a trait-specific fashion to improve trait-relevant tissue inference accuracy. For the second question, we rely on the close relationship between our proposed method and the widely used sequence kernel association test, and use the inferred trait-relevant tissues, for the first time, to construct more powerful association tests. We illustrate the benefits of our method through extensive simulations and applications to a wide range of real data sets.
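To make the idea of a weighted sequence kernel association test concrete, the following minimal sketch computes the usual SKAT-type score statistic Q = Σ_j w_j S_j², where S_j is the score of SNP j against phenotype residuals. It is an illustration only and not the SMART implementation: the function name `weighted_skat_q`, the intercept-only null model, and the toy `annotation_weights` (standing in for weights derived from a trait-relevant tissue) are all assumptions, and computing a p-value for Q is omitted.

```python
def weighted_skat_q(genotypes, phenotype, weights):
    """Return Q = sum_j w_j * S_j**2, with S_j = sum_i g_ij * (y_i - mean(y)).

    genotypes: list of rows, one per individual, each a list of allele counts.
    phenotype: list of quantitative trait values, one per individual.
    weights:   list of non-negative per-SNP weights (hypothetical here).
    """
    n = len(phenotype)
    mean_y = sum(phenotype) / n
    # Residuals under an intercept-only null model (an assumption of this sketch).
    residuals = [y - mean_y for y in phenotype]
    q = 0.0
    for j, w in enumerate(weights):
        score_j = sum(genotypes[i][j] * residuals[i] for i in range(n))
        q += w * score_j ** 2
    return q


# Toy data: 4 individuals, 2 SNPs; the second SNP is up-weighted.
genotypes = [[0, 1], [0, 0], [1, 1], [2, 0]]
phenotype = [1.0, 2.0, 3.0, 4.0]
annotation_weights = [1.0, 2.0]
q_stat = weighted_skat_q(genotypes, phenotype, annotation_weights)
```

Up-weighting SNPs that carry annotations from a relevant tissue lets their scores contribute more to Q, which is the intuition behind using tissue information to gain power.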

Operating Characteristics of the Rank-Based Inverse Normal Transformation for Quantitative Trait Analysis in Genome-Wide Association Studies

10.1101/635706 ◽

2019 ◽

Cited By ~ 2

Author(s):

Zachary R. McCaw ◽

Jacqueline M. Lane ◽

Richa Saxena ◽

Susan Redline ◽

Xihong Lin

Keyword(s):

Type I Error ◽

Association Studies ◽

Genome Wide Association ◽

Type I ◽

Genome Wide Association Studies ◽

Operating Characteristics ◽

Association Tests ◽

Genome Wide ◽

Normal Transformation ◽

Normally Distributed

Summary

Quantitative traits analyzed in Genome-Wide Association Studies (GWAS) are often non-normally distributed. For such traits, association tests based on standard linear regression are subject to reduced power and inflated type I error in finite samples. Applying the rank-based Inverse Normal Transformation (INT) to non-normally distributed traits has become common practice in GWAS. However, the different variations on INT-based association testing have not been formally defined, and guidance is lacking on when to use which approach. In this paper, we formally define and systematically compare the direct (D-INT) and indirect (I-INT) INT-based association tests. We discuss their assumptions, underlying generative models, and connections. We demonstrate that the relative powers of D-INT and I-INT depend on the underlying data generating process. Since neither approach is uniformly most powerful, we combine them into an adaptive omnibus test (O-INT). O-INT is robust to model misspecification, protects the type I error, and is well powered against a wide range of non-normally distributed traits. Extensive simulations were conducted to examine the finite sample operating characteristics of these tests. Our results demonstrate that, for non-normally distributed traits, INT-based tests outperform the standard untransformed association test (UAT), both in terms of power and type I error rate control. We apply the proposed methods to GWAS of spirometry traits in the UK Biobank. O-INT has been implemented in the R package RNOmni, which is available on CRAN.
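As a rough illustration of the transformation itself, the sketch below maps each observation to the standard normal quantile of its fractional rank, INT(y_i) = Φ⁻¹((rank_i − c) / (n − 2c + 1)). The Blom offset c = 3/8 and the averaging of tied ranks are assumptions of this sketch, and `rank_int` is a hypothetical helper name, not the RNOmni API. The D-INT, I-INT, and O-INT tests described above would be built on top of such a transform and are not shown.

```python
from statistics import NormalDist


def rank_int(values, offset=0.375):
    """Rank-based inverse normal transformation with average ranks for ties."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the block of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    std_normal = NormalDist()
    return [std_normal.inv_cdf((r - offset) / (n - 2 * offset + 1)) for r in ranks]


# A skewed toy trait: the transformed values depend only on the ranks.
skewed_trait = [0.1, 0.4, 0.2, 25.0, 1.3]
transformed = rank_int(skewed_trait)
```

Because only ranks enter the transform, the extreme value 25.0 is mapped to the largest normal quantile for n = 5 rather than dominating the scale.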

Robust Association Tests for the Replication of Genome-Wide Association Studies

BioMed Research International ◽

10.1155/2015/461593 ◽

2015 ◽

Vol 2015 ◽

pp. 1-10

Author(s):

Jungnam Joo ◽

Ju-Hyun Park ◽

Bora Lee ◽

Boram Park ◽

Sohee Kim ◽

...

Keyword(s):

Genome Wide Association Study ◽

Genetic Model ◽

Association Studies ◽

Statistical Significance ◽

Genetic Models ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Association Tests ◽

Genome Wide ◽

Wide Range

In genome-wide association studies (GWAS), robust genetic association tests have been shown to provide better power across a wide range of underlying genetic models. These tests include the maximum of three Cochran-Armitage trend tests (CATTs), each corresponding to the recessive, additive, or dominant genetic model (MAX3); the minimum p-value of Pearson’s chi-square test with 2 degrees of freedom and the CATT based on the additive genetic model (MIN2); and the genetic model selection (GMS) and genetic model exclusion (GME) methods. In this paper, we demonstrate how these robust tests can be applied to the replication study of a GWAS and how the overall statistical significance can be evaluated using the combined test formed by the p-values of the discovery and replication studies.
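The sketch below illustrates two ingredients mentioned above, under stated assumptions. `catt_z` computes the Cochran-Armitage trend statistic for a 2×3 case-control genotype table with scores (0, x, 1), and `max3` takes the largest absolute value over x = 0, 0.5, 1. `fisher_combine` uses Fisher's method as a stand-in for combining a discovery and a replication p-value; the paper's own combined test may differ. All function names are hypothetical, and the null distribution of MAX3 (needed for its p-value) is not computed here.

```python
import math

# Genotype scores for counts of the risk allele (0, 1, 2 copies).
MODEL_SCORES = {
    "recessive": (0.0, 0.0, 1.0),
    "additive": (0.0, 0.5, 1.0),
    "dominant": (0.0, 1.0, 1.0),
}


def catt_z(cases, controls, scores):
    """Cochran-Armitage trend Z statistic for genotype counts (3 per group)."""
    n_cases, n_controls = sum(cases), sum(controls)
    n_total = n_cases + n_controls
    totals = [r + s for r, s in zip(cases, controls)]
    u = sum(w * (n_controls * r - n_cases * s)
            for w, r, s in zip(scores, cases, controls))
    spread = (n_total * sum(w * w * n for w, n in zip(scores, totals))
              - sum(w * n for w, n in zip(scores, totals)) ** 2)
    variance = n_cases * n_controls * spread / n_total
    if variance <= 0:
        return 0.0  # degenerate table: no variation in the scored genotype
    return u / math.sqrt(variance)


def max3(cases, controls):
    """Maximum absolute CATT over the recessive, additive, and dominant scores."""
    return max(abs(catt_z(cases, controls, s)) for s in MODEL_SCORES.values())


def fisher_combine(p_discovery, p_replication):
    """Fisher's combined p-value for two independent p-values (each > 0).

    -2 * ln(p1 * p2) is chi-square with 4 df, whose tail has a closed form.
    """
    product = p_discovery * p_replication
    return product * (1.0 - math.log(product))


# Toy genotype counts (0, 1, 2 risk alleles) for cases and controls.
cases, controls = (10, 20, 30), (30, 20, 10)
z_additive = catt_z(cases, controls, MODEL_SCORES["additive"])
max3_stat = max3(cases, controls)
```

In this toy table the additive scores give the largest statistic, so MAX3 coincides with the additive CATT; for other tables a recessive or dominant pattern can win, which is what makes the maximum robust to the unknown genetic model.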

Association tests through combining p-values for case control genome-wide association studies

Statistics & Probability Letters ◽

10.1016/j.spl.2013.04.021 ◽

2013 ◽

Vol 83 (8) ◽

pp. 1854-1862 ◽

Cited By ~ 14

Author(s):

Zhongxue Chen

Keyword(s):

Association Studies ◽

Case Control ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Association Tests ◽

Genome Wide

Two-stage association tests for genome-wide association studies based on family data with arbitrary family structure

European Journal of Human Genetics ◽

10.1038/sj.ejhg.5201902 ◽

2007 ◽

Vol 15 (11) ◽

pp. 1169-1175 ◽

Cited By ~ 11

Author(s):

Tao Feng ◽

Shuanglin Zhang ◽

Qiuying Sha

Keyword(s):

Family Structure ◽

Association Studies ◽

Family Data ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Two Stage ◽

Association Tests ◽

Genome Wide

Network-based metabolite ratios for an improved functional characterization of genome-wide association study results

10.1101/048512 ◽

2016 ◽

Cited By ~ 2

Author(s):

Jan Krumsiek ◽

Ferdinand Stückler ◽

Karsten Suhre ◽

Christian Gieger ◽

Tim D. Spector ◽

...

Keyword(s):

Metabolic Network ◽

Quantitative Traits ◽

Association Studies ◽

Small Sample ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Metabolomics Data ◽

Association Tests ◽

Genome Wide ◽

Study Results

Abstract

Genome-wide association studies (GWAS) with metabolite ratios as quantitative traits have successfully deepened our understanding of the complex relationship between genetic variants and metabolic phenotypes. Usually all ratio combinations are selected for association tests. However, with more metabolites being detectable, the quadratic increase of the ratio number becomes challenging from a statistical, computational and interpretational point of view. Therefore, methods which select biologically meaningful ratios are required.

We here present a network-based approach by selecting only closely connected metabolites in a given metabolic network. The feasibility of this approach was tested on in silico data derived from simulated reaction networks. Especially for small effect sizes, network-based metabolite ratios (NBRs) improved the metabolite-based prediction accuracy of genetically-influenced reactions compared to the ‘all ratios’ approach. Evaluating the NBR approach on published GWAS association results, we compared reported ‘all ratio’-SNP hits with results obtained by selecting only NBRs as candidates for association tests. Input networks for NBR selection were derived from public pathway databases or reconstructed from metabolomics data. NBR-candidates covered more than 80% of all significant ratio-SNP associations and we could replicate 7 out of 10 new associations predicted by the NBR approach.

In this study we evaluated a network-based approach to select biologically meaningful metabolite ratios as quantitative traits in GWAS. Taking metabolic network information into account facilitated the analysis and the biochemical interpretation of metabolite-gene association results. For upcoming studies, for instance with case-control design, large-scale metabolomics data and small sample numbers, the analysis of all possible metabolite ratios is not feasible due to the correction for multiple testing. Here our NBR approach increases the statistical power and lowers computational demands, allowing for a better understanding of the complex interplay between individual phenotypes, genetics and metabolic profiles.
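One way to picture the selection step is the sketch below, which keeps only metabolite pairs lying within a chosen number of reaction steps of each other in a network and compares that count with the n(n − 1)/2 pairs of the ‘all ratios’ approach. The toy linear pathway, the `max_distance` cutoff, and the function name `network_based_pairs` are assumptions for illustration; the abstract does not specify how "closely connected" is defined.

```python
from collections import deque


def network_based_pairs(network, max_distance=1):
    """Return sorted metabolite pairs within max_distance edges of each other.

    network: dict mapping each metabolite to a list of its neighbors
             (treated as an undirected graph; list both directions).
    """
    selected = set()
    for source in network:
        # Breadth-first search from source, truncated at max_distance.
        distance = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            if distance[node] == max_distance:
                continue
            for neighbor in network.get(node, ()):
                if neighbor not in distance:
                    distance[neighbor] = distance[node] + 1
                    queue.append(neighbor)
        for target in distance:
            if target != source:
                selected.add(tuple(sorted((source, target))))
    return sorted(selected)


# Toy linear pathway A - B - C - D (hypothetical metabolites).
pathway = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
nbr_pairs = network_based_pairs(pathway, max_distance=1)
n_metabolites = len(pathway)
all_pairs_count = n_metabolites * (n_metabolites - 1) // 2
```

With direct neighbors only, the toy pathway yields 3 candidate ratios instead of 6; the gap widens quadratically as more metabolites are measured, which reduces the multiple-testing burden.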

Maximizing the Power of Genome-Wide Association Studies: A Novel Class of Powerful Family-Based Association Tests

Statistics in Biosciences ◽

10.1007/s12561-009-9016-z ◽

2009 ◽

Vol 1 (2) ◽

pp. 125-143 ◽

Cited By ~ 4

Author(s):

Sungho Won ◽

Lars Bertram ◽

David Becker ◽

Rudolph E. Tanzi ◽

Christoph Lange

Keyword(s):

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Association Tests ◽

Genome Wide ◽

Family Based
