scholarly journals Using extreme phenotype sampling to identify the rare causal variants of quantitative traits in association studies

2011 ◽  
Vol 35 (8) ◽  
pp. 790-799 ◽  
Author(s):  
Dalin Li ◽  
Juan Pablo Lewinger ◽  
William J. Gauderman ◽  
Cassandra Elizabeth Murcray ◽  
David Conti
Author(s):  
Andrey Ziyatdinov ◽  
Jihye Kim ◽  
Dmitry Prokopenko ◽  
Florian Privé ◽  
Fabien Laporte ◽  
...  

Abstract The effective sample size (ESS) is a metric used to summarize in a single term the amount of correlation in a sample. It is of particular interest when predicting the statistical power of genome-wide association studies (GWAS) based on linear mixed models. Here, we introduce an analytical form of the ESS for mixed-model GWAS of quantitative traits and relate it to empirical estimators recently proposed. Using our framework, we derived approximations of the ESS for analyses of related and unrelated samples and for both marginal genetic and gene-environment interaction tests. We conducted simulations to validate our approximations and to provide a quantitative perspective on the statistical power of various scenarios, including power loss due to family relatedness and power gains due to conditioning on the polygenic signal. Our analyses also demonstrate that the power of gene-environment interaction GWAS in related individuals strongly depends on the family structure and exposure distribution. Finally, we performed a series of mixed-model GWAS on data from the UK Biobank and confirmed the simulation results. We notably found that the expected power drop due to family relatedness in the UK Biobank is negligible.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Shuquan Rao ◽  
Yao Yao ◽  
Daniel E. Bauer

AbstractGenome-wide association studies (GWAS) have uncovered thousands of genetic variants that influence risk for human diseases and traits. Yet understanding the mechanisms by which these genetic variants, mainly noncoding, have an impact on associated diseases and traits remains a significant hurdle. In this review, we discuss emerging experimental approaches that are being applied for functional studies of causal variants and translational advances from GWAS findings to disease prevention and treatment. We highlight the use of genome editing technologies in GWAS functional studies to modify genomic sequences, with proof-of-principle examples. We discuss the challenges in interrogating causal variants, points for consideration in experimental design and interpretation of GWAS locus mechanisms, and the potential for novel therapeutic opportunities. With the accumulation of knowledge of functional genetics, therapeutic genome editing based on GWAS discoveries will become increasingly feasible.


Author(s):  
Jianhua Wang ◽  
Dandan Huang ◽  
Yao Zhou ◽  
Hongcheng Yao ◽  
Huanhuan Liu ◽  
...  

Abstract Genome-wide association studies (GWASs) have revolutionized the field of complex trait genetics over the past decade, yet for most of the significant genotype-phenotype associations the true causal variants remain unknown. Identifying and interpreting how causal genetic variants confer disease susceptibility is still a big challenge. Herein we introduce a new database, CAUSALdb, to integrate the most comprehensive GWAS summary statistics to date and identify credible sets of potential causal variants using uniformly processed fine-mapping. The database has six major features: it (i) curates 3052 high-quality, fine-mappable GWAS summary statistics across five human super-populations and 2629 unique traits; (ii) estimates causal probabilities of all genetic variants in GWAS significant loci using three state-of-the-art fine-mapping tools; (iii) maps the reported traits to a powerful ontology MeSH, making it simple for users to browse studies on the trait tree; (iv) incorporates highly interactive Manhattan and LocusZoom-like plots to allow visualization of credible sets in a single web page more efficiently; (v) enables online comparison of causal relations on variant-, gene- and trait-levels among studies with different sample sizes or populations and (vi) offers comprehensive variant annotations by integrating massive base-wise and allele-specific functional annotations. CAUSALdb is freely available at http://mulinlab.org/causaldb.


2016 ◽  
Vol 283 (1835) ◽  
pp. 20160569 ◽  
Author(s):  
M. E. Goddard ◽  
K. E. Kemper ◽  
I. M. MacLeod ◽  
A. J. Chamberlain ◽  
B. J. Hayes

Complex or quantitative traits are important in medicine, agriculture and evolution, yet, until recently, few of the polymorphisms that cause variation in these traits were known. Genome-wide association studies (GWAS), based on the ability to assay thousands of single nucleotide polymorphisms (SNPs), have revolutionized our understanding of the genetics of complex traits. We advocate the analysis of GWAS data by a statistical method that fits all SNP effects simultaneously, assuming that these effects are drawn from a prior distribution. We illustrate how this method can be used to predict future phenotypes, to map and identify the causal mutations, and to study the genetic architecture of complex traits. The genetic architecture of complex traits is even more complex than previously thought: in almost every trait studied there are thousands of polymorphisms that explain genetic variation. Methods of predicting future phenotypes, collectively known as genomic selection or genomic prediction, have been widely adopted in livestock and crop breeding, leading to increased rates of genetic improvement.


Genetics ◽  
2009 ◽  
Vol 183 (3) ◽  
pp. 1153-1164 ◽  
Author(s):  
Bala R. Thumma ◽  
Bronwyn A. Matheson ◽  
Deqiang Zhang ◽  
Christian Meeske ◽  
Roger Meder ◽  
...  

Populations with low linkage disequilibrium (LD) offer unique opportunities to study functional variants influencing quantitative traits. We exploited the low LD in forest trees to identify functional polymorphisms in a Eucalyptus nitens COBRA-like gene (EniCOBL4A), whose Arabidopsis homolog has been implicated in cellulose deposition. Linkage analysis in a full-sib family revealed that EniCOBL4A is the most strongly associated marker in a quantitative trait locus (QTL) region for cellulose content. Analysis of LD by genotyping 11 common single-nucleotide polymorphisms (SNPs) and a simple sequence repeat (SSR) in an association population revealed that LD declines within the length of the gene. Using association studies we fine mapped the effect of the gene to SNP7, a synonymous SNP in exon 5, which occurs between two small haplotype blocks. We observed patterns of allelic expression imbalance (AEI) and differential binding of nuclear proteins to the SNP7 region that indicate that SNP7 is a cis-acting regulatory polymorphism affecting allelic expression. We also observed AEI in SNP7 heterozygotes in a full-sib family that is linked to heritable allele-specific methylation near SNP7. This study demonstrates the potential to reveal functional polymorphisms underlying quantitative traits in low LD populations.


2020 ◽  
Author(s):  
Yanjiao Jin ◽  
Jie Yang ◽  
Shuyue Zhang ◽  
Jin Li ◽  
Songlin Wang

Abstract Background: Oral diseases impact the majority of the world’s population. The following traits are common in oral inflammatory diseases: mouth ulcers, painful gums, bleeding gums, loose teeth, and toothache. Despite the prevalence of genome-wide association studies, the associations between these traits and common genomic variants, and whether pleiotropic loci are shared by some of these traits remain poorly understood. Methods: In this work, we conducted multi-trait joint analyses based on the summary statistics of genome-wide association studies of these five oral inflammatory traits from the UK Biobank, each of which is comprised of over 10,000 cases and over 300,000 controls. We estimated the genetic correlations between the five traits. We conducted fine-mapping and functional annotation based on multi-omics data to better understand the biological functions of the potential causal variants at each locus. To identify the pathways in which the candidate genes were mainly involved, we applied gene-set enrichment analysis, and further performed protein-protein interaction (PPI) analyses.Results: We identified 39 association signals that surpassed genome-wide significance, including three that were shared between two or more oral inflammatory traits, consistent with a strong correlation. Among these genome-wide significant loci, two were novel for both painful gums and toothache. We performed fine-mapping and identified causal variants at each novel locus. Further functional annotation based on multi-omics data suggested IL10 and IL12A/TRIM59 as potential candidate genes at the novel pleiotropic loci, respectively. Subsequent analyses of pathway enrichment and protein-protein interaction networks suggested the involvement of candidate genes at genome-wide significant loci in immune regulation.Conclusions: Our results highlighted the importance of immune regulation in the pathogenesis of oral inflammatory diseases. Some common immune-related pleiotropic loci or genetic variants are shared by multiple oral inflammatory traits. These findings will be beneficial for risk prediction, prevention, and therapy of oral inflammatory diseases.


2021 ◽  
Author(s):  
Richard J Allen ◽  
Beatriz Guillen-Guio ◽  
Emma Croot ◽  
Luke M Kraven ◽  
Samuel Moss ◽  
...  

AbstractGenome-wide association studies (GWAS) of coronavirus disease 2019 (COVID-19) and idiopathic pulmonary fibrosis (IPF) have identified genetic loci associated with both traits, suggesting possible shared biological mechanisms. Using updated GWAS of COVID-19 and IPF, we evaluated the genetic overlap between these two diseases and identified four genetic loci (including one novel) with likely shared causal variants between severe COVID-19 and IPF. Although there was a positive genetic correlation between COVID-19 and IPF, two of these four shared genetic loci had an opposite direction of effect. IPF-associated genetic variants related to telomere dysfunction and spindle assembly showed no association with COVID-19 phenotypes. Together, these results suggest there are both shared and distinct biological processes driving IPF and severe COVID-19 phenotypes.


2019 ◽  
Vol 29 (1) ◽  
pp. 159-167 ◽  
Author(s):  
Lynsey S Hall ◽  
Christopher W Medway ◽  
Oliver Pain ◽  
Antonio F Pardiñas ◽  
Elliott G Rees ◽  
...  

Abstract Schizophrenia is a complex highly heritable disorder. Genome-wide association studies (GWAS) have identified multiple loci that influence the risk of developing schizophrenia, although the causal variants driving these associations and their impacts on specific genes are largely unknown. We identify a significant correlation between schizophrenia risk and expression at 89 genes in the dorsolateral prefrontal cortex (P ≤ 9.43 × 10−6), including 20 novel genes. Genes whose expression correlate with schizophrenia were enriched for those involved in abnormal CNS synaptic transmission (PFDR = 0.02) and antigen processing and presentation of peptide antigen via MHC class I (PFDR = 0.02). Within the CNS synaptic transmission set, we identify individual significant candidate genes to which we assign direction of expression changes in schizophrenia. The findings provide strong candidates for experimentally probing the molecular basis of synaptic pathology in schizophrenia.


2019 ◽  
Vol 47 (22) ◽  
pp. e146-e146 ◽  
Author(s):  
Taeyeop Lee ◽  
Min Kyung Sung ◽  
Seulkee Lee ◽  
Woojin Yang ◽  
Jaeho Oh ◽  
...  

Abstract Major progress in disease genetics has been made through genome-wide association studies (GWASs). One of the key tasks for post-GWAS analyses is to identify causal noncoding variants with regulatory function. Here, on the basis of >2000 functional features, we developed a convolutional neural network framework for combinatorial, nonlinear modeling of complex patterns shared by risk variants scattered among multiple associated loci. When applied for major psychiatric disorders and autoimmune diseases, neural and immune features, respectively, exhibited high explanatory power while reflecting the pathophysiology of the relevant disease. The predicted causal variants were concentrated in active regulatory regions of relevant cell types and tended to be in physical contact with transcription factors while residing in evolutionarily conserved regions and resulting in expression changes of genes related to the given disease. We demonstrate some examples of novel candidate causal variants and associated genes. Our method is expected to contribute to the identification and functional interpretation of potential causal noncoding variants in post-GWAS analyses.


Sign in / Sign up

Export Citation Format

Share Document