scholarly journals FINEMAP: Efficient variable selection using summary data from genome-wide association studies

2015 ◽  
Author(s):  
Christian Benner ◽  
Chris C.A. Spencer ◽  
Samuli Ripatti ◽  
Matti Pirinen

Motivation: The goal of fine-mapping in genomic regions associated with complex diseases and traits is to identify causal variants that point to molecular mechanisms behind the associations. Recent fine-mapping methods using summary data from genome-wide association studies rely on exhaustive search through all possible causal configurations, which is computationally expensive. Results: We introduce FINEMAP, a software package to efficiently explore a set of the most important causal configurations of the region via a shotgun stochastic search algorithm. We show that FINEMAP produces accurate results in a fraction of processing time of existing approaches and is therefore a promising tool for analyzing growing amounts of data produced in genome-wide association studies. Availability: FINEMAP v1.0 is freely available for Mac OS X and Linux at http://www.christianbenner.com.

2017 ◽  
Vol 101 (4) ◽  
pp. 539-551 ◽  
Author(s):  
Christian Benner ◽  
Aki S. Havulinna ◽  
Marjo-Riitta Järvelin ◽  
Veikko Salomaa ◽  
Samuli Ripatti ◽  
...  

Author(s):  
Jianhua Wang ◽  
Dandan Huang ◽  
Yao Zhou ◽  
Hongcheng Yao ◽  
Huanhuan Liu ◽  
...  

Abstract Genome-wide association studies (GWASs) have revolutionized the field of complex trait genetics over the past decade, yet for most of the significant genotype-phenotype associations the true causal variants remain unknown. Identifying and interpreting how causal genetic variants confer disease susceptibility is still a big challenge. Herein we introduce a new database, CAUSALdb, to integrate the most comprehensive GWAS summary statistics to date and identify credible sets of potential causal variants using uniformly processed fine-mapping. The database has six major features: it (i) curates 3052 high-quality, fine-mappable GWAS summary statistics across five human super-populations and 2629 unique traits; (ii) estimates causal probabilities of all genetic variants in GWAS significant loci using three state-of-the-art fine-mapping tools; (iii) maps the reported traits to a powerful ontology MeSH, making it simple for users to browse studies on the trait tree; (iv) incorporates highly interactive Manhattan and LocusZoom-like plots to allow visualization of credible sets in a single web page more efficiently; (v) enables online comparison of causal relations on variant-, gene- and trait-levels among studies with different sample sizes or populations and (vi) offers comprehensive variant annotations by integrating massive base-wise and allele-specific functional annotations. CAUSALdb is freely available at http://mulinlab.org/causaldb.


2013 ◽  
Vol 34 (7) ◽  
pp. 1520-1528 ◽  
Author(s):  
Y. Zheng ◽  
T. O. Ogundiran ◽  
A. G. Falusi ◽  
K. L. Nathanson ◽  
E. M. John ◽  
...  

2016 ◽  
Vol 32 (10) ◽  
pp. 1493-1501 ◽  
Author(s):  
Christian Benner ◽  
Chris C.A. Spencer ◽  
Aki S. Havulinna ◽  
Veikko Salomaa ◽  
Samuli Ripatti ◽  
...  

2017 ◽  
Vol 242 (13) ◽  
pp. 1325-1334 ◽  
Author(s):  
Yizhou Zhu ◽  
Cagdas Tazearslan ◽  
Yousin Suh

Genome-wide association studies have shown that the far majority of disease-associated variants reside in the non-coding regions of the genome, suggesting that gene regulatory changes contribute to disease risk. To identify truly causal non-coding variants and their affected target genes remains challenging but is a critical step to translate the genetic associations to molecular mechanisms and ultimately clinical applications. Here we review genomic/epigenomic resources and in silico tools that can be used to identify causal non-coding variants and experimental strategies to validate their functionalities. Impact statement Most signals from genome-wide association studies (GWASs) map to the non-coding genome, and functional interpretation of these associations remained challenging. We reviewed recent progress in methodologies of studying the non-coding genome and argued that no single approach allows one to effectively identify the causal regulatory variants from GWAS results. By illustrating the advantages and limitations of each method, our review potentially provided a guideline for taking a combinatorial approach to accurately predict, prioritize, and eventually experimentally validate the causal variants.


2021 ◽  
Author(s):  
Chun Chieh Fan ◽  
Robert Loughnan ◽  
Diliana Pechva ◽  
Chi-Hua Chen ◽  
Donald Hagler ◽  
...  

It is important to understand the molecular determinants for microstructures of human brain. However, past genome-wide association studies (GWAS) on microstructures of human brain have had limited results due to methodological constraints. Here, we adopt advanced imaging processing methods and multivariate GWAS on two large scale imaging genetic datasets (UK Biobank and Adolescent Brain Cognitive Development study) to identify and validate key genetic association signals. We discovered 503 unique genetic loci that explained more than 50% of the average heritability across imaging features sensitive to tissue compartments. The genome-wide signals are strongly overlapped with neuropsychiatric diseases, cognitive functions, risk tolerance, and immune responses. Our results implicate the shared molecular mechanisms between tissue microstructures of brain and neuropsychiatric outcomes with astrocyte involvement in the early developmental stage.


2016 ◽  
Author(s):  
Chao Tian ◽  
Bethann S. Hromatka ◽  
Amy K Kiefer ◽  
Nicholas Eriksson ◽  
Joyce Y Tung ◽  
...  

ABSTRACTWe performed 23 genome-wide association studies for common infections, including chickenpox, shingles, cold sores, mononucleosis, mumps, hepatitis B, plantar warts, positive tuberculosis test results, strep throat, scarlet fever, pneumonia, bacterial meningitis, yeast infections, urinary tract infections, tonsillectomy, childhood ear infections, myringotomy, measles, hepatitis A, rheumatic fever, common colds, rubella and chronic sinus infection, in more than 200,000 individuals of European ancestry. For the first time, genome-wide significant associations (P< 5 × 10−8) were identified for many common infections. The associations were mapped to genes with key roles in acquired and innate immunity(HLA, IFNA21, FUT2, ST3GAL4, ABO, IFNL4, LCE3E, DSG1, LTBR, MTMR3, TNFRSF13B, TNFSF13B, NFKB1, CD40) and in regulation of embryonic developmental process(TBX1, FGF, FOXA1 and FOXN1).Several missense mutations were also identified (inLCE5A, DSG1, FUT2, TBX1, CDHR3, PLG, TNFRSF13B, FOXA1, SH2B3, ST5andFOXN1). Missense mutations inFUT2andTBX1were implicated in multiple infections. We applied fine-mapping analysis to dissect associations in the human leukocyte antigen region, which suggested important roles of specific amino acid polymorphisms in the antigen-binding clefts. Our findings provide an important step toward dissecting the host genetic architecture of response to common infections.


2021 ◽  
Author(s):  
Xing Wu ◽  
Wei Jiang ◽  
Christopher Fragoso ◽  
Jing Huang ◽  
Geyu Zhou ◽  
...  

Genome wide association studies (GWAS) can play an essential role in understanding genetic basis of complex traits in plants and animals. Conventional SNP-based linear mixed models (LMM) used in many GWAS that marginally test single nucleotide polymorphisms (SNPs) have successfully identified many loci with major and minor effects. In plants, the relatively small population size in GWAS and the high genetic diversity found many plant species can impede mapping efforts on complex traits. Here we present a novel haplotype-based trait fine-mapping framework, HapFM, to supplement current GWAS methods. HapFM uses genotype data to partition the genome into haplotype blocks, identifies haplotype clusters within each block, and then performs genome-wide haplotype fine-mapping to infer the causal haplotype blocks of trait. We benchmarked HapFM, GEMMA, BSLMM, and GMMAT in both simulation and real plant GWAS datasets. HapFM consistently resulted in higher mapping power than the other GWAS methods in simulations with high polygenicity. Moreover, it resulted in higher mapping resolution, especially in regions of high LD, by identifying small causal blocks in the larger haplotype block. In the Arabidopsis flowering time (FT10) datasets, HapFM identified four novel loci compared to GEMMA results, and its average mapping interval of HapFM was 9.6 times smaller than that of GEMMA. In conclusion, HapFM is tailored for plant GWAS to result in high mapping power on complex traits and improved mapping resolution to facilitate crop improvement.


Sign in / Sign up

Export Citation Format

Share Document