A practical view of fine-mapping and gene prioritization in the post-genome-wide association era

R. V. Broekema; O. B. Bakker; I. H. Jonkers

doi:10.1098/rsob.190221

Open Biology ◽

10.1098/rsob.190221 ◽

2020 ◽

Vol 10 (1) ◽

pp. 190221 ◽

Cited By ~ 8

Author(s):

R. V. Broekema ◽

O. B. Bakker ◽

I. H. Jonkers

Keyword(s):

Fine Mapping ◽

Complex Traits ◽

Association Studies ◽

Population Based ◽

Gene Prioritization ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Underlying Mechanisms ◽

The Impact

Over the past 15 years, genome-wide association studies (GWASs) have enabled the systematic identification of genetic loci associated with traits and diseases. However, due to resolution issues and methodological limitations, the true causal variants and genes associated with traits remain difficult to identify. In this post-GWAS era, many biological and computational fine-mapping approaches now aim to solve these issues. Here, we review fine-mapping and gene prioritization approaches that, when combined, will improve the understanding of the underlying mechanisms of complex traits and diseases. Fine-mapping of genetic variants has become increasingly sophisticated: initially, variants were simply overlapped with functional elements, but now the impact of variants on regulatory activity and direct variant-gene 3D interactions can be identified. Moreover, gene manipulation by CRISPR/Cas9, the identification of expression quantitative trait loci and the use of co-expression networks have all increased our understanding of the genes and pathways affected by GWAS loci. However, despite this progress, limitations including the lack of cell-type- and disease-specific data and the ever-increasing complexity of polygenic models of traits pose serious challenges. Indeed, the combination of fine-mapping and gene prioritization by statistical, functional and population-based strategies will be necessary to truly understand how GWAS loci contribute to complex traits and diseases.
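The simplest strategy mentioned in this abstract, overlapping variants with functional elements, can be sketched as an interval lookup. The snippet below is only an illustration under stated assumptions: the function name `overlap_variants`, the tuple layouts, and the variant and element labels are all hypothetical, and coordinates are assumed to be half-open intervals on a single genome build.

```python
def overlap_variants(variants, elements):
    """Annotate each variant with the functional elements it falls in.

    variants: list of (variant_id, chrom, pos)            -- hypothetical layout
    elements: list of (chrom, start, end, label), [start, end) half-open
    Returns a dict mapping variant_id -> list of overlapping element labels.
    """
    hits = {}
    for variant_id, chrom, pos in variants:
        hits[variant_id] = [
            label
            for elem_chrom, start, end, label in elements
            if elem_chrom == chrom and start <= pos < end
        ]
    return hits


# Made-up example data, for illustration only.
variants = [("var_a", "chr1", 150), ("var_b", "chr1", 500), ("var_c", "chr2", 150)]
elements = [
    ("chr1", 100, 200, "enhancer"),
    ("chr1", 140, 160, "open_chromatin"),
    ("chr2", 300, 400, "promoter"),
]
hits = overlap_variants(variants, elements)
```

A linear scan is enough for a handful of loci; for genome-scale annotation an interval tree or a sorted sweep would be the more usual choice.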

Causal Haplotype Block Identification in Plant Genome-Wide Association Studies

10.1101/2021.10.28.466332 ◽

2021 ◽

Author(s):

Xing Wu ◽

Wei Jiang ◽

Christopher Fragoso ◽

Jing Huang ◽

Geyu Zhou ◽

...

Keyword(s):

Fine Mapping ◽

Complex Traits ◽

Haplotype Block ◽

Association Studies ◽

Crop Improvement ◽

Plant Genome ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Haplotype Blocks ◽

Genome Wide

Genome-wide association studies (GWAS) can play an essential role in understanding the genetic basis of complex traits in plants and animals. Conventional SNP-based linear mixed models (LMMs), which are used in many GWAS and test single nucleotide polymorphisms (SNPs) marginally, have successfully identified many loci with major and minor effects. In plants, the relatively small population sizes in GWAS and the high genetic diversity found in many plant species can impede mapping efforts on complex traits. Here we present a novel haplotype-based trait fine-mapping framework, HapFM, to supplement current GWAS methods. HapFM uses genotype data to partition the genome into haplotype blocks, identifies haplotype clusters within each block, and then performs genome-wide haplotype fine-mapping to infer the causal haplotype blocks of a trait. We benchmarked HapFM, GEMMA, BSLMM, and GMMAT on both simulated and real plant GWAS datasets. HapFM consistently achieved higher mapping power than the other GWAS methods in simulations with high polygenicity. Moreover, it achieved higher mapping resolution, especially in regions of high LD, by identifying small causal blocks within the larger haplotype block. In the Arabidopsis flowering time (FT10) datasets, HapFM identified four novel loci compared to the GEMMA results, and the average mapping interval of HapFM was 9.6 times smaller than that of GEMMA. In conclusion, HapFM is tailored for plant GWAS to provide high mapping power on complex traits and improved mapping resolution to facilitate crop improvement.
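The first step described above, partitioning the genome into haplotype blocks from genotype data, can be illustrated with a deliberately simple greedy rule: start a new block whenever the squared correlation (r²) between adjacent SNPs drops below a threshold. This is a sketch under stated assumptions and not HapFM's actual partitioning algorithm, which the abstract does not specify; the function names and the 0.8 threshold are hypothetical.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a monomorphic SNP carries no LD information
    return cov * cov / (var_x * var_y)


def partition_blocks(genotypes, threshold=0.8):
    """Greedy block partition (assumed rule, not HapFM's).

    genotypes: list of SNPs in genomic order; each SNP is a list of
               0/1/2 dosages across the same samples.
    Returns a list of blocks, each a list of SNP indices.
    """
    if not genotypes:
        return []
    blocks = [[0]]
    for i in range(1, len(genotypes)):
        if r_squared(genotypes[i - 1], genotypes[i]) >= threshold:
            blocks[-1].append(i)  # still in LD with the previous SNP
        else:
            blocks.append([i])    # LD breaks: open a new block
    return blocks


# Made-up dosages for four SNPs across six samples.
snps = [
    [0, 1, 2, 0, 1, 2],
    [0, 1, 2, 0, 1, 2],
    [2, 1, 0, 2, 1, 0],
    [0, 0, 1, 2, 2, 1],
]
blocks = partition_blocks(snps)
```

Comparing only adjacent SNPs keeps the sketch short; a block definition based on all pairwise LD within a window would be stricter.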

Genome-wide association meta-analyses and fine-mapping elucidate pathways influencing albuminuria

Nature Communications ◽

10.1038/s41467-019-11576-0 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 14

Author(s):

Alexander Teumer ◽

Yong Li ◽

Sahar Ghasemi ◽

Bram P. Prins ◽

Matthias Wuttke ◽

...

Keyword(s):

Fine Mapping ◽

Association Studies ◽

Genome Wide Association ◽

European Ancestry ◽

Genome Wide Association Studies ◽

Priority List ◽

Protein Levels ◽

Genome Wide ◽

Underlying Mechanisms ◽

Meta Analyses

Increased levels of the urinary albumin-to-creatinine ratio (UACR) are associated with higher risk of kidney disease progression and cardiovascular events, but underlying mechanisms are incompletely understood. Here, we conduct trans-ethnic (n = 564,257) and European-ancestry specific meta-analyses of genome-wide association studies of UACR, including ancestry- and diabetes-specific analyses, and identify 68 UACR-associated loci. Genetic correlation analyses and risk score associations in an independent electronic medical records database (n = 192,868) reveal connections with proteinuria, hyperlipidemia, gout, and hypertension. Fine-mapping and trans-Omics analyses with gene expression in 47 tissues and plasma protein levels implicate genes potentially operating through differential expression in kidney (including TGFB1, MUC1, PRKCI, and OAF), and allow coupling of UACR associations to altered plasma OAF concentrations. Knockdown of OAF and PRKCI orthologs in Drosophila nephrocytes reduces albumin endocytosis. Silencing fly PRKCI further impairs slit diaphragm formation. These results generate a priority list of genes and pathways for translational research to reduce albuminuria.
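For readers unfamiliar with GWAS meta-analysis, a common way to combine per-study effect estimates for one variant is inverse-variance-weighted fixed-effect pooling. The abstract does not state which model the authors used, so the snippet below is a generic sketch rather than their method; the function name and the example numbers are hypothetical.

```python
import math


def fixed_effect_meta(estimates):
    """Inverse-variance-weighted fixed-effect meta-analysis for one variant.

    estimates: list of (beta, standard_error) pairs, one per study.
    Returns (pooled_beta, pooled_se, z_score).
    """
    weights = [1.0 / se ** 2 for _, se in estimates]
    total_weight = sum(weights)
    pooled_beta = sum(w * beta for w, (beta, _) in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se


# Two made-up studies with equal precision.
pooled_beta, pooled_se, z_score = fixed_effect_meta([(0.10, 0.05), (0.20, 0.05)])
```

With equal standard errors the pooled effect is the plain average of the two estimates, and the pooled standard error shrinks by a factor of √2.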

Faculty Opinions recommendation of Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.733803377.793550136 ◽

2018 ◽

Author(s):

Mohan Liu

Keyword(s):

Effect Size ◽

Complex Traits ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Size Distributions ◽

Complex Effect ◽

Genome Wide ◽

Level Statistics

CAUSALdb: a database for disease/trait causal variants identified using summary statistics of genome-wide association studies

Nucleic Acids Research ◽

10.1093/nar/gkz1026 ◽

2019 ◽

Cited By ~ 2

Author(s):

Jianhua Wang ◽

Dandan Huang ◽

Yao Zhou ◽

Hongcheng Yao ◽

Huanhuan Liu ◽

...

Keyword(s):

Fine Mapping ◽

Genetic Variants ◽

Association Studies ◽

Complex Trait ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Summary Statistics ◽

Genome Wide ◽

Credible Sets ◽

Causal Variants

Genome-wide association studies (GWASs) have revolutionized the field of complex trait genetics over the past decade, yet for most of the significant genotype-phenotype associations the true causal variants remain unknown. Identifying and interpreting how causal genetic variants confer disease susceptibility is still a big challenge. Herein we introduce a new database, CAUSALdb, to integrate the most comprehensive GWAS summary statistics to date and identify credible sets of potential causal variants using uniformly processed fine-mapping. The database has six major features: it (i) curates 3052 high-quality, fine-mappable GWAS summary statistics across five human super-populations and 2629 unique traits; (ii) estimates causal probabilities of all genetic variants in GWAS significant loci using three state-of-the-art fine-mapping tools; (iii) maps the reported traits to a powerful ontology, MeSH, making it simple for users to browse studies on the trait tree; (iv) incorporates highly interactive Manhattan and LocusZoom-like plots to allow more efficient visualization of credible sets in a single web page; (v) enables online comparison of causal relations at the variant, gene and trait levels among studies with different sample sizes or populations; and (vi) offers comprehensive variant annotations by integrating massive base-wise and allele-specific functional annotations. CAUSALdb is freely available at http://mulinlab.org/causaldb.
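To make the notion of a credible set concrete, the sketch below derives one from summary statistics using Wakefield's approximate Bayes factor and the simplifying assumption of exactly one causal variant per locus. It is not the CAUSALdb pipeline, which relies on three fine-mapping tools not named in the abstract; the function names, the prior variance of 0.04 and the variant identifiers are assumptions for illustration.

```python
import math


def wakefield_log_abf(beta, se, prior_var=0.04):
    """Log approximate Bayes factor in favour of association (Wakefield)."""
    variance = se ** 2
    z_squared = (beta / se) ** 2
    shrinkage = prior_var / (prior_var + variance)
    return 0.5 * (math.log(1.0 - shrinkage) + shrinkage * z_squared)


def credible_set(variants, coverage=0.95, prior_var=0.04):
    """Smallest set of variants whose posterior probabilities reach `coverage`.

    variants: list of (variant_id, beta, se) for a single locus.
    Assumes one causal variant and equal prior probability for every variant.
    Returns (ids_in_set, posterior_by_id).
    """
    log_abfs = [wakefield_log_abf(beta, se, prior_var) for _, beta, se in variants]
    largest = max(log_abfs)  # subtract the maximum for numerical stability
    weights = [math.exp(value - largest) for value in log_abfs]
    total = sum(weights)
    posteriors = {vid: w / total for (vid, _, _), w in zip(variants, weights)}

    chosen, cumulative = [], 0.0
    for vid in sorted(posteriors, key=posteriors.get, reverse=True):
        chosen.append(vid)
        cumulative += posteriors[vid]
        if cumulative >= coverage:
            break
    return chosen, posteriors


# Made-up summary statistics for three variants in one locus.
locus = [("rs_a", 0.30, 0.03), ("rs_b", 0.10, 0.03), ("rs_c", 0.02, 0.03)]
chosen, posteriors = credible_set(locus)
```

Because this version ignores linkage disequilibrium between variants, it only applies when a single signal per locus is plausible; tools that model multiple causal variants also require an LD matrix.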

Exploring the predictive power of polygenic scores derived from genome-wide association studies: a study of 10 complex traits

Bioinformatics ◽

10.1093/bioinformatics/btw745 ◽

2017 ◽

pp. btw745 ◽

Cited By ~ 8

Author(s):

Hon-Cheong So ◽

Pak C. Sham

Keyword(s):

Complex Traits ◽

Predictive Power ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Polygenic Scores

The Impact of Incomplete Linkage Disequilibrium and Genetic Model Choice on the Analysis and Interpretation of Genome-wide Association Studies

Annals of Human Genetics ◽

10.1111/j.1469-1809.2010.00579.x ◽

2010 ◽

Vol 74 (4) ◽

pp. 375-379 ◽

Cited By ~ 6

Author(s):

Mark M. Iles

Keyword(s):

Linkage Disequilibrium ◽

Genetic Model ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Model Choice ◽

Genome Wide ◽

The Impact

Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies

Molecular Breeding ◽

10.1007/s11032-021-01272-7 ◽

2021 ◽

Vol 42 (1) ◽

Author(s):

Dinesh K. Saini ◽

Yuvraj Chopra ◽

Jagmohan Singh ◽

Karansher S. Sandhu ◽

Anand Kumar ◽

...

Keyword(s):

Complex Traits ◽

Comprehensive Evaluation ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Mapping Complex Traits

Guilt by rewiring: gene prioritization through network rewiring in Genome Wide Association Studies

Human Molecular Genetics ◽

10.1093/hmg/ddt668 ◽

2013 ◽

Vol 23 (10) ◽

pp. 2780-2790 ◽

Cited By ~ 32

Author(s):

L. Hou ◽

M. Chen ◽

C. K. Zhang ◽

J. Cho ◽

H. Zhao

Keyword(s):

Association Studies ◽

Gene Prioritization ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Genome Wide ◽

Network Rewiring

GWAS of three molecular traits highlights core genes and pathways alongside a highly polygenic background

10.1101/2020.04.20.051631 ◽

2020 ◽

Cited By ~ 6

Author(s):

Nasa Sinnott-Armstrong ◽

Sahin Naqvi ◽

Manuel Rivas ◽

Jonathan K Pritchard

Keyword(s):

Complex Traits ◽

Genetic Basis ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Biological Processes ◽

Uk Biobank ◽

The Core ◽

Genome Wide ◽

Core Genes

Genome-wide association studies (GWAS) have been used to study the genetic basis of a wide variety of complex diseases and other traits. However, for most traits it remains difficult to interpret what genes and biological processes are impacted by the top hits. Here, as a contrast, we describe UK Biobank GWAS results for three molecular traits (urate, IGF-1, and testosterone) that are biologically simpler than most diseases, and for which we know a great deal in advance about the core genes and pathways. Unlike most GWAS of complex traits, for all three traits we find that most top hits are readily interpretable. We observe huge enrichment of significant signals near genes involved in the relevant biosynthesis, transport, or signaling pathways. We show how GWAS data illuminate the biology of variation in each trait, including insights into differences in testosterone regulation between females and males. Meanwhile, in other respects the results are reminiscent of GWAS for more-complex traits. In particular, even these molecular traits are highly polygenic, with most of the variance coming not from core genes, but from thousands to tens of thousands of variants spread across most of the genome. Given that diseases are often impacted by many distinct biological processes, including these three, our results help to illustrate why so many variants can affect risk for any given disease.

Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases

10.1101/2020.09.08.20190561 ◽

2020 ◽

Cited By ~ 1

Author(s):

Elle M Weeks ◽

Jacob C Ulirsch ◽

Nathan Y Cheng ◽

Brian L Trippe ◽

Rebecca S Fine ◽

...

Keyword(s):

Complex Traits ◽

Association Studies ◽

Gene Prioritization ◽

Protein Interaction Data ◽

Large Set ◽

Genome Wide Association Studies ◽

Protein Protein Interaction ◽

Genome Wide ◽

Causal Genes ◽

Red Blood Cell Count

Genome-wide association studies (GWAS) are a valuable tool for understanding the biology of complex traits, but the associations found rarely point directly to causal genes. Here, we introduce a new method to identify the causal genes by integrating GWAS summary statistics with gene expression, biological pathway, and predicted protein-protein interaction data. We further propose an approach that effectively leverages both polygenic and locus-specific genetic signals by combining results across multiple gene prioritization methods, increasing confidence in prioritized genes. Using a large set of gold standard genes to evaluate our approach, we prioritize 8,402 unique gene-trait pairs with greater than 75% estimated precision across 113 complex traits and diseases, including known genes such as SORT1 for LDL cholesterol, SMIM1 for red blood cell count, and DRD2 for schizophrenia, as well as novel genes such as TTC39B for cholelithiasis. Our results demonstrate that a polygenic approach is a powerful tool for gene prioritization and, in combination with locus-specific signal, improves upon existing methods.
