scholarly journals Leveraging eQTLs to identify individual-level tissue of interest for a complex trait

2019 ◽  
Author(s):  
Arunabha Majumdar ◽  
Claudia Giambartolomei ◽  
Na Cai ◽  
Tanushree Haldar ◽  
Tommer Schwarz ◽  
...  

AbstractGenetic predisposition for complex traits often acts through multiple tissues at different time points during development. As a simple example, the genetic predisposition for obesity could be manifested either through inherited variants that control metabolism through regulation of genes expressed in the brain, or that control fat storage through dysregulation of genes expressed in adipose tissue, or both. Here we describe a statistical approach that leverages tissue-specific expression quantitative trait loci (eQTLs) corresponding to tissue-specific genes to prioritize a relevant tissue underlying the genetic predisposition of a given individual for a complex trait. Unlike existing approaches that prioritize relevant tissues for the trait in the population, our approach probabilistically quantifies the tissue-wise genetic contribution to the trait for a given individual. We hypothesize that for a subgroup of individuals the genetic contribution to the trait can be mediated primarily through a specific tissue. Through simulations using the UK Biobank, we show that our approach can predict the relevant tissue accurately and can cluster individuals according to their tissue-specific genetic architecture. We analyze body mass index (BMI) and waist to hip ratio adjusted for BMI (WHRadjBMI) in the UK Biobank to identify subgroups of individuals whose genetic predisposition act primarily through brain versus adipose tissue, and adipose versus muscle tissue, respectively. Notably, we find that these individuals have specific phenotypic features beyond BMI and WHRadjBMI that distinguish them from random individuals in the data, suggesting biological effects of tissue-specific genetic contribution for these traits.

2018 ◽  
Author(s):  
Carla Márquez-Luna ◽  
Steven Gazal ◽  
Po-Ru Loh ◽  
Samuel S. Kim ◽  
Nicholas Furlotte ◽  
...  

AbstractGenetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, which includes coding, conserved, regulatory and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. LDpred-funct attained higher prediction accuracy than other polygenic prediction methods in simulations using real genotypes. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank. We used association statistics from British-ancestry samples as training data (avg N=373K) and samples of other European ancestries as validation data (avg N=22K), to minimize confounding. LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2=0.144; highest R2=0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (total N=1107K; higher heritability in UK Biobank cohort) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Carla Márquez-Luna ◽  
Steven Gazal ◽  
Po-Ru Loh ◽  
Samuel S. Kim ◽  
Nicholas Furlotte ◽  
...  

AbstractPolygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2 = 0.144; highest R2 = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.


2021 ◽  
Author(s):  
Duncan S Palmer ◽  
Wei Zhou ◽  
Liam Abbott ◽  
Nik Baya ◽  
Claire Churchhouse ◽  
...  

In classical statistical genetic theory, a dominance effect is defined as the deviation from a purely additive genetic effect for a biallelic variant. Dominance effects are well documented in model organisms. However, evidence in humans is limited to a handful of traits, particularly those with strong single locus effects such as hair color. We carried out the largest systematic evaluation of dominance effects on phenotypic variance in the UK Biobank. We curated and tested over 1,000 phenotypes for dominance effects through GWAS scans, identifying 175 loci at genome-wide significance correcting for multiple testing (P < 4.7 × 10-11). Power to detect non-additive loci is much lower than power to detect additive effects for complex traits: based on the relative effect sizes at genome-wide significant additive loci, we estimate a factor of 20-30 increase in sample size will be necessary to capture clear evidence of dominance similar to those currently observed for additive effects. However, these localised dominance hits do not extend to a significant aggregate contribution to phenotypic variance genome-wide. By deriving a version of LD-score regression to detect dominance effects tagged by common variation genome-wide (minor allele frequency > 0.05), we found no strong evidence of a contribution to phenotypic variance when accounting for multiple testing. Across the 267 continuous and 793 binary traits the median contribution was 5.73 × 10-4, with unbiased point estimates ranging from -0.261 to 0.131. Finally, we introduce dominance fine-mapping to explore whether the more rapid decay of dominance LD can be leveraged to find causal variants. These results provide the most comprehensive assessment of dominance trait variation in humans to date.


2020 ◽  
Author(s):  
Roni Rasnic ◽  
Nathan Linial ◽  
Michal Linial

AbstractIt is estimated that up to 10% of cancer incidents are attributed to inherited genetic alterations. Despite extensive research, there are still gaps in our understanding of genetic predisposition to cancer. It was theorized that ultra-rare variants partially account for the missing heritable component. We harness the UK BioBank dataset of ∼500,000 individuals, 14% of which were diagnosed with cancer, to detect ultra-rare, possibly high-penetrance cancer predisposition variants. We report on 115 cancer-exclusive ultra-rare variations (CUVs) and nominate 26 variants with additional independent evidence as cancer predisposition variants. We conclude that population cohorts are valuable source for expanding the collection of novel cancer predisposition genes.


BMJ ◽  
2019 ◽  
pp. l4410 ◽  
Author(s):  
Agustin Cerani ◽  
Sirui Zhou ◽  
Vincenzo Forgetta ◽  
John A Morris ◽  
Katerina Trajanoska ◽  
...  

Abstract Objective To determine if genetically increased serum calcium levels are associated with improved bone mineral density and a reduction in osteoporotic fractures. Design Mendelian randomisation study. Setting Cohorts used included: the UK Biobank cohort, providing genotypic and estimated bone mineral density data; 25 cohorts from UK, USA, Europe, and China, providing genotypic and fracture data; and 17 cohorts from Europe, providing genotypic and serum calcium data (summary level statistics). Participants A genome-wide association meta-analysis of serum calcium levels in up to 61 079 individuals was used to identify genetic determinants of serum calcium levels. The UK Biobank study was used to assess the association of genetic predisposition to increased serum calcium with estimated bone mineral density derived from heel ultrasound in 426 824 individuals who had, on average, calcium levels in the normal range. A fracture genome-wide association meta-analysis comprising 24 cohorts and the UK Biobank including a total of 76 549 cases and 470 164 controls, who, on average, also had calcium levels in the normal range was then performed. Results A standard deviation increase in genetically derived serum calcium (0.13 mmol/L or 0.51 mg/dL) was not associated with increased estimated bone mineral density (0.003 g/cm 2 , 95% confidence interval −0.059 to 0.066; P=0.92) or a reduced risk of fractures (odds ratio 1.01, 95% confidence interval 0.89 to 1.15; P=0.85) in inverse-variance weighted mendelian randomisation analyses. Sensitivity analyses did not provide evidence of pleiotropic effects. Conclusions Genetic predisposition to increased serum calcium levels in individuals with normal calcium levels is not associated with an increase in estimated bone mineral density and does not provide clinically relevant protection against fracture. Whether such predisposition mimics the effect of short term calcium supplementation is not known. Given that the same genetically derived increase in serum calcium is associated with an increased risk of coronary artery disease, widespread calcium supplementation in the general population could provide more risk than benefit.


2021 ◽  
Author(s):  
Ghislain Rocheleau ◽  
Iain S Forrest ◽  
Áine Duffy ◽  
Shantanu Bafna ◽  
Amanda Dobbyn ◽  
...  

Background: Phenome-wide association studies conducted in electronic health record (EHR)-linked biobanks have uncovered a large number of genomic loci associated with traits and diseases. However, interpretation of the complex relationships of associated genes and phenotypes is challenging. Results: We constructed a tissue-level phenome-wide network map of colocalized genes and phenotypes. First, we generated colocalized expression quantitative trait loci from 48 tissues of the Genotype-Tissue Expression project and from publicly available genome-wide association study summary statistics from the UK Biobank. We identified 9,151 colocalized genes for 1,411 phenotypes across 48 tissues. Then, we constructed a bipartite network using the colocalized signals to establish links between genes and phenotypes in each tissue. The majority of links are observed in a single tissue whereas only a few are present in all tissues. Finally, we applied the biLouvain clustering algorithm in each tissue-specific bipartite network to identify co-clusters of non-overlapping genes and phenotypes. The majority of co-clusters contains a small number of genes and phenotypes, and 88.6% of co-clusters are found in only one tissue. To demonstrate functionality of the phenome-wide map, we tested if these co-clusters were enriched with known biological and functional gene classes and observed several significant enrichments. Furthermore, we observed that tissue-specific co-clusters are enriched with reported drug side effects for the corresponding drug target genes in clinical trial data. Conclusions: The phenome-wide map provides links between genes, phenotypes and tissues across a wide spectrum of biological classes and can yield biological and clinical discoveries. The phenome-wide map is publicly available at https://rstudio-connect.hpc.mssm.edu/biPheMap/.


2017 ◽  
Author(s):  
Åsa Johansson ◽  
Mathias Rask-Andersen ◽  
Torgny Karlsson ◽  
Weronica E. Ek

AbstractEven though heritability estimates suggest that the risk of asthma, hay fever and eczema is largely due to genetic factors, previous studies have not explained a large part of the genetics behind these diseases. In this GWA study, we include 346,545 Caucasians from the UK Biobank to identify novel loci for asthma, hay fever and eczema. We further investigate if associated lead SNPs have a significantly larger effect for one disease compared to the other diseases, to highlight possible disease specific effects.We identified 141 loci, of which 41 are novel, to be associated (P≤3×10−8) with asthma, hay fever or eczema, analysed separately or as disease phenotypes that includes the presence of different combinations of these diseases. The largest number of loci were associated with the combined phenotype (asthma/hay fever/eczema). However, as many as 20 loci had a significantly larger effect on hay fever/eczema-only compared to their effects on asthma, while 26 loci exhibited larger effects on asthma compared with their effects on hay fever/eczema. At four of the novel loci, TNFRSF8, MYRF, TSPAN8, and BHMG1, the lead SNPs were in LD (> 0.8) with potentially casual missense variants.Our study shows that a large amount of the genetic contribution is shared between the diseases. Nonetheless, a number of SNPs have a significantly larger effect on one of the phenotypes suggesting that part of the genetic contribution is more phenotype specific. Identified loci and probable causal genes may in the future be used as targets for treatments of asthma, hay fever and eczema.


2021 ◽  
Author(s):  
Daniel Ho ◽  
William Schierding ◽  
Sophie L Farrow ◽  
Antony Cooper ◽  
Justin M. O'Sullivan ◽  
...  

Parkinson disease (PD) is a complex neurodegenerative disease with a range of causes and clinical presentations. Over 76 genetic loci (comprising 90 SNPs) have been associated with PD by the most recent GWAS meta-analysis. Most of these PD-associated variants are located in non-coding regions of the genome and it is difficult to understand what they are doing and how they contribute to the aetiology of PD. We hypothesised that PD-associated genetic variants modulate disease risk through tissue-specific expression quantitative trait loci (eQTL) effects. We developed and validated a machine learning approach that integrated tissue-specific eQTL data on known PD-associated genetic variants with PD case and control genotypes from the Wellcome Trust Case Control Consortium, the UK Biobank, and NeuroX. In so doing, our analysis ranked the tissue-specific transcription effects for PD-associated genetic variants and estimated their relative contributions to PD risk. We identified roles for SNPs that are connected with INPP5P, CNTN1, GBA and SNCA in PD. Ranking the variants and tissue-specific eQTL effects contributing most to the machine learning model suggested a key role in the risk of developing PD for two variants (rs7617877 and rs6808178) and eQTL associated transcriptional changes of EAF1-AS1 within the heart atrial appendage. Similarly, effects associated with eQTLs located within the brain cerebellum were also recognized to confer major PD risk. These findings warrant further mechanistic investigations to determine if these transcriptional changes could act as early contributors to PD risk and disease development.


BJPsych Open ◽  
2021 ◽  
Vol 7 (2) ◽  
Author(s):  
Kylie P. Glanville ◽  
Jonathan R. I. Coleman ◽  
David M. Howard ◽  
Oliver Pain ◽  
Ken B. Hanscombe ◽  
...  

Background The UK Biobank contains data with varying degrees of reliability and completeness for assessing depression. A third of participants completed a Mental Health Questionnaire (MHQ) containing the gold-standard Composite International Diagnostic Interview (CIDI) criteria for assessing mental health disorders. Aims To investigate whether multiple observations of depression from sources other than the MHQ can enhance the validity of major depressive disorder (MDD). Method In participants who did not complete the MHQ, we calculated the number of other depression measures endorsed, for example from hospital episode statistics and interview data. We compared cases defined this way with CIDI-defined cases for several estimates: the variance explained by polygenic risk scores (PRS), area under the curve attributable to PRS, single nucleotide polymorphisms (SNPs)-based heritability and genetic correlations with summary statistics from the Psychiatric Genomics Consortium MDD genome-wide association study. Results The strength of the genetic contribution increased with the number of measures endorsed. For example, SNP-based heritability increased from 7% in participants who endorsed only one measure of depression, to 21% in those who endorsed four or five measures of depression. The strength of the genetic contribution to cases defined by at least two measures approximated that for CIDI-defined cases. Most genetic correlations between UK Biobank and the Psychiatric Genomics Consortium MDD study exceeded 0.7, but there was variability between pairwise comparisons. Conclusions Multiple measures of depression can serve as a reliable approximation for case status where the CIDI measure is not available, indicating sample size can be optimised using the entire suite of UK Biobank data.


Sign in / Sign up

Export Citation Format

Share Document