scholarly journals Phenotype-specific enrichment of Mendelian disorder genes near GWAS regions across 62 complex traits

2018 ◽  
Author(s):  
Malika Kumar Freund ◽  
Kathryn Burch ◽  
Huwenbo Shi ◽  
Nicholas Mancuso ◽  
Gleb Kichaev ◽  
...  

ABSTRACTAlthough recent studies provide evidence for a common genetic basis between complex traits and Mendelian disorders, a thorough quantification of their overlap in a phenotype-specific manner remains elusive. Here, we quantify the overlap of genes identified through large-scale genome-wide association studies (GWAS) for 62 complex traits and diseases with genes known to cause 20 broad categories of Mendelian disorders. We identify a significant enrichment of phenotypically-matched Mendelian disorder genes in GWAS gene sets. Further, we observe elevated GWAS effect sizes near phenotypically-matched Mendelian disorder genes. Finally, we report examples of GWAS variants localized at the transcription start site or physically interacting with the promoters of phenotypically-matched Mendelian disorder genes. Our results are consistent with the hypothesis that genes that are disrupted in Mendelian disorders are dysregulated by noncoding variants in complex traits, and demonstrate how leveraging findings from related Mendelian disorders and functional genomic datasets can prioritize genes that are putatively dysregulated by local and distal non-coding GWAS variants.

eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Nasa Sinnott-Armstrong ◽  
Sahin Naqvi ◽  
Manuel Rivas ◽  
Jonathan K Pritchard

Genome-wide association studies (GWAS) have been used to study the genetic basis of a wide variety of complex diseases and other traits. We describe UK Biobank GWAS results for three molecular traits—urate, IGF-1, and testosterone—with better-understood biology than most other complex traits. We find that many of the most significant hits are readily interpretable. We observe huge enrichment of associations near genes involved in the relevant biosynthesis, transport, or signaling pathways. We show how GWAS data illuminate the biology of each trait, including differences in testosterone regulation between females and males. At the same time, even these molecular traits are highly polygenic, with many thousands of variants spread across the genome contributing to trait variance. In summary, for these three molecular traits we identify strong enrichment of signal in putative core gene sets, even while most of the SNP-based heritability is driven by a massively polygenic background.


Author(s):  
Nasa Sinnott-Armstrong ◽  
Sahin Naqvi ◽  
Manuel Rivas ◽  
Jonathan K Pritchard

SummaryGenome-wide association studies (GWAS) have been used to study the genetic basis of a wide variety of complex diseases and other traits. However, for most traits it remains difficult to interpret what genes and biological processes are impacted by the top hits. Here, as a contrast, we describe UK Biobank GWAS results for three molecular traits—urate, IGF-1, and testosterone—that are biologically simpler than most diseases, and for which we know a great deal in advance about the core genes and pathways. Unlike most GWAS of complex traits, for all three traits we find that most top hits are readily interpretable. We observe huge enrichment of significant signals near genes involved in the relevant biosynthesis, transport, or signaling pathways. We show how GWAS data illuminate the biology of variation in each trait, including insights into differences in testosterone regulation between females and males. Meanwhile, in other respects the results are reminiscent of GWAS for more-complex traits. In particular, even these molecular traits are highly polygenic, with most of the variance coming not from core genes, but from thousands to tens of thousands of variants spread across most of the genome. Given that diseases are often impacted by many distinct biological processes, including these three, our results help to illustrate why so many variants can affect risk for any given disease.


2018 ◽  
Author(s):  
Doug Speed ◽  
David J Balding

LD Score Regression (LDSC) has been widely applied to the results of genome-wide association studies. However, its estimates of SNP heritability are derived from an unrealistic model in which each SNP is expected to contribute equal heritability. As a consequence, LDSC tends to over-estimate confounding bias, under-estimate the total phenotypic variation explained by SNPs, and provide misleading estimates of the heritability enrichment of SNP categories. Therefore, we present SumHer, software for estimating SNP heritability from summary statistics using more realistic heritability models. After demonstrating its superiority over LDSC, we apply SumHer to the results of 24 large-scale association studies (average sample size 121 000). First we show that these studies have tended to substantially over-correct for confounding, and as a result the number of genome-wide significant loci has under-reported by about 20%. Next we estimate enrichment for 24 categories of SNPs defined by functional annotations. A previous study using LDSC reported that conserved regions were 13-fold enriched, and found a further twelve categories with above 2-fold enrichment. By contrast, our analysis using SumHer finds that conserved regions are only 1.6-fold (SD 0.06) enriched, and that no category has enrichment above 1.7-fold. SumHer provides an improved understanding of the genetic architecture of complex traits, which enables more efficient analysis of future genetic data.


2021 ◽  
Author(s):  
Alex N. Nguyen Ba ◽  
Katherine R. Lawrence ◽  
Artur Rego-Costa ◽  
Shreyas Gopalakrishnan ◽  
Daniel Temko ◽  
...  

Mapping the genetic basis of complex traits is critical to uncovering the biological mechanisms that underlie disease and other phenotypes. Genome-wide association studies (GWAS) in humans and quantitative trait locus (QTL) mapping in model organisms can now explain much of the observed heritability in many traits, allowing us to predict phenotype from genotype. However, constraints on power due to statistical confounders in large GWAS and smaller sample sizes in QTL studies still limit our ability to resolve numerous small-effect variants, map them to causal genes, identify pleiotropic effects across multiple traits, and infer non-additive interactions between loci (epistasis). Here, we introduce barcoded bulk quantitative trait locus (BB-QTL) mapping, which allows us to construct, genotype, and phenotype 100,000 offspring of a budding yeast cross, two orders of magnitude larger than the previous state of the art. We use this panel to map the genetic basis of eighteen complex traits, finding that the genetic architecture of these traits involves hundreds of small-effect loci densely spaced throughout the genome, many with widespread pleiotropic effects across multiple traits. Epistasis plays a central role, with thousands of interactions that provide insight into genetic networks. By dramatically increasing sample size, BB-QTL mapping demonstrates the potential of natural variants in high-powered QTL studies to reveal the highly polygenic, pleiotropic, and epistatic architecture of complex traits.Significance statementUnderstanding the genetic basis of important phenotypes is a central goal of genetics. However, the highly polygenic architectures of complex traits inferred by large-scale genome-wide association studies (GWAS) in humans stand in contrast to the results of quantitative trait locus (QTL) mapping studies in model organisms. Here, we use a barcoding approach to conduct QTL mapping in budding yeast at a scale two orders of magnitude larger than the previous state of the art. The resulting increase in power reveals the polygenic nature of complex traits in yeast, and offers insight into widespread patterns of pleiotropy and epistasis. Our data and analysis methods offer opportunities for future work in systems biology, and have implications for large-scale GWAS in human populations.


2020 ◽  
Author(s):  
Min Zhao ◽  
Hong Qu

Abstract Background: Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes. Results: We performed extensive genetic annotation of 295,526 circRNAs integrated from circBase, circNet and circRNAdb. All pre-computed genetic variants were presented at our online resource, circVAR, with data browsing and search functionality. We explored the chromosome-based distribution of circRNAs and their associated variants. We found that, based on mapping to the 1000 Genomes and ClinVAR databases, chromosome 17 has a relatively large number of circRNAs and associated common and health-related genetic variants. Following the annotation of genome wide association studies (GWAS)-based circRNA variants, we found many non-coding variants within circRNAs, suggesting novel mechanisms for common diseases reported from GWAS studies. For cancer-based somatic variants, we found that chromosome 7 has many highly complex mutations that have been overlooked in previous research. Conclusion: We used the circVAR database to collect SNPs and small insertions and deletions (INDELs) in putative circRNA regions and to identify their potential phenotypic information. To provide a reusable resource for the circRNA research community, we have published all the pre-computed genetic data concerning circRNAs and associated genes together with data query and browsing functions at http://soft.bioinfo-minzhao.org/circvar .


2021 ◽  
Author(s):  
Noemie Valenza-Troubat ◽  
Sara Montanari ◽  
Peter Ritchie ◽  
Maren Wellenreuther

AbstractGrowth directly influences production rate and therefore is one of the most important and well-studied trait in animal breeding. However, understanding the genetic basis of growth has been hindered by its typically complex polygenic architecture. Here, we performed quantitative trait locus (QTL) mapping and genome-wide association studies (GWAS) for 10 growth traits that were observed over two years in 1,100 F1 captive-bred trevally (Pseudocaranx georgianus). We constructed the first high-density linkage map for trevally, which included 19,861 single nucleotide polymorphism (SNP) markers, and discovered eight QTLs for height, length and weight on linkage groups 3, 14 and 18. Using GWAS, we further identified 113 SNP-trait associations, uncovering 10 genetic hot spots involved in growth. Two of the markers found in the GWAS co-located with the QTLs previously mentioned, demonstrating that combining QTL mapping and GWAS represents a powerful approach for the identification and validation of loci controlling complex traits. This is the first study of its kind for trevally. Our findings provide important insights into the genetic architecture of growth in this species and supply a basis for fine mapping QTLs, marker-assisted selection, and further detailed functional analysis of the genes underlying growth in trevally.


2018 ◽  
Author(s):  
Karl A. G. Kremling ◽  
Christine H. Diepenbrock ◽  
Michael A. Gore ◽  
Edward S. Buckler ◽  
Nonoy B. Bandillo

AbstractModern improvement of complex traits in agricultural species relies on successful associations of heritable molecular variation with observable phenotypes. Historically, this pursuit has primarily been based on easily measurable genetic markers. The recent advent of new technologies allows assaying and quantifying biological intermediates (hereafter endophenotypes) which are now readily measurable at a large scale across diverse individuals. The potential of using endophenotypes for dissecting traits of interest remains underexplored in plants. The work presented here illustrated the utility of a large-scale (299 genotype and 7 tissue) gene expression resource to dissect traits across multiple levels of biological organization. Using single-tissue- and multi-tissue-based transcriptome-wide association studies (TWAS), we revealed that about half of the functional variation for agronomic and seed quality (carotenoid, tocochromanol) traits is regulatory. Comparing the efficacy of TWAS with genome-wide association studies (GWAS) and an ensemble approach that combines both GWAS and TWAS, we demonstrated that results of TWAS in combination with GWAS increase the power to detect known genes and aid in prioritizing likely causal genes. Using a variance partitioning approach in the independent maize Nested Association Mapping (NAM) population, we also showed that the most strongly associated genes identified by combining GWAS and TWAS explain more heritable variance for a majority of traits, beating the heritability captured by the random genes and the genes identified by GWAS or TWAS alone. This improves not only the ability to link genes to phenotypes, but also highlights the phenotypic consequences of regulatory variation in plants.Author summaryWe examined the ability to associate variability in gene expression directly with terminal phenotypes of interest, as a supplement linking genotype to phenotype. We found that transcriptome-wide association studies (TWAS) are a useful accessory to genome-wide association studies (GWAS). In a combined test with GWAS results, TWAS improves the capacity to re-detect genes known to underlie quantitative trait loci for kernel and agronomic phenotypes. This improves not only the capacity to link genes to phenotypes, but also illustrates the widespread importance of regulation for phenotype.


2016 ◽  
Author(s):  
Bogdan Pasaniuc ◽  
Alkes L. Price

AbstractDuring the past decade, genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with complex traits and diseases. These studies have produced vast repositories of genetic variation and trait measurements across millions of individuals, providing tremendous opportunities for further analyses. However, privacy concerns and other logistical considerations often limit access to individual-level genetic data, motivating the development of methods that analyze summary association statistics. Here we review recent progress on statistical methods that leverage summary association data to gain insights into the genetic basis of complex traits and diseases.


2016 ◽  
Vol 47 (6) ◽  
pp. 1116-1125 ◽  
Author(s):  
M. Liu ◽  
S. M. Malone ◽  
U. Vaidyanathan ◽  
M. C. Keller ◽  
G. Abecasis ◽  
...  

BackgroundEndophenotypes are laboratory-based measures hypothesized to lie in the causal chain between genes and clinical disorder, and to serve as a more powerful way to identify genes associated with the disorder. One promise of endophenotypes is that they may assist in elucidating the neurobehavioral mechanisms by which an associated genetic polymorphism affects disorder risk in complex traits. We evaluated this promise by testing the extent to which variants discovered to be associated with schizophrenia through large-scale meta-analysis show associations with psychophysiological endophenotypes.MethodWe genome-wide genotyped and imputed 4905 individuals. Of these, 1837 were whole-genome-sequenced at 11× depth. In a community-based sample, we conducted targeted tests of variants within schizophrenia-associated loci, as well as genome-wide polygenic tests of association, with 17 psychophysiological endophenotypes including acoustic startle response and affective startle modulation, antisaccade, multiple frequencies of resting electroencephalogram (EEG), electrodermal activity and P300 event-related potential.ResultsUsing single variant tests and gene-based tests we found suggestive evidence for an association between contactin 4 (CNTN4) and antisaccade and P300. We were unable to find any other variant or gene within the 108 schizophrenia loci significantly associated with any of our 17 endophenotypes. Polygenic risk scores indexing genetic vulnerability to schizophrenia were not related to any of the psychophysiological endophenotypes after correction for multiple testing.ConclusionsThe results indicate significant difficulty in using psychophysiological endophenotypes to characterize the genetically influenced neurobehavioral mechanisms by which risk loci identified in genome-wide association studies affect disorder risk.


Author(s):  
Max Lam ◽  
Chia-Yen Chen ◽  
Tian Ge ◽  
Yan Xia ◽  
David W. Hill ◽  
...  

AbstractBroad-based cognitive deficits are an enduring and disabling symptom for many patients with severe mental illness, and these impairments are inadequately addressed by current medications. While novel drug targets for schizophrenia and depression have emerged from recent large-scale genome-wide association studies (GWAS) of these psychiatric disorders, GWAS of general cognitive ability can suggest potential targets for nootropic drug repurposing. Here, we (1) meta-analyze results from two recent cognitive GWAS to further enhance power for locus discovery; (2) employ several complementary transcriptomic methods to identify genes in these loci that are credibly associated with cognition; and (3) further annotate the resulting genes using multiple chemoinformatic databases to identify “druggable” targets. Using our meta-analytic data set (N = 373,617), we identified 241 independent cognition-associated loci (29 novel), and 76 genes were identified by 2 or more methods of gene identification. Actin and chromatin binding gene sets were identified as novel pathways that could be targeted via drug repurposing. Leveraging our transcriptomic and chemoinformatic databases, we identified 16 putative genes targeted by existing drugs potentially available for cognitive repurposing.


Sign in / Sign up

Export Citation Format

Share Document