Extensive impact of low-frequency variants on the phenotypic landscape at population-scale

Mapping Intimacies ◽

10.1101/609917 ◽

2019 ◽

Cited By ~ 1

Author(s):

T. Fournier ◽

O. Abou Saada ◽

J. Hou ◽

J. Peter ◽

E. Caudal ◽

...

Keyword(s):

Genetic Variants ◽

Complex Traits ◽

Growth Traits ◽

Association Studies ◽

Significant Proportion ◽

Low Frequency ◽

Population Level ◽

Initial Population ◽

Phenotypic Variance ◽

Genome Wide Association Studies

Genome-wide association studies (GWAS) make it possible to dissect the genetic basis of complex traits at the population level [1]. However, despite the extensive number of trait-associated loci found, they often fail to explain a large part of the observed phenotypic variance [2–4]. One potential source of this discrepancy could be the preponderance of undetected low-frequency genetic variants in natural populations [5,6]. To increase the allele frequency of those variants and assess their phenotypic effects at the population level, we generated a diallel panel consisting of 3,025 hybrids, derived from pairwise crosses between a subset of natural isolates from a completely sequenced population of 1,011 Saccharomyces cerevisiae isolates. We examined each hybrid across a large number of growth traits, resulting in a total of 148,225 cross/trait combinations. Parental versus hybrid regression analysis showed that while most phenotypic variance is explained by additivity, a significant proportion (29%) is governed by non-additive effects. This is supported by the observation that complete dominance predominates in 25% of the traits. By performing GWAS on the diallel panel, we detected 1,723 significantly associated genetic variants, 16.3% of which are low-frequency variants in the initial population. These variants, which would not be detected using classical GWAS, explain 21% of the phenotypic variance on average. Altogether, our results demonstrate that low-frequency variants should be accounted for, as they contribute to a large part of the phenotypic variation observed in a population.
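
The counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes a full diallel design in which n parents yield n × n hybrids (reciprocal crosses and selfed combinations included); the abstract does not state this, so the derived parent and trait counts are inferences that merely happen to be consistent with the reported totals.

```python
import math

# Figures quoted in the abstract
n_hybrids = 3025
n_combinations = 148225
n_associated_variants = 1723
low_frequency_share = 0.163

# Assumption: a full diallel of n parents gives n * n hybrids.
n_parents = math.isqrt(n_hybrids)
is_perfect_square = n_parents * n_parents == n_hybrids

# Traits per hybrid implied by the cross/trait total
n_traits, remainder = divmod(n_combinations, n_hybrids)

# Approximate count only: the percentage in the abstract is rounded.
approx_low_frequency = round(low_frequency_share * n_associated_variants)

print(n_parents, is_perfect_square, n_traits, remainder, approx_low_frequency)
```

Under that assumption, the totals correspond to 55 parental isolates scored on 49 traits, and roughly 281 of the associated variants are low-frequency in the initial population.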

Extensive impact of low-frequency variants on the phenotypic landscape at population-scale

eLife ◽

10.7554/elife.49258 ◽

2019 ◽

Vol 8 ◽

Cited By ~ 8

Author(s):

Téo Fournier ◽

Omar Abou Saada ◽

Jing Hou ◽

Jackson Peter ◽

Elodie Caudal ◽

...

Keyword(s):

Complex Traits ◽

Association Studies ◽

Low Frequency ◽

Initial Population ◽

Phenotypic Variance ◽

Genome Wide Association Studies ◽

Common Variants ◽

Complete Dominance ◽

Genome Wide ◽

Natural Isolates

Genome-wide association studies (GWAS) make it possible to dissect complex traits and map genetic variants, which often explain relatively little of the heritability. One potential reason is the preponderance of undetected low-frequency variants. To increase their allele frequency and assess their phenotypic impact in a population, we generated a diallel panel of 3025 yeast hybrids, derived from pairwise crosses between natural isolates, and examined a large number of traits. Parental versus hybrid regression analysis showed that while most phenotypic variance is explained by additivity, a third is governed by non-additive effects, with complete dominance having a key role. By performing GWAS on the diallel panel, we found that associated variants with low frequency in the initial population are overrepresented, and that both the fraction of phenotypic variance they explain and their effect sizes are similar to those of common variants. Overall, we highlight the relevance of low-frequency variants to phenotypic variation.

A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome

10.1101/563379 ◽

2019 ◽

Cited By ~ 2

Author(s):

Tom G Richardson ◽

Gibran Hemani ◽

Tom R Gaunt ◽

Caroline L Relton ◽

George Davey Smith

Keyword(s):

Gene Expression ◽

Genetic Variants ◽

Complex Traits ◽

Mendelian Randomization ◽

Drug Repositioning ◽

Association Studies ◽

Thyroid Tissue ◽

Genome Wide Association Studies ◽

Tissue Specific ◽

Genome Wide

Background: Developing insight into tissue-specific transcriptional mechanisms can help improve our understanding of how genetic variants exert their effects on complex traits and disease. By applying the principles of Mendelian randomization, we have undertaken a systematic analysis to evaluate transcriptome-wide associations between gene expression across 48 different tissue types and 395 complex traits. Results: Overall, we identified 100,025 gene-trait associations based on conventional genome-wide corrections (P < 5 × 10⁻⁸) that also provided evidence of genetic colocalization. These results indicated that genetic variants which influence gene expression levels in multiple tissues are more likely to influence multiple complex traits. We identified many examples of tissue-specific effects, such as genetically-predicted TPO, NR3C2 and SPATA13 expression only associating with thyroid disease in thyroid tissue. Additionally, FBN2 expression was associated with both cardiovascular and lung function traits, but only when analysed in heart and lung tissue respectively. We also demonstrate that conducting phenome-wide evaluations of our results can help flag adverse on-target side effects for therapeutic intervention, as well as propose drug repositioning opportunities. Moreover, we find that exploring the tissue-dependency of associations identified by genome-wide association studies (GWAS) can help elucidate the causal genes and tissues responsible for effects, as well as uncover putative novel associations. Conclusions: The atlas of tissue-dependent associations we have constructed should prove extremely valuable to future studies investigating the genetic determinants of complex disease. The follow-up analyses we have performed in this study are merely a guide for future research. Similar evaluations can be undertaken systematically at http://mrcieu.mrsoftware.org/Tissue_MR_atlas/.

Heritability jointly explained by host genotype and microbiome: will improve traits prediction?

Briefings in Bioinformatics ◽

10.1093/bib/bbaa175 ◽

2020 ◽

Author(s):

Denis Awany ◽

Emile R Chimusa

Keyword(s):

Genetic Variants ◽

Association Studies ◽

Heritability Estimate ◽

Substantial Part ◽

Phenotypic Variance ◽

Genome Wide Association Studies ◽

Host Genotype ◽

Genome Wide ◽

Heritability Estimation

As we observe the 70th anniversary of the publication by Robertson that formalized the notion of ‘heritability’, geneticists remain puzzled by the problem of missing/hidden heritability, where heritability estimates from genome-wide association studies (GWASs) fall short of those from twin-based studies. Many possible explanations have been offered for this discrepancy, including the existence of genetic variants poorly captured by existing arrays, dominance, epistasis and unaccounted-for environmental factors, although these remain controversial. We believe a substantial part of this problem could be solved or better understood by incorporating the host’s microbiota information in the GWAS model for heritability estimation, which may also improve human trait prediction for clinical use. This is because, despite empirical observations such as (i) the intimate role of the microbiome in many complex human phenotypes, (ii) the overlap between genetic variants associated with both microbiome attributes and complex diseases and (iii) the existence of heritable bacterial taxa, current GWAS models for heritability estimation do not take into account the contributory role of the microbiome. Furthermore, heritability estimates from twin-based studies do not discern the microbiome component of the observed total phenotypic variance. Here, we summarize the concept of heritability in GWAS and microbiome-wide association studies, focusing on its estimation, from a statistical genetics perspective. We then discuss a possible statistical method to incorporate the microbiome in the estimation of heritability in host GWAS.

GWAS contribution to atrial fibrillation and atrial fibrillation-related stroke: pathophysiological implications

Pharmacogenomics ◽

10.2217/pgs-2019-0054 ◽

2019 ◽

Vol 20 (10) ◽

pp. 765-780 ◽

Cited By ~ 1

Author(s):

Diana Cruz ◽

Ricardo Pinto ◽

Margarida Freitas-Silva ◽

José Pedro Nunes ◽

Rui Medeiros

Keyword(s):

Atrial Fibrillation ◽

Genetic Variants ◽

Complex Traits ◽

Association Studies ◽

Cardioembolic Stroke ◽

Risk Scores ◽

Genome Wide Association Studies ◽

Genetic Determinants ◽

Genome Wide ◽

Genome Wide Significance

Atrial fibrillation (AF) and stroke belong to a group of complex traits that have been studied in terms of their genetic susceptibility determinants. Since 2007, several genome-wide association studies (GWAS) aiming to identify genetic variants modulating AF risk have been conducted. Thus, 11 GWAS have identified 26 SNPs (p < 5 × 10⁻²), of which 19 reached genome-wide significance (p < 5 × 10⁻⁸). Of those variants, seven were also associated with cardioembolic stroke and three reached genome-wide significance in stroke GWAS. These associations may shed light on putative shared etiologic mechanisms between AF and cardioembolic stroke. Additionally, some of these identified variants have been incorporated in genetic risk scores in order to elucidate new approaches to stroke prediction, prevention and treatment.
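
The two significance levels mentioned in this abstract can be illustrated with a short classifier. This is a minimal sketch, reading the thresholds as p < 5 × 10⁻² (nominal) and p < 5 × 10⁻⁸ (genome-wide); the function name and the example p-values are hypothetical and do not come from the paper.

```python
NOMINAL_THRESHOLD = 5e-2       # nominal significance, as read from the abstract
GENOME_WIDE_THRESHOLD = 5e-8   # conventional genome-wide significance


def significance_tier(p_value: float) -> str:
    """Classify a p-value using strict inequalities, as in 'p < threshold'."""
    if p_value < GENOME_WIDE_THRESHOLD:
        return "genome-wide"
    if p_value < NOMINAL_THRESHOLD:
        return "nominal"
    return "not significant"


# Hypothetical p-values, for illustration only
example_p_values = {"snp_a": 3e-12, "snp_b": 4e-4, "snp_c": 0.2}
tiers = {name: significance_tier(p) for name, p in example_p_values.items()}
print(tiers)

# Counts quoted in the abstract: 26 SNPs in total, 19 at genome-wide significance
total_snps, genome_wide_snps = 26, 19
nominal_only_snps = total_snps - genome_wide_snps
print(nominal_only_snps)
```

By the abstract's own counts, seven of the 26 SNPs pass only the nominal threshold.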

A global overview of pleiotropy and genetic architecture in complex traits

10.1101/500090 ◽

2018 ◽

Cited By ~ 19

Author(s):

Kyoko Watanabe ◽

Sven Stringer ◽

Oleksandr Frei ◽

Maša Umićević Mirkov ◽

Tinca J.C. Polderman ◽

...

Keyword(s):

Genetic Variants ◽

Complex Traits ◽

Genetic Architecture ◽

Human Genetics ◽

Association Studies ◽

Regulatory Elements ◽

Genome Wide Association Studies ◽

Multiple Trait ◽

Current Availability ◽

Flanking Regions

After a decade of genome-wide association studies (GWASs), fundamental questions in human genetics are still unanswered, such as the extent of pleiotropy across the genome, the nature of trait-associated genetic variants and the disparate genetic architecture across human traits. The current availability of hundreds of GWAS results provides a unique opportunity to gain insight into these questions. In this study, we harmonized and systematically analysed 4,155 publicly available GWASs. For a subset of well-powered GWAS on 558 unique traits, we provide an extensive overview of pleiotropy and genetic architecture. We show that trait-associated loci cover more than half of the genome, and 90% of those loci are associated with multiple trait domains. We further show that potential causal genetic variants are enriched in coding and flanking regions, as well as in regulatory elements, and how trait-polygenicity is related to an estimate of the required sample size to detect 90% of causal genetic variants. Our results provide novel insights into how genetic variation contributes to trait variation. All GWAS results can be queried and visualized at the GWAS ATLAS resource (http://atlas.ctglab.nl).

An integrated platform to systematically identify causal variants and genes for polygenic human traits

10.1101/813618 ◽

2019 ◽

Cited By ~ 3

Author(s):

Damien J. Downes ◽

Ron Schwessinger ◽

Stephanie J. Hill ◽

Lea Nussbaum ◽

Caroline Scott ◽

...

Keyword(s):

Genetic Variants ◽

Association Studies ◽

Significant Proportion ◽

Genome Wide Association Studies ◽

Effector Genes ◽

Genome Wide ◽

Common Genetic Variants ◽

Integrated Platform ◽

Causal Variants

Genome-wide association studies (GWAS) have identified over 150,000 links between common genetic variants and human traits or complex diseases. Over 80% of these associations map to polymorphisms in non-coding DNA. Therefore, the challenge is to identify disease-causing variants, the genes they affect, and the cells in which these effects occur. We have developed a platform using ATAC-seq, DNaseI footprints, NG Capture-C and machine learning to address this challenge. Applying this approach to red blood cell traits identifies a significant proportion of known causative variants and their effector genes, which we show can be validated by direct in vivo modelling.

circVAR database: genome-wide archive of genetic variants for human circular RNAs

10.21203/rs.3.rs-48904/v2 ◽

2020 ◽

Author(s):

Min Zhao ◽

Hong Qu

Keyword(s):

Genetic Variants ◽

Complex Traits ◽

Large Scale ◽

Rna Binding ◽

Rna Binding Proteins ◽

Association Studies ◽

Chromosome 17 ◽

Circular Rnas ◽

Genome Wide Association Studies ◽

Genome Wide

Background: Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes. Results: We performed extensive genetic annotation of 295,526 circRNAs integrated from circBase, circNet and circRNAdb. All pre-computed genetic variants were presented at our online resource, circVAR, with data browsing and search functionality. We explored the chromosome-based distribution of circRNAs and their associated variants. We found that, based on mapping to the 1000 Genomes and ClinVAR databases, chromosome 17 has a relatively large number of circRNAs and associated common and health-related genetic variants. Following the annotation of genome-wide association study (GWAS)-based circRNA variants, we found many non-coding variants within circRNAs, suggesting novel mechanisms for common diseases reported from GWAS. For cancer-based somatic variants, we found that chromosome 7 has many highly complex mutations that have been overlooked in previous research. Conclusion: We used the circVAR database to collect SNPs and small insertions and deletions (INDELs) in putative circRNA regions and to identify their potential phenotypic information. To provide a reusable resource for the circRNA research community, we have published all the pre-computed genetic data concerning circRNAs and associated genes together with data query and browsing functions at http://soft.bioinfo-minzhao.org/circvar.

Polygenic Prediction of Complex Traits with Iterative Screen Regression Models

10.1101/2020.11.29.402180 ◽

2020 ◽

Author(s):

Meng Luo ◽

Shiliang Gu

Keyword(s):

Genetic Variants ◽

Complex Traits ◽

Regression Models ◽

Association Studies ◽

Prediction Methods ◽

Genome Wide Association Studies ◽

Genotype Data ◽

Genetic Prediction ◽

Genome Wide ◽

Genome Prediction

Although genome-wide association studies have successfully identified thousands of markers associated with various complex traits and diseases, our ability to predict such phenotypes remains limited. A perhaps overlooked explanation lies in the limitations of the genetic models and statistical techniques commonly used in association studies. However, using genotype data for individuals to perform accurate genetic prediction of complex traits can promote genomic selection in animal and plant breeding and can lead to the development of personalized medicine in humans. Because most complex traits have a polygenic architecture, accurate genetic prediction often requires modeling genetic variants together via polygenic methods. Here, we apply our proposed polygenic method, which we refer to as the iterative screen regression model (ISR), to genome prediction. We compared ISR with several commonly used prediction methods in simulations. We further applied ISR to predicting 15 traits across five species: cattle, rice, wheat, maize, and mice. The results of the study indicate that the ISR method performs better, and more stably, than several commonly used polygenic methods.

Genome metabolome integrated network analysis to uncover connections between genetic variants and complex traits: an application to obesity

Journal of The Royal Society Interface ◽

10.1098/rsif.2013.0908 ◽

2014 ◽

Vol 11 (94) ◽

pp. 20130908 ◽

Cited By ~ 14

Author(s):

Beatriz Valcárcel ◽

Timothy M. D. Ebbels ◽

Antti J. Kangas ◽

Pasi Soininen ◽

Paul Elliot ◽

...

Keyword(s):

Network Analysis ◽

Genetic Variants ◽

Complex Traits ◽

Association Studies ◽

System Level ◽

Genome Wide Association Studies ◽

Metabolomics Data ◽

Integrated Network ◽

Molecular Associations ◽

Genome Wide

Current studies of phenotype diversity by genome-wide association studies (GWAS) are mainly focused on identifying genetic variants that influence level changes of individual traits without considering additional alterations at the system level. However, in addition to level alterations of single phenotypes, differences in association between phenotype levels are observed across different physiological states. Such differences in molecular correlations between states can potentially reveal information about the system state beyond that reported by changes in mean levels alone. In this study, we describe a novel methodological approach, which we refer to as genome metabolome integrated network analysis (GEMINi), consisting of a combination of correlation network analysis and genome-wide correlation study. The proposed methodology exploits differences in molecular associations to uncover genetic variants involved in phenotype variation. We test the performance of the GEMINi approach in a simulation study and illustrate its use in the context of obesity and detailed quantitative metabolomics data on systemic metabolism. Application of GEMINi revealed a set of metabolic associations which differ between normal and obese individuals. While no significant associations were found between genetic variants and body mass index using a standard GWAS approach, further investigation of the identified differences in metabolic association revealed a number of loci, several of which have been previously implicated in obesity-related processes. This study highlights the advantage of using molecular associations as an alternative phenotype when studying the genetic basis of complex traits and diseases.

Genomic Prediction and Association Analysis with Models Including Dominance Effects for Important Traits in Chinese Simmental Beef Cattle

Animals ◽

10.3390/ani9121055 ◽

2019 ◽

Vol 9 (12) ◽

pp. 1055 ◽

Cited By ~ 2

Author(s):

Ying Liu ◽

Lei Xu ◽

Zezhao Wang ◽

Ling Xu ◽

Yan Chen ◽

...

Keyword(s):

Beef Cattle ◽

Genomic Prediction ◽

Complex Traits ◽

Genetic Variance ◽

Association Studies ◽

Additive Genetic Variance ◽

Additive Models ◽

Carcass Weight ◽

Phenotypic Variance ◽

Genome Wide Association Studies

Non-additive effects play important roles in determining genetic changes with regard to complex traits; however, such effects are usually ignored in genetic evaluation and quantitative trait locus (QTL) mapping analysis. In this study, a two-component genome-based restricted maximum likelihood (GREML) was applied to obtain the additive genetic variance and dominance variance for carcass weight (CW), dressing percentage (DP), meat percentage (MP), average daily gain (ADG), and chuck roll (CR) in 1233 Simmental beef cattle. We estimated predictive abilities using additive models (genomic best linear unbiased prediction (GBLUP) and BayesA) and dominance models (GBLUP-D and BayesAD). Moreover, genome-wide association studies (GWAS) considering both additive and dominance effects were performed using a multi-locus mixed-model (MLMM) approach. We found that the estimated dominance variances accounted for 15.8%, 16.1%, 5.1%, 4.2%, and 9.7% of the total phenotypic variance for CW, DP, MP, ADG, and CR, respectively. Compared with BayesA and GBLUP, we observed 0.5–1.1% increases in predictive abilities of BayesAD and 0.5–0.9% increases in predictive abilities of GBLUP-D, respectively. Notably, we identified a dominance association signal for carcass weight within RIMS2, a candidate gene that has been associated with carcass weight in beef cattle. Our results suggest that dominance effects yield variable degrees of contribution to the total genetic variance of the studied traits in Simmental beef cattle. BayesAD and GBLUP-D are convenient models for the improvement of genomic prediction, and the detection of QTLs using a dominance model shows promise for use in GWAS in cattle.
