scholarly journals The impact of rare variation on gene expression across tissues

Nature ◽  
2017 ◽  
Vol 550 (7675) ◽  
pp. 239-243 ◽  
Author(s):  
Xin Li ◽  
◽  
Yungil Kim ◽  
Emily K. Tsang ◽  
Joe R. Davis ◽  
...  

Abstract Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk1,2,3,4. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants1,5. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles1,6,7, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues8,9,10,11, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release12. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.

2016 ◽  
Author(s):  
Xin Li ◽  
Yungil Kim ◽  
Emily K. Tsang ◽  
Joe R. Davis ◽  
Farhan N. Damani ◽  
...  

AbstractRare genetic variants are abundant in humans yet their functional effects are often unknown and challenging to predict. The Genotype-Tissue Expression (GTEx) project provides a unique opportunity to identify the functional impact of rare variants through combined analyses of whole genomes and multi-tissue RNA-sequencing data. Here, we identify gene expression outliers, or individuals with extreme expression levels, across 44 human tissues, and characterize the contribution of rare variation to these large changes in expression. We find 58% of underexpression and 28% of overexpression outliers have underlying rare variants compared with 9% of non-outliers. Large expression effects are enriched for proximal loss-of-function, splicing, and structural variants, particularly variants near the TSS and at evolutionarily conserved sites. Known disease genes have expression outliers, underscoring that rare variants can contribute to genetic disease risk. To prioritize functional rare regulatory variants, we develop RIVER, a Bayesian approach that integrates RNA and whole genome sequencing data from the same individual. RIVER predicts functional variants significantly better than models using genomic annotations alone, and is an extensible tool for personal genome interpretation. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues with potential health consequences, and provide an integrative method for interpreting rare variants in individual genomes.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Federica Perrone ◽  
Rita Cacace ◽  
Julie van der Zee ◽  
Christine Van Broeckhoven

AbstractKnowledge of the molecular etiology of neurodegenerative brain diseases (NBD) has substantially increased over the past three decades. Early genetic studies of NBD families identified rare and highly penetrant deleterious mutations in causal genes that segregate with disease. Large genome-wide association studies uncovered common genetic variants that influenced disease risk. Major developments in next-generation sequencing (NGS) technologies accelerated gene discoveries at an unprecedented rate and revealed novel pathways underlying NBD pathogenesis. NGS technology exposed large numbers of rare genetic variants of uncertain significance (VUS) in coding regions, highlighting the genetic complexity of NBD. Since experimental studies of these coding rare VUS are largely lacking, the potential contributions of VUS to NBD etiology remain unknown. In this review, we summarize novel findings in NBD genetic etiology driven by NGS and the impact of rare VUS on NBD etiology. We consider different mechanisms by which rare VUS can act and influence NBD pathophysiology and discuss why a better understanding of rare VUS is instrumental for deriving novel insights into the molecular complexity and heterogeneity of NBD. New knowledge might open avenues for effective personalized therapies.


Author(s):  
S. Rubinacci ◽  
D.M. Ribeiro ◽  
R. Hofmeister ◽  
O. Delaneau

AbstractLow-coverage whole genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined as current imputation methods are computationally expensive and unable to leverage large reference panels.Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. It achieves imputation of a full genome for less than $1, outperforming existing methods by orders of magnitude, with an increased accuracy of more than 20% at rare variants. We also show that 1x coverage enables effective association studies and is better suited than dense SNP arrays to access the impact of rare variations. Overall, this study demonstrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies.


2009 ◽  
Vol 296 (5) ◽  
pp. L713-L725 ◽  
Author(s):  
Li Gao ◽  
Kathleen C. Barnes

It has been well established that acute lung injury (ALI), and the more severe presentation of acute respiratory distress syndrome (ARDS), constitute complex traits characterized by a multigenic and multifactorial etiology. Identification and validation of genetic variants contributing to disease susceptibility and severity has been hampered by the profound heterogeneity of the clinical phenotype and the role of environmental factors, which includes treatment, on outcome. The critical nature of ALI and ARDS, compounded by the impact of phenotypic heterogeneity, has rendered the amassing of sufficiently powered studies especially challenging. Nevertheless, progress has been made in the identification of genetic variants in select candidate genes, which has enhanced our understanding of the specific pathways involved in disease manifestation. Identification of novel candidate genes for which genetic association studies have confirmed a role in disease has been greatly aided by the powerful tool of high-throughput expression profiling. This article will review these studies to date, summarizing candidate genes associated with ALI and ARDS, acknowledging those that have been replicated in independent populations, with a special focus on the specific pathways for which candidate genes identified so far can be clustered.


2021 ◽  
Author(s):  
Aleksejs Sazonovs ◽  
Christine R Stevens ◽  
Guhan R Venkataraman ◽  
Kai Yuan ◽  
Brandon Avila ◽  
...  

Genome-wide association studies (GWAS) have identified hundreds of loci associated with Crohns disease (CD); however, as with all complex diseases, deriving pathogenic mechanisms from these non-coding GWAS discoveries has been challenging. To complement GWAS and better define actionable biological targets, we analysed sequence data from more than 30,000 CD cases and 80,000 population controls. We observe rare coding variants in established CD susceptibility genes as well as ten genes where coding variation directly implicates the gene in disease risk for the first time.


2020 ◽  
Author(s):  
Craig Smail ◽  
Nicole M. Ferraro ◽  
Matthew G. Durrant ◽  
Abhiram S. Rao ◽  
Matthew Aguirre ◽  
...  

SummaryPolygenic risk scores (PRS) aim to quantify the contribution of multiple genetic loci to an individual’s likelihood of a complex trait or disease. However, existing PRS estimate genetic liability using common genetic variants, excluding the impact of rare variants. We identified rare, large-effect variants in individuals with outlier gene expression from the GTEx project and then assessed their impact on PRS predictions in the UK Biobank (UKB). We observed large deviations from the PRS-predicted phenotypes for carriers of multiple outlier rare variants; for example, individuals classified as “low-risk” but in the top 1% of outlier rare variant burden had a 6-fold higher rate of severe obesity. We replicated these findings using data from the NHLBI Trans-Omics for Precision Medicine (TOPMed) biobank and the Million Veteran Program, and demonstrated that PRS across multiple traits will significantly benefit from the inclusion of rare genetic variants.


2021 ◽  
pp. annrheumdis-2020-218359
Author(s):  
Xinyi Meng ◽  
Xiaoyuan Hou ◽  
Ping Wang ◽  
Joseph T Glessner ◽  
Hui-Qi Qu ◽  
...  

ObjectiveJuvenile idiopathic arthritis (JIA) is the most common type of arthritis among children, but a few studies have investigated the contribution of rare variants to JIA. In this study, we aimed to identify rare coding variants associated with JIA for the genome-wide landscape.MethodsWe established a rare variant calling and filtering pipeline and performed rare coding variant and gene-based association analyses on three RNA-seq datasets composed of 228 JIA patients in the Gene Expression Omnibus against different sets of controls, and further conducted replication in our whole-exome sequencing (WES) data of 56 JIA patients. Then we conducted differential gene expression analysis and assessed the impact of recurrent functional coding variants on gene expression and signalling pathway.ResultsBy the RNA-seq data, we identified variants in two genes reported in literature as JIA causal variants, as well as additional 63 recurrent rare coding variants seen only in JIA patients. Among the 44 recurrent rare variants found in polyarticular patients, 10 were replicated by our WES of patients with the same JIA subtype. Several genes with recurrent functional rare coding variants have also common variants associated with autoimmune diseases. We observed immune pathways enriched for the genes with rare coding variants and differentially expressed genes.ConclusionThis study elucidated a novel landscape of recurrent rare coding variants in JIA patients and uncovered significant associations with JIA at the gene pathway level. The convergence of common variants and rare variants for autoimmune diseases is also highlighted in this study.


2021 ◽  
Author(s):  
Henrike O. Heyne ◽  
Juha Karjalainen ◽  
Konrad Karczewski ◽  
Susanna Lemmela ◽  
Wei Zhou ◽  
...  

Identifying Mendelian diseases with recessive inheritance is challenging as the majority of cases are caused by compound heterozygous genotypes which require sequencing data in families to definitively identify. Bottleneck events, such as in the Finnish population, enrich specific homozygous variants to higher frequencies and thus facilitate identification of disease associations through easily recognized homozygous genotypes. Here, we study homozygous and heterozygous effects of 82,516 coding variants on 2,444 disease endpoints using nationwide electronic health record (EHR) data of 176,899 Finns. We find known and novel associations to homozygous genotypes across a broad spectrum of phenotypes such as retinal dystrophy, adult-onset cataract and female infertility (13/20 of which would have been missed by the traditional additive GWAS model). With these results, and supporting simulations, we demonstrate the added benefit of homozygous scans in GWAS. We further use these results to explore inheritance patterns of known Mendelian variants. We find many Mendelian variants whose inheritance cannot be adequately described with the traditional definition of dominant or recessive. In particular, we find disease risk in heterozygous carriers of variants known to cause disease with recessive inheritance, as well as for variants labeled benign in ClinVar. Our results demonstrate how biobank efforts, particularly in founder populations, can broaden our understanding of the impact of genetic variants.


2009 ◽  
Vol 23 (10) ◽  
pp. 1715-1715
Author(s):  
Pei-an Betty Shih ◽  
Lei Wang ◽  
Stephane Chiron ◽  
Gen Wen ◽  
Caroline Nievergelt ◽  
...  

ABSTRACT Rationale Obesity is a heritable trait that contributes to hypertension and subsequent cardiorenal disease risk; thus, the investigation of genetic variation that predisposes individuals to obesity is an important goal. Circulating peptide YY (PYY) is known for its appetite and energy expenditure-regulating properties; linkage and association studies have suggested that PYY genetic variation contributes to susceptibility for obesity, rendering PYY an attractive candidate for study of disease risk. Design To explore whether common genetic variation at the human PYY locus influences plasma PYY or metabolic traits, we systematically resequenced the gene for polymorphism discovery and then genotyped common single-nucleotide polymorphisms across the locus in an extensively phenotyped twin sample to determine associations. Finally, we experimentally validated the marker-on-trait associations using PYY 3′-untranslated region (UTR)/reporter and promoter/reporter analyses in neuroendocrine cells. Results Four common genetic variants were discovered across the locus, and three were typed in phenotyped twins. Plasma PYY was highly heritable (P < 0.0001), and genetic pleiotropy was noted between plasma PYY and body mass index (BMI) (P = 0.03). A PYY haplotype extending from the proximal promoter (A-23G, rs2070592) to the 3′-UTR (C+1134A, rs162431) predicted not only plasma PYY (P = 0.009) but also other metabolic syndrome traits. Functional studies with transfected luciferase reporters confirmed regulatory roles in altering gene expression for both 3′-UTR C+1134A (P < 0.001) and promoter A-23G (P = 0.0016). Conclusions Functional genetic variation at the PYY locus influences multiple heritable metabolic syndrome traits, likely conferring susceptibility to obesity and subsequent cardiorenal disease.


2017 ◽  
Author(s):  
Chun Jimmie Ye ◽  
Jenny Chen ◽  
Alexandra-Chloé Villani ◽  
Rachel E. Gate ◽  
Meena Subramaniam ◽  
...  

AbstractWhile the impact of common genetic variants on gene expression response to cellular stimuli has been analyzed in depth, less is known about how stimulation modulates the genetic control of isoform usage. Analyzing RNA-seq profiles of monocyte-derived dendritic cells from 243 individuals, we uncovered thousands of unannotated isoforms synthesized in response to viral infection and stimulation with type I interferon. We identified more than a thousand single nucleotide polymorphisms associated with isoform usage (isoQTLs), > 40% of which are independent of expression QTLs for the same gene. Compared to eQTLs, isoQTLs are enriched for splice sites and untranslated regions, and depleted of sequences upstream of annotated transcription start sites. Both eQTLs and isoQTLs in stimulated cells explain a significant proportion of the disease heritability attributed to common genetic variants. At the IRF7 locus, we found alternative promoter usage in response to influenza as a possible mechanism by which DNA variants previously associated with immune-related disorders mediate disease risk. At the ERAP2 locus, we shed light on the function of the major haplotype that has been maintained under long-term balancing selection. At baseline and following type 1 interferon stimulation, the major haplotype is associated with absence of ERAP2 expression while the minor haplotype, known to increase Crohn’s disease risk, is associated with high ERAP2 expression. Surprisingly, in response to influenza infection, the major haplotype results in the expression of two uncharacterized, alternatively transcribed, spliced and translated short isoforms. Thus, genetic variants at a single locus could modulate independent gene regulatory processes in the innate immune response, and in the case of ERAP2, may confer a historical fitness advantage in response to virus.


Sign in / Sign up

Export Citation Format

Share Document