scholarly journals The impact of structural variation on human gene expression

2016 ◽  
Author(s):  
Colby Chiang ◽  
Alexandra J. Scott ◽  
Joe R. Davis ◽  
Emily K. Tsang ◽  
Xin Li ◽  
...  

AbstractStructural variants (SVs) are an important source of human genetic diversity but their contribution to traits, disease, and gene regulation remains unclear. The Genotype-Tissue Expression (GTEx) project presents an unprecedented opportunity to address this question due to the availability of deep whole genome sequencing (WGS) and multi-tissue RNA-seq data from 147 individuals. We used comprehensive methods to identify 24,157 high confidence SVs, and mapped cis expression quantitative trait loci (eQTLs) in 13 tissues via joint analysis of SVs, single nucleotide (SNV) and short insertion/deletion (indel) variants. We identified 24,801 eQTLs affecting the expression of 10,101 distinct genes. Based on haplotype structure and heritability partitioning, we estimate that SVs are the causal variant at 3.3-7.0% of eQTLs, which is nearly an order of magnitude higher than prior estimates from low coverage WGS and represents a 26- to 54-fold enrichment relative to their scarcity in the genome. Expression-altering SVs also have significantly larger effect sizes than SNVs and indels. We identified 787 putatively causal SVs predicted to directly alter gene expression, most of which (88.3%) are noncoding variants that show significant enrichment at enhancers and other regulatory elements. By evaluating linkage disequilibrium between SVs, SNVs and indels, we nominate 49 SVs as plausible causal variants at published genome-wide association study (GWAS) loci. Remarkably, 29.9% of the common SV-eQTLs are not well tagged by flanking SNVs, and we observe a notable abundance (relative to SNVs and indels) of rare, high impact SVs associated with aberrant expression of nearby genes. These results suggest that comprehensive WGS-based SV analyses will increase the power of both common and rare variant association studies.

Science ◽  
2020 ◽  
Vol 369 (6509) ◽  
pp. eaba3066 ◽  
Author(s):  
Meritxell Oliva ◽  
Manuel Muñoz-Aguirre ◽  
Sarah Kim-Hellmuth ◽  
Valentin Wucher ◽  
Ariel D. H. Gewirtz ◽  
...  

Many complex human phenotypes exhibit sex-differentiated characteristics. However, the molecular mechanisms underlying these differences remain largely unknown. We generated a catalog of sex differences in gene expression and in the genetic regulation of gene expression across 44 human tissue sources surveyed by the Genotype-Tissue Expression project (GTEx, v8 release). We demonstrate that sex influences gene expression levels and cellular composition of tissue samples across the human body. A total of 37% of all genes exhibit sex-biased expression in at least one tissue. We identify cis expression quantitative trait loci (eQTLs) with sex-differentiated effects and characterize their cellular origin. By integrating sex-biased eQTLs with genome-wide association study data, we identify 58 gene-trait associations that are driven by genetic regulation of gene expression in a single sex. These findings provide an extensive characterization of sex differences in the human transcriptome and its genetic regulation.


2020 ◽  
Vol 117 (48) ◽  
pp. 30639-30648
Author(s):  
Dan Hu ◽  
Emily C. Tjon ◽  
Karin M. Andersson ◽  
Gabriela M. Molica ◽  
Minh C. Pham ◽  
...  

IL-17–producing Th17 cells are implicated in the pathogenesis of rheumatoid arthritis (RA) and TNF-α, a proinflammatory cytokine in the rheumatoid joint, facilitates Th17 differentiation. Anti-TNF therapy ameliorates disease in many patients with rheumatoid arthritis (RA). However, a significant proportion of patients do not respond to this therapy. The impact of anti-TNF therapy on Th17 responses in RA is not well understood. We conducted high-throughput gene expression analysis of Th17-enriched CCR6+CXCR3−CD45RA−CD4+T (CCR6+T) cells isolated from anti-TNF–treated RA patients classified as responders or nonresponders to therapy. CCR6+T cells from responders and nonresponders had distinct gene expression profiles. Proinflammatory signaling was elevated in the CCR6+T cells of nonresponders, and pathogenic Th17 signature genes were up-regulated in these cells. Gene set enrichment analysis on these signature genes identified transcription factor USF2 as their upstream regulator, which was also increased in nonresponders. Importantly, short hairpin RNA targetingUSF2in pathogenic Th17 cells led to reduced expression of proinflammatory cytokines IL-17A, IFN-γ, IL-22, and granulocyte-macrophage colony-stimulating factor (GM-CSF) as well as transcription factor T-bet. Together, our results revealed inadequate suppression of Th17 responses by anti-TNF in nonresponders, and direct targeting of the USF2-signaling pathway may be a potential therapeutic approach in the anti-TNF refractory RA.


Metabolites ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 513
Author(s):  
Grace H. Yang ◽  
Danielle A. Fontaine ◽  
Sukanya Lodh ◽  
Joseph T. Blumer ◽  
Avtar Roopra ◽  
...  

Transcription factor 19 (TCF19) is a gene associated with type 1 diabetes (T1DM) and type 2 diabetes (T2DM) in genome-wide association studies. Prior studies have demonstrated that Tcf19 knockdown impairs β-cell proliferation and increases apoptosis. However, little is known about its role in diabetes pathogenesis or the effects of TCF19 gain-of-function. The aim of this study was to examine the impact of TCF19 overexpression in INS-1 β-cells and human islets on proliferation and gene expression. With TCF19 overexpression, there was an increase in nucleotide incorporation without any change in cell cycle gene expression, alluding to an alternate process of nucleotide incorporation. Analysis of RNA-seq of TCF19 overexpressing cells revealed increased expression of several DNA damage response (DDR) genes, as well as a tightly linked set of genes involved in viral responses, immune system processes, and inflammation. This connectivity between DNA damage and inflammatory gene expression has not been well studied in the β-cell and suggests a novel role for TCF19 in regulating these pathways. Future studies determining how TCF19 may modulate these pathways can provide potential targets for improving β-cell survival.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Jamie W. Robinson ◽  
Richard M. Martin ◽  
Spiridon Tsavachidis ◽  
Amy E. Howell ◽  
Caroline L. Relton ◽  
...  

AbstractGenome-wide association studies (GWAS) have discovered 27 loci associated with glioma risk. Whether these loci are causally implicated in glioma risk, and how risk differs across tissues, has yet to be systematically explored. We integrated multi-tissue expression quantitative trait loci (eQTLs) and glioma GWAS data using a combined Mendelian randomisation (MR) and colocalisation approach. We investigated how genetically predicted gene expression affects risk across tissue type (brain, estimated effective n = 1194 and whole blood, n = 31,684) and glioma subtype (all glioma (7400 cases, 8257 controls) glioblastoma (GBM, 3112 cases) and non-GBM gliomas (2411 cases)). We also leveraged tissue-specific eQTLs collected from 13 brain tissues (n = 114 to 209). The MR and colocalisation results suggested that genetically predicted increased gene expression of 12 genes were associated with glioma, GBM and/or non-GBM risk, three of which are novel glioma susceptibility genes (RETREG2/FAM134A, FAM178B and MVB12B/FAM125B). The effect of gene expression appears to be relatively consistent across glioma subtype diagnoses. Examining how risk differed across 13 brain tissues highlighted five candidate tissues (cerebellum, cortex, and the putamen, nucleus accumbens and caudate basal ganglia) and four previously implicated genes (JAK1, STMN3, PICK1 and EGFR). These analyses identified robust causal evidence for 12 genes and glioma risk, three of which are novel. The correlation of MR estimates in brain and blood are consistently low which suggested that tissue specificity needs to be carefully considered for glioma. Our results have implicated genes yet to be associated with glioma susceptibility and provided insight into putatively causal pathways for glioma risk.


2021 ◽  
Author(s):  
Milton Pividori ◽  
Sumei Lu ◽  
Binglan Li ◽  
Chun Su ◽  
Matthew E. Johnson ◽  
...  

Understanding how dysregulated transcriptional processes result in tissue-specific pathology requires a mechanistic interpretation of expression regulation across different cell types. It has been shown that this insight is key for the development of new therapies. These mechanisms can be identified with transcriptome-wide association studies (TWAS), which have represented an important step forward to test the mediating role of gene expression in GWAS associations. However, due to pervasive eQTL sharing across tissues, TWAS has not been successful in identifying causal tissues, and other methods generally do not take advantage of the large amounts of RNA-seq data publicly available. Here we introduce a polygenic approach that leverages gene modules (genes with similar co-expression patterns) to project both gene-trait associations and pharmacological perturbation data into a common latent representation for a joint analysis. We observed that diseases were significantly associated with gene modules expressed in relevant cell types, such as hypothyroidism with T cells and thyroid, hypertension and lipids with adipose tissue, and coronary artery disease with cardiomyocytes. Our approach was more accurate in predicting known drug-disease pairs and revealed stable trait clusters, including a complex branch involving lipids with cardiovascular, autoimmune, and neuropsychiatric disorders. Furthermore, using a CRISPR-screen, we show that genes involved in lipid regulation exhibit more consistent trait associations through gene modules than individual genes. Our results suggest that a gene module perspective can contextualize genetic associations and prioritize alternative treatment targets when GWAS hits are not druggable.


2019 ◽  
Vol 28 (17) ◽  
pp. 2976-2986 ◽  
Author(s):  
Irfahan Kassam ◽  
Yang Wu ◽  
Jian Yang ◽  
Peter M Visscher ◽  
Allan F McRae

Abstract Despite extensive sex differences in human complex traits and disease, the male and female genomes differ only in the sex chromosomes. This implies that most sex-differentiated traits are the result of differences in the expression of genes that are common to both sexes. While sex differences in gene expression have been observed in a range of different tissues, the biological mechanisms for tissue-specific sex differences (TSSDs) in gene expression are not well understood. A total of 30 640 autosomal and 1021 X-linked transcripts were tested for heterogeneity in sex difference effect sizes in n = 617 individuals across 40 tissue types in Genotype–Tissue Expression (GTEx). This identified 65 autosomal and 66 X-linked TSSD transcripts (corresponding to unique genes) at a stringent significance threshold. Results for X-linked TSSD transcripts showed mainly concordant direction of sex differences across tissues and replicate previous findings. Autosomal TSSD transcripts had mainly discordant direction of sex differences across tissues. The top cis-expression quantitative trait loci (eQTLs) across tissues for autosomal TSSD transcripts are located a similar distance away from the nearest androgen and estrogen binding motifs and the nearest enhancer, as compared to cis-eQTLs for transcripts with stable sex differences in gene expression across tissue types. Enhancer regions that overlap top cis-eQTLs for TSSD transcripts, however, were found to be more dispersed across tissues. These observations suggest that androgen and estrogen regulatory elements in a cis region may play a common role in sex differences in gene expression, but TSSD in gene expression may additionally be due to causal variants located in tissue-specific enhancer regions.


2020 ◽  
Vol 25 (6) ◽  
pp. 568-580
Author(s):  
Natali Papanicolaou ◽  
Alessandro Bonetti

Common diseases are complex, multifactorial disorders whose pathogenesis is influenced by the interplay of genetic predisposition and environmental factors. Genome-wide association studies have interrogated genetic polymorphisms across genomes of individuals to test associations between genotype and susceptibility to specific disorders, providing insights into the genetic architecture of several complex disorders. However, genetic variants associated with the susceptibility to common diseases are often located in noncoding regions of the genome, such as tissue-specific enhancers or long noncoding RNAs, suggesting that regulatory elements might play a relevant role in human diseases. Enhancers are cis-regulatory genomic sequences that act in concert with promoters to regulate gene expression in a precise spatiotemporal manner. They can be located at a considerable distance from their cognate target promoters, increasing the difficulty of their identification. Genomes are organized in domains of chromatin folding, namely topologically associating domains (TADs). Identification of enhancer–promoter interactions within TADs has revealed principles of cell-type specificity across several organisms and tissues. The vast majority of mammalian genomes are pervasively transcribed, accounting for a previously unappreciated complexity of the noncoding RNA fraction. Particularly, long noncoding RNAs have emerged as key players for the establishment of chromatin architecture and regulation of gene expression. In this perspective, we describe the new advances in the fields of transcriptomics and genome organization, focusing on the role of noncoding genomic variants in the predisposition of common diseases. Finally, we propose a new framework for the identification of the next generation of pharmacological targets for common human diseases.


2020 ◽  
Author(s):  
Helian Feng ◽  
Nicholas Mancuso ◽  
Alexander Gusev ◽  
Arunabha Majumdar ◽  
Megan Major ◽  
...  

AbstractTranscriptome-wide association studies (TWAS) test the association between traits and genetically predicted gene expression levels. The power of a TWAS depends in part on the strength of the correlation between a genetic predictor of gene expression and the causally relevant gene expression values. Consequently, TWAS power can be low when expression quantitative trait locus (eQTL) data used to train the genetic predictors have small sample sizes, or when data from causally relevant tissues are not available. Here, we propose to address these issues by integrating multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA). We show that sCCA-TWAS combined with single-tissue TWAS using an aggregate Cauchy association test (ACAT) outperforms traditional single-tissue TWAS. In empirically motivated simulations, the sCCA+ACAT approach yielded the highest power to detect a gene associated with phenotype, even when expression in the causal tissue was not directly measured, while controlling the Type I error when there is no association between gene expression and phenotype. For example, when gene expression explains 2% of the variability in outcome, and the GWAS sample size is 20,000, the average power difference between the ACAT combined test of sCCA features and single-tissue, versus single-tissue combined with Generalized Berk-Jones (GBJ) method, single-tissue combined with S-MultiXcan or summarizing cross-tissue expression patterns using Principal Component Analysis (PCA) approaches was 5%, 8%, and 38%, respectively. The gain in power is likely due to sCCA cross-tissue features being more likely to be detectably heritable. When applied to publicly available summary statistics from 10 complex traits, the sCCA+ACAT test was able to increase the number of testable genes and identify on average an additional 400 additional gene-trait associations that single-trait TWAS missed. Our results suggest that aggregating eQTL data across multiple tissues using sCCA can improve the sensitivity of TWAS while controlling for the false positive rate.Author summaryTranscriptome-wide association studies (TWAS) can improve the statistical power of genetic association studies by leveraging the relationship between genetically predicted transcript expression levels and an outcome. We propose a new TWAS pipeline that integrates data on the genetic regulation of expression levels across multiple tissues. We generate cross-tissue expression features using sparse canonical correlation analysis and then combine evidence for expression-outcome association across cross- and single-tissue features using the aggregate Cauchy association test. We show that this approach has substantially higher power than traditional single-tissue TWAS methods. Application of these methods to publicly available summary statistics for ten complex traits also identifies associations missed by single-tissue methods.


2019 ◽  
Author(s):  
Martin Silvert ◽  
Lluis Quintana-Murci ◽  
Maxime Rotival

AbstractArchaic admixture is increasingly recognized as an important source of diversity in modern humans, with Neanderthal haplotypes covering 1-3% of the genome of present-day Eurasians. Recent work has shown that archaic introgression has contributed to human phenotypic diversity, mostly through the regulation of gene expression. Yet, the mechanisms through which archaic variants alter gene expression, and the forces driving the introgression landscape at regulatory regions remain elusive. Here, we explored the impact of archaic introgression on transcriptional and post-transcriptional regulation, focusing on promoters and enhancers across 127 different tissues as well as microRNA-mediated regulation. Although miRNAs themselves harbor few archaic variants, we found that some of these variants may have a strong impact on miRNA-mediated gene regulation. Enhancers were by far the regulatory elements most affected by archaic introgression, with one third of the tissues tested presenting significant enrichments. Specifically, we found strong enrichments of archaic variants in adipose-related tissues and primary T cells, even after accounting for various genomic and evolutionary confounders such as recombination rate and background selection. Interestingly, we identified signatures of adaptive introgression at enhancers of some key regulators of adipogenesis, raising the interesting hypothesis of a possible adaptation of early Eurasians to colder climates. Collectively, this study sheds new light onto the mechanisms through which archaic admixture have impacted gene regulation in Eurasians and, more generally, increases our understanding of the contribution of Neanderthals to the regulation of acquired immunity and adipose homeostasis in modern humans.


2021 ◽  
Vol 12 ◽  
Author(s):  
Chi-Lam Poon ◽  
Cho-Yi Chen

BackgroundThe development of complex diseases is contributed by the combination of multiple factors and complicated interactions between them. Inflammation has recently been associated with many complex diseases and may cause long-term damage to the human body. In this study, we examined whether two types of complex disease, cerebrovascular disease (CVD) or major depression (MD), systematically altered the transcriptomes of non-diseased human tissues and whether inflammation is linked to identifiable molecular signatures, using post-mortem samples from the Genotype-Tissue Expression (GTEx) project.ResultsFollowing a series of differential expression analyses, dozens to hundreds of differentially expressed genes (DEGs) were identified in multiple tissues between subjects with and without a history of CVD or MD. DEGs from these disease-associated tissues—the visceral adipose, tibial artery, caudate, and spinal cord for CVD; and the hypothalamus, putamen, and spinal cord for MD—were further analyzed for functional enrichment. Many pathways associated with immunological events were enriched in the upregulated DEGs of the CVD-associated tissues, as were the neurological and metabolic pathways in DEGs of the MD-associated tissues. Eight gene-tissue pairs were found to overlap with those prioritized by our transcriptome-wide association studies, indicating a potential genetic effect on gene expression for circulating cytokine phenotypes.ConclusionCerebrovascular disease and major depression cause detectable changes in the gene expression of non-diseased tissues, suggesting that a possible long-term impact of diseases, lifestyles and environmental factors may together contribute to the appearance of “transcriptomic scars” on the human body. Furthermore, inflammation is probably one of the systemic and long-lasting effects of cerebrovascular events.


Sign in / Sign up

Export Citation Format

Share Document