scholarly journals Screening for functional transcriptional and splicing regulatory variants with GenIE

2020 ◽  
Vol 48 (22) ◽  
pp. e131-e131
Author(s):  
Sarah E Cooper ◽  
Jeremy Schwartzentruber ◽  
Erica Bello ◽  
Eve L Coomber ◽  
Andrew R Bassett

Abstract Genome-wide association studies (GWAS) have identified numerous genetic loci underlying human diseases, but a fundamental challenge remains to accurately identify the underlying causal genes and variants. Here, we describe an arrayed CRISPR screening method, Genome engineering-based Interrogation of Enhancers (GenIE), which assesses the effects of defined alleles on transcription or splicing when introduced in their endogenous genomic locations. We use this sensitive assay to validate the activity of transcriptional enhancers and splice regulatory elements in human induced pluripotent stem cells (hiPSCs), and develop a software package (rgenie) to analyse the data. We screen the 99% credible set of Alzheimer's disease (AD) GWAS variants identified at the clusterin (CLU) locus to identify a subset of likely causal variants, and employ GenIE to understand the impact of specific mutations on splicing efficiency. We thus establish GenIE as an efficient tool to rapidly screen for the role of transcribed variants on gene expression.

PLoS ONE ◽  
2021 ◽  
Vol 16 (9) ◽  
pp. e0257265
Author(s):  
Seung-Soo Kim ◽  
Adam D. Hudgins ◽  
Jiping Yang ◽  
Yizhou Zhu ◽  
Zhidong Tu ◽  
...  

Type 1 diabetes (T1D) is an organ-specific autoimmune disease, whereby immune cell-mediated killing leads to loss of the insulin-producing β cells in the pancreas. Genome-wide association studies (GWAS) have identified over 200 genetic variants associated with risk for T1D. The majority of the GWAS risk variants reside in the non-coding regions of the genome, suggesting that gene regulatory changes substantially contribute to T1D. However, identification of causal regulatory variants associated with T1D risk and their affected genes is challenging due to incomplete knowledge of non-coding regulatory elements and the cellular states and processes in which they function. Here, we performed a comprehensive integrated post-GWAS analysis of T1D to identify functional regulatory variants in enhancers and their cognate target genes. Starting with 1,817 candidate T1D SNPs defined from the GWAS catalog and LDlink databases, we conducted functional annotation analysis using genomic data from various public databases. These include 1) Roadmap Epigenomics, ENCODE, and RegulomeDB for epigenome data; 2) GTEx for tissue-specific gene expression and expression quantitative trait loci data; and 3) lncRNASNP2 for long non-coding RNA data. Our results indicated a prevalent enhancer-based immune dysregulation in T1D pathogenesis. We identified 26 high-probability causal enhancer SNPs associated with T1D, and 64 predicted target genes. The majority of the target genes play major roles in antigen presentation and immune response and are regulated through complex transcriptional regulatory circuits, including those in HLA (6p21) and non-HLA (16p11.2) loci. These candidate causal enhancer SNPs are supported by strong evidence and warrant functional follow-up studies.


PLoS Genetics ◽  
2020 ◽  
Vol 16 (12) ◽  
pp. e1009060
Author(s):  
Corbin Quick ◽  
Xiaoquan Wen ◽  
Gonçalo Abecasis ◽  
Michael Boehnke ◽  
Hyun Min Kang

Gene-based association tests aggregate genotypes across multiple variants for each gene, providing an interpretable gene-level analysis framework for genome-wide association studies (GWAS). Early gene-based test applications often focused on rare coding variants; a more recent wave of gene-based methods, e.g. TWAS, use eQTLs to interrogate regulatory associations. Regulatory variants are expected to be particularly valuable for gene-based analysis, since most GWAS associations to date are non-coding. However, identifying causal genes from regulatory associations remains challenging and contentious. Here, we present a statistical framework and computational tool to integrate heterogeneous annotations with GWAS summary statistics for gene-based analysis, applied with comprehensive coding and tissue-specific regulatory annotations. We compare power and accuracy identifying causal genes across single-annotation, omnibus, and annotation-agnostic gene-based tests in simulation studies and an analysis of 128 traits from the UK Biobank, and find that incorporating heterogeneous annotations in gene-based association analysis increases power and performance identifying causal genes.


2021 ◽  
Author(s):  
Bohan Zhu ◽  
Richard I Ainsworth ◽  
Zengmiao Wang ◽  
Salvador Sierra ◽  
Chengyu Deng ◽  
...  

Genome wide association studies have revealed >150 loci associated with schizophrenia risk, yet these genetic factors do not seem to be sufficient to fully explain the molecular determinants behind this psychiatric condition. Epigenetic marks such as post-translational histone modifications remain largely plastic during development and adulthood, allowing a dynamic impact of environmental factors, including antipsychotic medications, on access to genes and regulatory elements. However, no study so far has profiled cell-specific genome-wide histone modifications in postmortem brain samples from schizophrenia subjects or the effect of antipsychotic treatment on such epigenetic marks. Here we show the first comprehensive epigenomic characterization of the frontal cortex of 29 individuals with schizophrenia and 29 matched controls, including histone modifications associated with active promoters and enhancers H3K4me3 and H3K27ac along with RNA expression in neuronal and glial nuclei. Schizophrenia subjects exhibited thousands of cell type-specific epigenetic differences at regions that included several susceptibility genetic loci, such as NRG1, RGS4 and HTR2A. Comparing untreated and treated schizophrenia subjects with controls, our findings provide entirely new insights into differentially modified genes associated with unexpected pathways that are potential markers of antipsychotic treatment. Additionally, we show that the effect of age on the epigenomic landscape is more pronounced in frontal cortex samples of antipsychotic-treated schizophrenia subjects. Together, our data provide important evidence of epigenetic alterations in the frontal cortex of individuals with schizophrenia, and remark the impact of age and antipsychotic treatment on chromatin organization.


2019 ◽  
Author(s):  
Corbin Quick ◽  
Xiaoquan Wen ◽  
Gonçalo Abecasis ◽  
Michael Boehnke ◽  
Hyun Min Kang

AbstractGene-based association tests aggregate genotypes across multiple variants for each gene, providing an interpretable gene-level analysis framework for genome-wide association studies (GWAS). Early gene-based test applications often focused on rare coding variants; a more recent wave of gene-based methods, e.g. TWAS, use eQTLs to interrogate regulatory associations. Regulatory variants are expected to be particularly valuable for gene-based analysis, since most GWAS associations to date are non-coding. However, identifying causal genes from regulatory associations remains challenging and contentious. Here, we present a statistical framework and computational tool to integrate heterogeneous annotations with GWAS summary statistics for gene-based analysis, applied with comprehensive coding and tissue-specific regulatory annotations. We compare power and accuracy identifying causal genes across single-annotation, omnibus, and annotation-agnostic gene-based tests in simulation studies and an analysis of 128 traits from the UK Biobank, and find that incorporating heterogeneous annotations in gene-based association analysis increases power and performance identifying causal genes.


2018 ◽  
Author(s):  
Nicholas Mancuso ◽  
Simon Gayther ◽  
Alexander Gusev ◽  
Wei Zheng ◽  
Kathryn L. Penney ◽  
...  

AbstractAlthough genome-wide association studies (GWAS) for prostate cancer (PrCa) have identified more than 100 risk regions, most of the risk genes at these regions remain largely unknown. Here, we integrate the largest PrCa GWAS (N=142,392) with gene expression measured in 45 tissues (N=4,458), including normal and tumor prostate, to perform a multi-tissue transcriptomewide association study (TWAS) for PrCa. We identify 235 genes at 87 independent 1Mb regions associated with PrCa risk, 9 of which are regions with no genome-wide significant SNP within 2Mb. 24 genes are significant in TWAS only for alternative splicing models in prostate tumor thus supporting the hypothesis of splicing driving risk for continued oncogenesis. Finally, we use a Bayesian probabilistic approach to estimate credible sets of genes containing the causal gene at pre-defined level; this reduced the list of 235 associations to 120 genes in the 90% credible set. Overall, our findings highlight the power of integrating expression with PrCa GWAS to identify novel risk loci and prioritize putative causal genes at known risk loci.


2021 ◽  
Vol 118 (47) ◽  
pp. e2105191118
Author(s):  
Shiyang Ma ◽  
James Dalgleish ◽  
Justin Lee ◽  
Chen Wang ◽  
Linxi Liu ◽  
...  

Gene-based tests are valuable techniques for identifying genetic factors in complex traits. Here, we propose a gene-based testing framework that incorporates data on long-range chromatin interactions, several recent technical advances for region-based tests, and leverages the knockoff framework for synthetic genotype generation for improved gene discovery. Through simulations and applications to genome-wide association studies (GWAS) and whole-genome sequencing data for multiple diseases and traits, we show that the proposed test increases the power over state-of-the-art gene-based tests in the literature, identifies genes that replicate in larger studies, and can provide a more narrow focus on the possible causal genes at a locus by reducing the confounding effect of linkage disequilibrium. Furthermore, our results show that incorporating genetic variation in distal regulatory elements tends to improve power over conventional tests. Results for UK Biobank and BioBank Japan traits are also available in a publicly accessible database that allows researchers to query gene-based results in an easy fashion.


2020 ◽  
Vol 36 (9) ◽  
pp. 2936-2937 ◽  
Author(s):  
Gareth Peat ◽  
William Jones ◽  
Michael Nuhn ◽  
José Carlos Marugán ◽  
William Newell ◽  
...  

Abstract Motivation Genome-wide association studies (GWAS) are a powerful method to detect even weak associations between variants and phenotypes; however, many of the identified associated variants are in non-coding regions, and presumably influence gene expression regulation. Identifying potential drug targets, i.e. causal protein-coding genes, therefore, requires crossing the genetics results with functional data. Results We present a novel data integration pipeline that analyses GWAS results in the light of experimental epigenetic and cis-regulatory datasets, such as ChIP-Seq, Promoter-Capture Hi-C or eQTL, and presents them in a single report, which can be used for inferring likely causal genes. This pipeline was then fed into an interactive data resource. Availability and implementation The analysis code is available at www.github.com/Ensembl/postgap and the interactive data browser at postgwas.opentargets.io.


Metabolites ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 513
Author(s):  
Grace H. Yang ◽  
Danielle A. Fontaine ◽  
Sukanya Lodh ◽  
Joseph T. Blumer ◽  
Avtar Roopra ◽  
...  

Transcription factor 19 (TCF19) is a gene associated with type 1 diabetes (T1DM) and type 2 diabetes (T2DM) in genome-wide association studies. Prior studies have demonstrated that Tcf19 knockdown impairs β-cell proliferation and increases apoptosis. However, little is known about its role in diabetes pathogenesis or the effects of TCF19 gain-of-function. The aim of this study was to examine the impact of TCF19 overexpression in INS-1 β-cells and human islets on proliferation and gene expression. With TCF19 overexpression, there was an increase in nucleotide incorporation without any change in cell cycle gene expression, alluding to an alternate process of nucleotide incorporation. Analysis of RNA-seq of TCF19 overexpressing cells revealed increased expression of several DNA damage response (DDR) genes, as well as a tightly linked set of genes involved in viral responses, immune system processes, and inflammation. This connectivity between DNA damage and inflammatory gene expression has not been well studied in the β-cell and suggests a novel role for TCF19 in regulating these pathways. Future studies determining how TCF19 may modulate these pathways can provide potential targets for improving β-cell survival.


Genes ◽  
2021 ◽  
Vol 12 (8) ◽  
pp. 1175
Author(s):  
Amarni L. Thomas ◽  
Judith Marsman ◽  
Jisha Antony ◽  
William Schierding ◽  
Justin M. O’Sullivan ◽  
...  

The RUNX1/AML1 gene encodes a developmental transcription factor that is an important regulator of haematopoiesis in vertebrates. Genetic disruptions to the RUNX1 gene are frequently associated with acute myeloid leukaemia. Gene regulatory elements (REs), such as enhancers located in non-coding DNA, are likely to be important for Runx1 transcription. Non-coding elements that modulate Runx1 expression have been investigated over several decades, but how and when these REs function remains poorly understood. Here we used bioinformatic methods and functional data to characterise the regulatory landscape of vertebrate Runx1. We identified REs that are conserved between human and mouse, many of which produce enhancer RNAs in diverse tissues. Genome-wide association studies detected single nucleotide polymorphisms in REs, some of which correlate with gene expression quantitative trait loci in tissues in which the RE is active. Our analyses also suggest that REs can be variant in haematological malignancies. In summary, our analysis identifies features of the RUNX1 regulatory landscape that are likely to be important for the regulation of this gene in normal and malignant haematopoiesis.


2021 ◽  
Author(s):  
Robin N Beaumont ◽  
Isabelle K Mayne ◽  
Rachel M Freathy ◽  
Caroline F Wright

Abstract Birth weight is an important factor in newborn survival; both low and high birth weights are associated with adverse later-life health outcomes. Genome-wide association studies (GWAS) have identified 190 loci associated with maternal or fetal effects on birth weight. Knowledge of the underlying causal genes is crucial to understand how these loci influence birth weight and the links between infant and adult morbidity. Numerous monogenic developmental syndromes are associated with birth weights at the extreme ends of the distribution. Genes implicated in those syndromes may provide valuable information to prioritize candidate genes at the GWAS loci. We examined the proximity of genes implicated in developmental disorders (DDs) to birth weight GWAS loci using simulations to test whether they fall disproportionately close to the GWAS loci. We found birth weight GWAS single nucleotide polymorphisms (SNPs) fall closer to such genes than expected both when the DD gene is the nearest gene to the birth weight SNP and also when examining all genes within 258 kb of the SNP. This enrichment was driven by genes causing monogenic DDs with dominant modes of inheritance. We found examples of SNPs in the intron of one gene marking plausible effects via different nearby genes, highlighting the closest gene to the SNP not necessarily being the functionally relevant gene. This is the first application of this approach to birth weight, which has helped identify GWAS loci likely to have direct fetal effects on birth weight, which could not previously be classified as fetal or maternal owing to insufficient statistical power.


Sign in / Sign up

Export Citation Format

Share Document