scholarly journals MARGE: Mutation Analysis for Regulatory Genomic Elements

2018 ◽  
Author(s):  
Verena M. Link ◽  
Casey E. Romanoski ◽  
Dirk Metzler ◽  
Christopher K. Glass

AbstractCell-specific patterns of gene expression are determined by combinatorial actions of sequence-specific transcription factors at cis-regulatory elements. Studies indicate that relatively simple combinations of lineage-determining transcription factors (LDTFs) play dominant roles in the selection of enhancers that establish cell identities and functions. LDTFs require collaborative interactions with additional transcription factors to mediate enhancer function, but the identities of these factors are often unknown. We have shown that natural genetic variation between individuals has great utility for discovering collaborative transcription factors. Here, we introduce MARGE (Mutation Analysis of Regulatory Genomic Elements), the first publicly available suite of software tools that integrates genome-wide genetic variation with epigenetic data to identify collaborative transcription factor pairs. MARGE is optimized to work with chromatin accessibility assays (such as ATAC-seq or DNase I hypersensitivity), as well as transcription factor binding data collected by ChlP-seq. Herein, we provide investigators with rationale for each step in the MARGE pipeline and key differences for analysis of datasets with different experimental designs. We demonstrate the utility of MARGE using mouse peritoneal macrophages, liver cells, and human lymphoblastoid cells. MARGE provides a powerful tool to identify combinations of cell type-specific transcription factors while simultaneously interpreting functional effects of non-coding genetic variation.

Blood ◽  
2016 ◽  
Vol 128 (22) ◽  
pp. 3879-3879
Author(s):  
Vivek Behera ◽  
Perry Evans ◽  
Carolyne J Face ◽  
Laavanya Sankaranarayanan ◽  
Gerd A. Blobel

Abstract Erythroid transcription factors (TFs) control gene expression programs, lineage decisions, and disease outcomes. How transcription factors contact DNA has been studied extensively in vitro, but in vivo binding characteristics are less well understood as they are influenced in a reciprocal manner by chromatin accessibility and neighboring transcription factors. Here, we present a comparative analysis approach that takes advantage of non-coding sequence variation between functionally equivalent erythroid cell lines to conduct an in-depth analysis of erythroid TF binding profiles and chromatin features. Specifically, we analyzed ChIP-seq datasets to identify millions of genetic non-coding variants between the mouse erythroleukemia cell line (MEL), a GATA1-inducible erythroid progenitor cell line (G1E-ER4), and primary murine erythroblast cells. We found that while these cell lines are highly positively correlated in chromatin features, larger differences in TF binding intensity are correlated with higher degrees of genetic variation between cell lines. We next examined discriminatory genetic variants between the cell lines that are located in ChIP-seq peaks of the erythroid transcription factor GATA1. Hundreds of such variants fall within GATA1 motifs. Differential GATA1 binding intensities associated with the variants revealed nucleotide positions that contribute most to in vivo GATA1 chromatin occupancy and identified which alternative nucleotides are most likely to disrupt binding. Notably, this additional information about GATA1's in vivo nucleotide binding preferences improved prediction of GATA1 binding sites genome-wide. We applied similar approaches to determine the bp-resolution in vivo binding preferences of TAL1/SCL and CTCF. We additionally identified thousands of discriminatory genetic variants within GATA1 sites that fall outside canonical GATA elements but within binding sites of other known TFs. Association of these variants with differential GATA1 binding intensities revealed that the hematopoietic transcription factors TAL1/SCL and KLF1 positively regulate GATA1 chromatin occupancy. Strikingly, we identified a number of motifs not previously implicated in cooperating with GATA1 that positively impact GATA1 chromatin binding. Notably, we also defined motifs associated with negative regulation of GATA1 chromatin occupancy. Applying a similar analysis to TAL1/SCL and CTCF revealed additional motifs involved in regulating the chromatin occupancy of these TFs. Finally, we associated discriminatory genetic variation between erythroid cell lines with large changes in sub-kb-scale DNase hypersensitivity. We found that single base pair substitutions within or near a number of erythroid TF motifs, including that for the RUNX family of nuclear factors, are strongly associated with changes in chromatin accessibility. Our findings use novel methods in comparative ChIP-seq and DNase-seq analysis to reveal new insights about the genetic basis for erythroid TF chromatin occupancy and chromatin accessibility. Disclosures No relevant conflicts of interest to declare.


2020 ◽  
Author(s):  
Swann Floc’hlay ◽  
Emily Wong ◽  
Bingqing Zhao ◽  
Rebecca R. Viales ◽  
Morgane Thomas-Chollier ◽  
...  

AbstractPrecise patterns of gene expression are driven by interactions between transcription factors, regulatory DNA sequence, and chromatin. How DNA mutations affecting any one of these regulatory ‘layers’ is buffered or propagated to gene expression remains unclear. To address this, we quantified allele-specific changes in chromatin accessibility, histone modifications, and gene expression in F1 embryos generated from eight Drosophila crosses, at three embryonic stages, yielding a comprehensive dataset of 240 samples spanning multiple regulatory layers. Genetic variation in cis-regulatory elements is common, highly heritable, and surprisingly consistent in its effects across embryonic stages. Much of this variation does not propagate to gene expression. When it does, it acts through H3K4me3 or alternatively through chromatin accessibility and H3K27ac. The magnitude and evolutionary impact of mutations is influenced by a genes’ regulatory complexity (i.e. enhancer number), with transcription factors being most robust to cis-acting, and most influenced by trans-acting, variation. Overall, the impact of genetic variation on regulatory phenotypes appears context-dependent even within the constraints of embryogenesis.


2019 ◽  
Author(s):  
Gregory R. Keele ◽  
Bryan C. Quach ◽  
Jennifer W. Israel ◽  
Grace A. Chappell ◽  
Lauren Lewis ◽  
...  

AbstractGene transcription profiles across tissues are largely defined by the activity of regulatory elements, most of which correspond to regions of accessible chromatin. Regulatory element activity is in turn modulated by genetic variation, resulting in variable transcription rates across individuals. The interplay of these factors, however, is poorly understood. Here we characterize expression and chromatin state dynamics across three tissues—liver, lung, and kidney—in 47 strains of the Collaborative Cross (CC) mouse population, examining the regulation of these dynamics by expression quantitative trait loci (eQTL) and chromatin QTL (cQTL). QTL whose allelic effects were consistent across tissues were detected for 1,101 genes and 133 chromatin regions. Also detected were eQTL and cQTL whose allelic effects differed across tissues, including local-eQTL for Pik3c2g detected in all three tissues but with distinct allelic effects. Leveraging overlapping measurements of gene expression and chromatin accessibility on the same mice from multiple tissues, we used mediation analysis to identify chromatin and gene expression intermediates of eQTL effects. Based on QTL and mediation analyses over multiple tissues, we propose a causal model for the distal genetic regulation of Akr1e1, a gene involved in glycogen metabolism, through the zinc finger transcription factor Zfp985 and chromatin intermediates. This analysis demonstrates the complexity of transcriptional and chromatin dynamics and their regulation over multiple tissues, as well as the value of the CC and related genetic resource populations for identifying specific regulatory mechanisms within cells and tissues.Author summaryGenetic variation can drive alterations in gene expression levels and chromatin accessibility, the latter of which defines gene regulatory elements genome-wide. The same genetic variants may associate with both molecular events, and these may be connected within the same causal path: a variant that reduces promoter region chromatin accessibility, potentially by affecting transcription factor binding, may lead to reduced expression of that gene. Moreover, these causal regulatory paths can differ between tissues depending on functions and cellular activity specific to each tissue. We identify cross-tissue and tissue-selective genetic regulators of gene expression and chromatin accessibility in liver, lung, and kidney tissues using a panel of genetically diverse inbred mouse strains. Further, we identify a number of candidate causal mediators of the genetic regulation of gene expression, including a zinc finger protein that helps silence the Akr1e1 gene. Our analyses are consistent with chromatin accessibility playing a role in the regulation of transcription. Our study demonstrates the power of genetically diverse, multi-parental mouse populations, such as the Collaborative Cross, for large-scale studies of genetic drivers of gene regulation that underlie complex phenotypes, as well as identifying causal intermediates that drive variable activity of specific genes and pathways.


2021 ◽  
Author(s):  
Vasiliki Theodorou ◽  
Aikaterini Stefanaki ◽  
Minas Drakos ◽  
Dafne Triantafyllou ◽  
Christos Delidakis

Background: ASC/ASCL proneural transcription factors are oncogenic and exhibit impressive reprogramming and pioneer activities. In both Drosophila and mammals, these factors are central in the early specification of the neural fate, where they act in opposition to Notch signalling. However, the role of ASC on the chromatin during CNS neural stem cells birth remains elusive. Results: We investigated the chromatin changes accompanying neural commitment using an integrative genetics and genomics methodology. We found that ASC factors bind equally strongly to two distinct classes of cis-regulatory elements: open regions remodeled earlier during maternal to zygotic transition by Zelda and Zelda-independent, less accessible regions. Both classes cis-elements exhibit enhanced chromatin accessibility during neural specification and correlate with transcriptional regulation of genes involved in many biological processes necessary for neuroblast function. We identified an ASC-Notch regulated TF network that most likely act as the prime regulators of neuroblast function. Using a cohort of ASC target genes, we report that ASC null neuroblasts are defectively specified, remaining initially stalled, lacking expression of many proneural targets and unable to divide. When they eventually start proliferating, they produce compromised progeny. Generation of lacZ reporter lines driven by proneural-bound elements display enhancer activity within neuroblasts and proneural dependency. Therefore, the partial neuroblast identity seen in the absence of ASC genes is driven by other, proneural-independent, cis-elements. Neuroblast impairment and the late differentiation defects of ASC mutants are corrected by ectodermal induction of individual ASC genes but not by individual members of the TF network downstream of ASC. However, in wild type embryos induction of individual members of this network induces CNS hyperplasia, suggesting that they synergize with the activating function of ASC to establish the chromatin dynamics that promote neural specification. Conclusion: ASC factors bind a large number of enhancers to orchestrate the timely activation of the neural chromatin program during neuroectodermal to neuroblast transition. This early chromatin remodeling is crucial for both neuroblast homeostasis as well as future progeny fidelity.


2020 ◽  
Vol 223 (14) ◽  
pp. jeb221622
Author(s):  
Sarah M. Ryan ◽  
Kaitie Wildman ◽  
Briseida Oceguera-Perez ◽  
Scott Barbee ◽  
Nathan T. Mortimer ◽  
...  

ABSTRACTAs organisms are constantly exposed to the damaging effects of oxidative stress through both environmental exposure and internal metabolic processes, they have evolved a variety of mechanisms to cope with this stress. One such mechanism is the highly conserved p38 MAPK (p38K) pathway, which is known to be post-translationally activated in response to oxidative stress, resulting in the activation of downstream antioxidant targets. However, little is known about the role of p38K transcriptional regulation in response to oxidative stress. Therefore, we analyzed the p38K gene family across the genus Drosophila to identify conserved regulatory elements. We found that oxidative stress exposure results in increased p38K protein levels in multiple Drosophila species and is associated with increased oxidative stress resistance. We also found that the p38Kb genomic locus includes conserved AP-1 and lola-PT transcription factor consensus binding sites. Accordingly, over-expression of these transcription factors in D. melanogaster is sufficient to induce transcription of p38Kb and enhances resistance to oxidative stress. We further found that the presence of a putative lola-PT binding site in the p38Kb locus of a given species is predictive of the species' survival in response to oxidative stress. Through our comparative genomics approach, we have identified biologically relevant putative transcription factor binding sites that regulate the expression of p38Kb and are associated with resistance to oxidative stress. These findings reveal a novel mode of regulation for p38K genes and suggest that transcription may play as important a role in p38K-mediated stress responses as post-translational modifications.


2007 ◽  
Vol 4 (2) ◽  
pp. 1-23
Author(s):  
Amitava Karmaker ◽  
Kihoon Yoon ◽  
Mark Doderer ◽  
Russell Kruzelock ◽  
Stephen Kwek

Summary Revealing the complex interaction between trans- and cis-regulatory elements and identifying these potential binding sites are fundamental problems in understanding gene expression. The progresses in ChIP-chip technology facilitate identifying DNA sequences that are recognized by a specific transcription factor. However, protein-DNA binding is a necessary, but not sufficient, condition for transcription regulation. We need to demonstrate that their gene expression levels are correlated to further confirm regulatory relationship. Here, instead of using a linear correlation coefficient, we used a non-linear function that seems to better capture possible regulatory relationships. By analyzing tissue-specific gene expression profiles of human and mouse, we delineate a list of pairs of transcription factor and gene with highly correlated expression levels, which may have regulatory relationships. Using two closely-related species (human and mouse), we perform comparative genome analysis to cross-validate the quality of our prediction. Our findings are confirmed by matching publicly available TFBS databases (like TRANFAC and ConSite) and by reviewing biological literature. For example, according to our analysis, 80% and 85.71% of the targets genes associated with E2F5 and RELB transcription factors have the corresponding known binding sites. We also substantiated our results on some oncogenes with the biomedical literature. Moreover, we performed further analysis on them and found that BCR and DEK may be regulated by some common transcription factors. Similar results for BTG1, FCGR2B and LCK genes were also reported.


2021 ◽  
Author(s):  
Jonathan P. Karr ◽  
John J. Ferrie ◽  
Robert Tjian ◽  
Xavier Darzacq

How distal cis-regulatory elements (e.g., enhancers) communicate with promoters remains an unresolved question of fundamental importance. Although transcription factors and cofactors are known to mediate this communication, the mechanism by which diffusible molecules relay regulatory information from one position to another along the chromosome is a biophysical puzzle—one that needs to be revisited in light of recent data that cannot easily fit into previous solutions. Here we propose a new model that diverges from the textbook enhancer–promoter looping paradigm and offer a synthesis of the literature to make a case for its plausibility, focusing on the coactivator p300.


2020 ◽  
Author(s):  
Pei-Suen Tsou ◽  
Pamela J. Palisoc ◽  
Mustafa Ali ◽  
Dinesh Khanna ◽  
Amr H Sawalha

AbstractSystemic sclerosis (SSc) is a rare autoimmune disease of unknown etiology characterized by widespread fibrosis and vascular complications. We utilized an assay for genome-wide chromatin accessibility to examine the chromatin landscape and transcription factor footprints in both endothelial cells (ECs) and fibroblasts isolated from healthy controls and patients with diffuse cutaneous (dc) SSc. In both cell types, chromatin accessibility was significantly reduced in SSc patients compared to healthy controls. Genes annotated from differentially accessible chromatin regions were enriched in pathways and gene ontologies involved in the nervous system. In addition, our data revealed that chromatin binding of transcription factors SNAI2, ETV2, and ELF1 was significantly increased in dcSSc ECs, while recruitment of RUNX1 and RUNX2 was enriched in dcSSc fibroblasts. Significant elevation of SNAI2 and ETV2 levels in dcSSc ECs, and RUNX2 levels in dcSSc fibroblasts were confirmed. Further analysis of publicly available ETV2-target genes suggests that ETV2 may play a critical role in EC dysfunction in dcSSc. Our data, for the first time, uncovered the chromatin blueprint of dcSSc ECs and fibroblasts, and suggested that neural-related characteristics of SSc ECs and fibroblasts could be a culprit for dysregulated angiogenesis and enhanced fibrosis. Targeting these pathways and the key transcription factors identified might present novel therapeutic approaches for this disease.


2016 ◽  
Author(s):  
David Felix Lamparter ◽  
Daniel Marbach ◽  
Rico Rueedi ◽  
Sven Bergmann ◽  
Zoltan Kutalik

To better understand genome regulation, it is important to uncover the role of transcription factors in the process of chromatin structure establishment and maintenance. Here we present a data-driven approach to systematically characterize transcription factors that are relevant for this process. Our method uses a linear mixed modeling approach to combine data sets of transcription factor binding motif enrichments in open chromatin and gene expression across the same set of cell lines. Applying this approach to the ENCODE data set we confirm already known and imply numerous novel transcription factors in playing a role in the establishment or maintenance of open chromatin.


2018 ◽  
Author(s):  
Ignacio J. Tripodi ◽  
Mary A. Allen ◽  
Robin D. Dowell

AbstractTranscription factors are managers of the cellular factory, and key components to many diseases. Many non-coding single nucleotide polymorphisms affect transcription factors, either by directly altering the protein or its functional activity at individual binding sites. Here we first briefly summarize high throughput approaches to studying transcription factor activity. We then demonstrate, using published chromatin accessibility data (specifically ATAC-seq), that the genome wide profile of TF recognition motifs relative to regions of open chromatin can determine the key transcription factor altered by a perturbation. Our method of determining which TF are altered by a perturbation is simple, quick to implement and can be used when biological samples are limited. In the future, we envision this method could be applied to determining which TFs show altered activity in response to a wide variety of drugs and diseases.


Sign in / Sign up

Export Citation Format

Share Document