scholarly journals A systematic comparison reveals substantial differences in chromosomal versus episomal encoding of enhancer activity

2016 ◽  
Author(s):  
Fumitaka Inoue ◽  
Martin Kircher ◽  
Beth Martin ◽  
Gregory M. Cooper ◽  
Daniela M. Witten ◽  
...  

AbstractCandidate enhancers can be identified on the basis of chromatin modifications, the binding of chromatin modifiers and transcription factors and cofactors, or chromatin accessibility. However, validating such candidates as bona fide enhancers requires functional characterization, typically achieved through reporter assays that test whether a sequence can drive expression of a transcriptional reporter via a minimal promoter. A longstanding concern is that reporter assays are mainly implemented on episomes, which are thought to lack physiological chromatin. However, the magnitude and determinants of differences incis-regulation for regulatory sequences residing in episomes versus chromosomes remain almost completely unknown. To address this question in a systematic manner, we developed and applied a novel lentivirus-based massively parallel reporter assay (lentiMPRA) to directly compare the functional activities of 2,236 candidate liver enhancers in an episomal versus a chromosomally integrated context. We find that the activities of chromosomally integrated sequences are substantially different from the activities of the identical sequences assayed on episomes, and furthermore are correlated with different subsets of ENCODE annotations. The results of chromosomally-based reporter assays are also more reproducible and more strongly predictable by both ENCODE annotations and sequence-based models. With a linear model that combines chromatin annotations and sequence information, we achieve a Pearson’s R2of 0.347 for predicting the results of chromosomally integrated reporter assays. This level of prediction is better than with either chromatin annotations or sequence information alone and also outperforms predictive models of episomal assays. Our results have broad implications for howcis-regulatory elements are identified, prioritized and functionally validated.

2017 ◽  
Vol 114 (7) ◽  
pp. E1291-E1300 ◽  
Author(s):  
Sharon R. Grossman ◽  
Xiaolan Zhang ◽  
Li Wang ◽  
Jesse Engreitz ◽  
Alexandre Melnikov ◽  
...  

Enhancers regulate gene expression through the binding of sequence-specific transcription factors (TFs) to cognate motifs. Various features influence TF binding and enhancer function—including the chromatin state of the genomic locus, the affinities of the binding site, the activity of the bound TFs, and interactions among TFs. However, the precise nature and relative contributions of these features remain unclear. Here, we used massively parallel reporter assays (MPRAs) involving 32,115 natural and synthetic enhancers, together with high-throughput in vivo binding assays, to systematically dissect the contribution of each of these features to the binding and activity of genomic regulatory elements that contain motifs for PPARγ, a TF that serves as a key regulator of adipogenesis. We show that distinct sets of features govern PPARγ binding vs. enhancer activity. PPARγ binding is largely governed by the affinity of the specific motif site and higher-order features of the larger genomic locus, such as chromatin accessibility. In contrast, the enhancer activity of PPARγ binding sites depends on varying contributions from dozens of TFs in the immediate vicinity, including interactions between combinations of these TFs. Different pairs of motifs follow different interaction rules, including subadditive, additive, and superadditive interactions among specific classes of TFs, with both spatially constrained and flexible grammars. Our results provide a paradigm for the systematic characterization of the genomic features underlying regulatory elements, applicable to the design of synthetic regulatory elements or the interpretation of human genetic variation.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 939 ◽  
Author(s):  
David Santiago-Algarra ◽  
Lan T.M. Dao ◽  
Lydie Pradel ◽  
Alexandre España ◽  
Salvatore Spicuglia

The regulation of gene transcription in higher eukaryotes is accomplished through the involvement of transcription start site (TSS)-proximal (promoters) and -distal (enhancers) regulatory elements. It is now well acknowledged that enhancer elements play an essential role during development and cell differentiation, while genetic alterations in these elements are a major cause of human disease. Many strategies have been developed to identify and characterize enhancers. Here, we discuss recent advances in high-throughput approaches to assess enhancer activity, from the well-established massively parallel reporter assays to the recent clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-based technologies. We highlight how these approaches contribute toward a better understanding of enhancer function, eventually leading to the discovery of new types of regulatory sequences, and how the alteration of enhancers can affect transcriptional regulation.


2021 ◽  
Author(s):  
Cesar Arenas-Mena ◽  
Sofija Miljovska ◽  
Sevinc Ercan ◽  
Tanvi Shashikant ◽  
Charles G. Danko ◽  
...  

The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins. We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo. Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation have distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis.


2015 ◽  
Author(s):  
Ilias Georgakopoulos-Soares ◽  
Naman Jain ◽  
Jesse Gray ◽  
Martin Hemberg

DNA regulatory elements contain short motifs where transcription factors (TFs) can bind to modulate gene expression. Although the broad principles of TF regulation are well understood, the rules that dictate how combinatorial TF binding translates into transcriptional activity remain largely unknown. With the rapid advances in DNA synthesis and sequencing technologies and the continuing decline in the associated costs, high-throughput experiments can be performed to investigate the regulatory role of thousands of oligonucleotide sequences simultaneously. Nevertheless, designing high-throughput reporter assay experiments such as Massively Parallel Reporter Assays (MPRAs) and similar methods remains challenging. We introduce MPRAnator, a set of tools that facilitate rapid design of MPRA experiments. With MPRA Motif design, a set of variables provides fine control of how motifs are placed into sequences therefore allowing the user to investigate the rules that govern TF occupancy. MPRA SNP design can be used to investigate the functional effects of single or combinations of SNPs at regulatory sequences. Finally, the Transmutation tool allows for the design of negative controls by permitting scrambling, reversing, complementing or introducing multiple random mutations in the input sequences or motifs.


2021 ◽  
Author(s):  
Vasiliki Theodorou ◽  
Aikaterini Stefanaki ◽  
Minas Drakos ◽  
Dafne Triantafyllou ◽  
Christos Delidakis

Background: ASC/ASCL proneural transcription factors are oncogenic and exhibit impressive reprogramming and pioneer activities. In both Drosophila and mammals, these factors are central in the early specification of the neural fate, where they act in opposition to Notch signalling. However, the role of ASC on the chromatin during CNS neural stem cells birth remains elusive. Results: We investigated the chromatin changes accompanying neural commitment using an integrative genetics and genomics methodology. We found that ASC factors bind equally strongly to two distinct classes of cis-regulatory elements: open regions remodeled earlier during maternal to zygotic transition by Zelda and Zelda-independent, less accessible regions. Both classes cis-elements exhibit enhanced chromatin accessibility during neural specification and correlate with transcriptional regulation of genes involved in many biological processes necessary for neuroblast function. We identified an ASC-Notch regulated TF network that most likely act as the prime regulators of neuroblast function. Using a cohort of ASC target genes, we report that ASC null neuroblasts are defectively specified, remaining initially stalled, lacking expression of many proneural targets and unable to divide. When they eventually start proliferating, they produce compromised progeny. Generation of lacZ reporter lines driven by proneural-bound elements display enhancer activity within neuroblasts and proneural dependency. Therefore, the partial neuroblast identity seen in the absence of ASC genes is driven by other, proneural-independent, cis-elements. Neuroblast impairment and the late differentiation defects of ASC mutants are corrected by ectodermal induction of individual ASC genes but not by individual members of the TF network downstream of ASC. However, in wild type embryos induction of individual members of this network induces CNS hyperplasia, suggesting that they synergize with the activating function of ASC to establish the chromatin dynamics that promote neural specification. Conclusion: ASC factors bind a large number of enhancers to orchestrate the timely activation of the neural chromatin program during neuroectodermal to neuroblast transition. This early chromatin remodeling is crucial for both neuroblast homeostasis as well as future progeny fidelity.


2021 ◽  
Author(s):  
Arushi Varshney ◽  
Yasuhiro Kyono ◽  
Venkateswaran Ramamoorthi Elangovan ◽  
Collin Wang ◽  
Michael R. Erdos ◽  
...  

Identifying the tissue-specific molecular signatures of active regulatory elements is critical to understand gene regulatory mechanisms. Here, we identify transcription start sites (TSS) using cap analysis of gene expression (CAGE) across 57 human pancreatic islet samples. We identify 9,954 reproducible CAGE tag clusters (TCs), ~20% of which are islet-specific and occur mostly distal to known gene TSSs. We integrated islet CAGE data with histone modification and chromatin accessibility profiles to identify epigenomic signatures of transcription initiation. Using a massively parallel reporter assay, we validated the transcriptional enhancer activity for 2,279 of 3,378 (~68%) tested islet CAGE elements (5% FDR). TCs within accessible enhancers show higher enrichment to overlap type 2 diabetes genome-wide association study (GWAS) signals than existing islet annotations, which emphasizes the utility of mapping CAGE profiles in disease-relevant tissue. This work provides a high-resolution map of transcriptional initiation in human pancreatic islets with utility for dissecting active enhancers at GWAS loci.


2021 ◽  
Author(s):  
Arushi Varshney ◽  
Yasuhiro Kyono ◽  
Venkateswaran Ramamoorthi Elangovan ◽  
Collin Wang ◽  
Michael R. Erdos ◽  
...  

Identifying the tissue-specific molecular signatures of active regulatory elements is critical to understand gene regulatory mechanisms. Here, we identify transcription start sites (TSS) using cap analysis of gene expression (CAGE) across 57 human pancreatic islet samples. We identify 9,954 reproducible CAGE tag clusters (TCs), ~20% of which are islet-specific and occur mostly distal to known gene TSSs. We integrated islet CAGE data with histone modification and chromatin accessibility profiles to identify epigenomic signatures of transcription initiation. Using a massively parallel reporter assay, we validated the transcriptional enhancer activity for 2,279 of 3,378 (~68%) tested islet CAGE elements (5% FDR). TCs within accessible enhancers show higher enrichment to overlap type 2 diabetes genome-wide association study (GWAS) signals than existing islet annotations, which emphasizes the utility of mapping CAGE profiles in disease-relevant tissue. This work provides a high-resolution map of transcriptional initiation in human pancreatic islets with utility for dissecting active enhancers at GWAS loci.


2020 ◽  
Author(s):  
Sierra S. Nishizaki ◽  
Torrin L. McDonald ◽  
Gregory A. Farnum ◽  
Monica J. Holmes ◽  
Melissa L. Drexel ◽  
...  

AbstractBackgroundZebrafish are a foundational model organism for studying the spatio-temporal activity of genes and their regulatory sequences. A variety of approaches are currently available for editing genes and modifying gene expression in zebrafish, including RNAi, Cre/lox, and CRISPR-Cas9. However, the lac operator-repressor system, a component of the E. coli lac operon which has been adapted for use in many other species and is a valuable, flexible tool for studying the inducible modulation of gene expression, has not previously been tested in zebrafish.ResultsHere we demonstrate that the lac operator-repressor system robustly decreases expression of firefly luciferase in cultured zebrafish fibroblast cells. Our work establishes the lac operator-repressor system as a promising tool for the manipulation of gene expression in whole zebrafish.ConclusionsOur results lay the groundwork for the development of lac-based reporter assays in zebrafish, and adds to the tools available for investigating dynamic gene expression in embryogenesis. We believe that this work will catalyze the development of new reporter assay systems to investigate uncharacterized regulatory elements and their cell-type specific activities.


2021 ◽  
Author(s):  
Ignacio L. Ibarra ◽  
Vikram S. Ratnu ◽  
Lucia Gordillo ◽  
In-Young Hwang ◽  
Luca Mariani ◽  
...  

Neuronal activity induced by brain-derived neurotrophic factor (BDNF) triggers gene expression, which is crucial for neuronal survival, differentiation, synaptic plasticity, memory formation, and neurocognitive health. However, its role in chromatin regulation is unclear. Here, using temporal profiling of chromatin accessibility and transcription in mouse primary cortical neurons upon either BDNF stimulation or depolarization (KCl), we identify features that define BDNF-specific chromatin-to-gene expression programs. Enhancer activation is an early event in the regulatory control of BDNF-treated neurons, where the bZIP motif-binding Fos protein pioneered chromatin opening and cooperated with co-regulatory transcription factors (Homeobox, EGRs, and CTCF) to induce transcription. Deleting cis-regulatory sequences decreased BDNF-mediated Arc expression, a regulator of synaptic plasticity. BDNF-induced accessible regions are linked to preferential exon usage by neurodevelopmental disorder-related genes and heritability of neuronal complex traits, which were validated in human iPSC-derived neurons. Thus, we provide a comprehensive view of BDNF-mediated genome regulatory features using comparative genomic approaches to dissect mammalian neuronal activity.


2020 ◽  
Vol 16 (11) ◽  
pp. e1008334
Author(s):  
Ling Chen ◽  
John A. Capra

Deep neural networks (DNNs) have achieved state-of-the-art performance in identifying gene regulatory sequences, but they have provided limited insight into the biology of regulatory elements due to the difficulty of interpreting the complex features they learn. Several models of how combinatorial binding of transcription factors, i.e. the regulatory grammar, drives enhancer activity have been proposed, ranging from the flexible TF billboard model to the stringent enhanceosome model. However, there is limited knowledge of the prevalence of these (or other) sequence architectures across enhancers. Here we perform several hypothesis-driven analyses to explore the ability of DNNs to learn the regulatory grammar of enhancers. We created synthetic datasets based on existing hypotheses about combinatorial transcription factor binding site (TFBS) patterns, including homotypic clusters, heterotypic clusters, and enhanceosomes, from real TF binding motifs from diverse TF families. We then trained deep residual neural networks (ResNets) to model the sequences under a range of scenarios that reflect real-world multi-label regulatory sequence prediction tasks. We developed a gradient-based unsupervised clustering method to extract the patterns learned by the ResNet models. We demonstrated that simulated regulatory grammars are best learned in the penultimate layer of the ResNets, and the proposed method can accurately retrieve the regulatory grammar even when there is heterogeneity in the enhancer categories and a large fraction of TFBS outside of the regulatory grammar. However, we also identify common scenarios where ResNets fail to learn simulated regulatory grammars. Finally, we applied the proposed method to mouse developmental enhancers and were able to identify the components of a known heterotypic TF cluster. Our results provide a framework for interpreting the regulatory rules learned by ResNets, and they demonstrate that the ability and efficiency of ResNets in learning the regulatory grammar depends on the nature of the prediction task.


Sign in / Sign up

Export Citation Format

Share Document