scholarly journals The role of maternal pioneer factors in predefining first zygotic responses to inductive signals

2018 ◽  
Author(s):  
George E. Gentsch ◽  
Thomas Spruce ◽  
Nick D. L. Owens ◽  
James C. Smith

ABSTRACTEmbryonic development yields many different cell types in response to just a few families of inductive signals. The property of a signal-receiving cell that determines how it responds to such signals, including the activation of cell type-specific genes, is known as its competence. Here, we show how maternal factors modify chromatin to specify initial competence in the frog Xenopus tropicalis. We identified the earliest engaged regulatory DNA sequences, and inferred from them critical activators of the zygotic genome. Of these, we showed that the pioneering activity of the maternal pluripotency factors Pou5f3 and Sox3 predefines competence for germ layer formation by extensively remodeling compacted chromatin before the onset of signaling. The remodeling includes the opening and marking of thousands of regulatory elements, extensive chromatin looping, and the co-recruitment of signal-mediating transcription factors. Our work identifies significant developmental principles that inform our understanding of how pluripotent stem cells interpret inductive signals.

2019 ◽  
Vol 10 (1) ◽  
Author(s):  
George E. Gentsch ◽  
Thomas Spruce ◽  
Nick D. L. Owens ◽  
James C. Smith

Abstract Embryonic development yields many different cell types in response to just a few families of inductive signals. The property of signal-receiving cells that determines how they respond to inductive signals is known as competence, and it differs in different cell types. Here, we explore the ways in which maternal factors modify chromatin to specify initial competence in the frog Xenopus tropicalis. We identify early-engaged regulatory DNA sequences, and infer from them critical activators of the zygotic genome. Of these, we show that the pioneering activity of the maternal pluripotency factors Pou5f3 and Sox3 determines competence for germ layer formation by extensively remodelling compacted chromatin before the onset of inductive signalling. This remodelling includes the opening and marking of thousands of regulatory elements, extensive chromatin looping, and the co-recruitment of signal-mediating transcription factors. Our work identifies significant developmental principles that inform our understanding of how pluripotent stem cells interpret inductive signals.


2017 ◽  
Author(s):  
Luca Pinello ◽  
Rick Farouni ◽  
Guo-Cheng Yuan

AbstractMotivationWith the increasing amount of genomic and epigenomic data in the public domain, a pressing challenge is how to integrate these data to investigate the role of epigenetic mechanisms in regulating gene expression and maintenance of cell-identity. To this end, we have implemented a computational pipeline to systematically study epigenetic variability and uncover regulatory DNA sequences that play a role in gene regulation.ResultsHaystack is a bioinformatics pipeline to characterize hotspots of epigenetic variability across different cell-types as well as cell-type specific cis-regulatory elements along with their corresponding transcription factors. Our approach is generally applicable to any epigenetic mark and provides an important tool to investigate cell-type identity and the mechanisms underlying epigenetic switches during development. Additionally, we make available a set of precomputed tracks for a number of epigenetic marks across several cell types. These precomputed results may be used as an independent resource for functional annotation of the human genome.AvailabilityThe Haystack pipeline is implemented as an open-source, multiplatform, Python package called haystack_bio available at https://github.com/pinellolab/[email protected], [email protected]


2020 ◽  
Author(s):  
Yupeng Wang ◽  
Rosario B. Jaime-Lara ◽  
Abhrarup Roy ◽  
Ying Sun ◽  
Xinyue Liu ◽  
...  

AbstractWe propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of “strong enhancer” chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, sequential k-mer (k=5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers including gkm-SVM and DanQ, with regard to distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL is able to directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified according to their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL.


2021 ◽  
Author(s):  
Philipp Benner ◽  
Martin Vingron

AbstractRecent efforts to measure epigenetic marks across a wide variety of different cell types and tissues provide insights into the cell type-specific regulatory landscape. We use this data to study if there exists a correlate of epigenetic signals in the DNA sequence of enhancers and explore with computational methods to what degree such sequence patterns can be used to predict cell type-specific regulatory activity. By constructing classifiers that predict in which tissues enhancers are active, we are able to identify sequence features that might be recognized by the cell in order to regulate gene expression. While classification performances vary greatly between tissues, we show examples where our classifiers correctly predict tissue specific regulation from sequence alone. We also show that many of the informative patterns indeed harbor transcription factor footprints.


2016 ◽  
Author(s):  
Elizabeth Baskin ◽  
Rick Farouni ◽  
Ewy A. Mathe

AbstractSummaryRegulatory elements regulate gene transcription, and their location and accessibility is cell-type specific, particularly for enhancers. Mapping and comparing chromatin accessibility between different cell types may identify mechanisms involved in cellular development and disease progression. To streamline and simplify differential analysis of regulatory elements genome-wide using chromatin accessibility data, such as DNase-seq, ATAC-seq, we developed ALTRE (ALTered Regulatory Elements), an R package and associated R Shiny web app. ALTRE makes such analysis accessible to a wide range of users – from novice to practiced computational biologists.Availabilityhttps://github.com/Mathelab/[email protected]


2021 ◽  
Author(s):  
Biswajyoti Sahu ◽  
Tuomo Hartonen ◽  
Paivi Pihlajamaa ◽  
Bei Wei ◽  
Kashyap Dave ◽  
...  

DNA determines where and when genes are expressed, but the full set of sequence determinants that control gene expression is not known. To obtain a global and unbiased view of the relative importance of different sequence determinants in gene expression, we measured transcriptional activity of DNA sequences that are in aggregate ~100 times longer than the human genome in three different cell types. We show that enhancers can be classified to three main types: classical enhancers1, closed chromatin enhancers and chromatin-dependent enhancers, which act via different mechanisms and differ in motif content. Transcription factors (TFs) act generally in an additive manner with weak grammar, with classical enhancers increasing expression from promoters by a mechanism that does not involve specific TF-TF interactions. Few TFs are strongly active in a cell, with most activities similar between cell types. Chromatin-dependent enhancers are enriched in forkhead motifs, whereas classical enhancers contain motifs for TFs with strong transactivator domains such as ETS and bZIP; these motifs are also found at transcription start site (TSS)-proximal positions. However, some TFs, such as NRF1 only activate transcription when placed close to the TSS, and others such as YY1 display positional preference with respect to the TSS. TFs can thus be classified into four non-exclusive subtypes based on their transcriptional activity: chromatin opening, enhancing, promoting and TSS determining factors — consistent with the view that the binding motif is the only atomic unit of gene expression.


2019 ◽  
Author(s):  
Anvita Gupta ◽  
Anshul Kundaje

AbstractTargeted optimizing of existing DNA sequences for useful properties, has the potential to enable several synthetic biology applications from modifying DNA to treat genetic disorders to designing regulatory elements to fine tune context-specific gene expression. Current approaches for targeted genome editing are largely based on prior biological knowledge or ad-hoc rules. Few if any machine learning approaches exist for targeted optimization of regulatory DNA sequences.Here, we propose a novel generative neural network architecture for targeted DNA sequence editing – the EDA architecture – consisting of an encoder, decoder, and analyzer. We showcase the use of EDA to optimize regulatory DNA sequences to bind to the transcription factor SPI1. Compared to other state-of-the-art approaches such as a textual variational autoencoder and rule-based editing, EDA significantly improves predicted binding of SPI1 of genomic sequences with the minimal set of edits. We also use EDA to design regulatory elements with optimized grammars of CREB1 binding sites that can tune reporter expression levels as measured by massively parallel reporter assays (MPRA). We analyze the properties of the binding sites in the edited sequences and find patterns that are consistent with previously reported grammatical rules which tie gene expression to CRE binding site density, spacing and affinity.


2021 ◽  
Author(s):  
Erin M Wissink ◽  
Delsy M. Martinez ◽  
Kirk T. Ehmsen ◽  
Keith R. Yamamoto ◽  
John T Lis

The glucocorticoid receptor (GR) regulates transcription through binding to specific DNA motifs, particularly at enhancers. While the motif to which it binds is constant across cell types, GR has cell type-specific binding at genomic loci, resulting in regulation of different genes. The presence of other bound transcription factors (TFs) is hypothesized to strongly influence where GR binds. Here, we addressed the roles of other TFs in the glucocorticoid response by comparing changes in GR binding and nascent transcription at promoters and distal candidate cis-regulatory elements (CCREs) in two distinct human cancer cell types. We found that after glucocorticoid treatment, GR binds to thousands of genomic loci that are primarily outside of promoter regions and are potentially enhancers. The majority of these GR binding sites are cell-type specific, and they are associated with pioneer factor binding. A small fraction of GR occupied regions (GORs) displayed increased bidirectional nascent transcription, which is a characteristic of many active enhancers, after glucocorticoid treatment. Non-promoter GORs with increased transcription were specifically enriched for AP-1 binding prior to glucocorticoid treatment. These results support a model of transcriptional regulation in which multiple classes of TFs are required. The pioneer factors increase chromatin accessibility, facilitating the binding of GR and additional factors. AP-1 binding poises a fraction of accessible sites to be rapidly transcribed upon glucocorticoid-induced GR binding. The coordinated activity of multiple TFs then results in cell type-specific changes in gene expression. We anticipate that many models of inducible gene expression also require multiple distinct TFs that act at multiple steps of transcriptional regulation.


2021 ◽  
Vol 3 (4) ◽  
Author(s):  
Philipp Benner ◽  
Martin Vingron

Abstract Recent efforts to measure epigenetic marks across a wide variety of different cell types and tissues provide insights into the cell type-specific regulatory landscape. We use these data to study whether there exists a correlate of epigenetic signals in the DNA sequence of enhancers and explore with computational methods to what degree such sequence patterns can be used to predict cell type-specific regulatory activity. By constructing classifiers that predict in which tissues enhancers are active, we are able to identify sequence features that might be recognized by the cell in order to regulate gene expression. While classification performances vary greatly between tissues, we show examples where our classifiers correctly predict tissue-specific regulation from sequence alone. We also show that many of the informative patterns indeed harbor transcription factor footprints.


Acta Naturae ◽  
2016 ◽  
Vol 8 (2) ◽  
pp. 79-86 ◽  
Author(s):  
P. V. Elizar’ev ◽  
D. V. Lomaev ◽  
D. A. Chetverina ◽  
P. G. Georgiev ◽  
M. M. Erokhin

Maintenance of the individual patterns of gene expression in different cell types is required for the differentiation and development of multicellular organisms. Expression of many genes is controlled by Polycomb (PcG) and Trithorax (TrxG) group proteins that act through association with chromatin. PcG/TrxG are assembled on the DNA sequences termed PREs (Polycomb Response Elements), the activity of which can be modulated and switched from repression to activation. In this study, we analyzed the influence of transcriptional read-through on PRE activity switch mediated by the yeast activator GAL4. We show that a transcription terminator inserted between the promoter and PRE doesnt prevent switching of PRE activity from repression to activation. We demonstrate that, independently of PRE orientation, high levels of transcription fail to dislodge PcG/TrxG proteins from PRE in the absence of a terminator. Thus, transcription is not the main factor required for PRE activity switch.


Sign in / Sign up

Export Citation Format

Share Document