scholarly journals Mobility connects: transposable elements wire new transcriptional networks by transferring transcription factor binding motifs

2020 ◽  
Vol 48 (3) ◽  
pp. 1005-1017
Author(s):  
Yichun Qiu ◽  
Claudia Köhler

Transposable elements (TEs) constitute major fractions of plant genomes. Their potential to be mobile provides them with the capacity to cause major genome rearrangements. Those effects are potentially deleterious and enforced the evolution of epigenetic suppressive mechanisms controlling TE activity. However, beyond their deleterious effects, TE insertions can be neutral or even advantageous for the host, leading to long-term retention of TEs in the host genome. Indeed, TEs are increasingly recognized as major drivers of evolutionary novelties by regulating the expression of nearby genes. TEs frequently contain binding motifs for transcription factors and capture binding motifs during transposition, which they spread through the genome by transposition. Thus, TEs drive the evolution and diversification of gene regulatory networks by recruiting lineage-specific targets under the regulatory control of specific transcription factors. This process can explain the rapid and repeated evolution of developmental novelties, such as C4 photosynthesis and a wide spectrum of stress responses in plants. It also underpins the convergent evolution of embryo nourishing tissues, the placenta in mammals and the endosperm in flowering plants. Furthermore, the gene regulatory network underlying flower development has also been largely reshaped by TE-mediated recruitment of regulatory elements; some of them being preserved across long evolutionary timescales. In this review, we highlight the potential role of TEs as evolutionary toolkits in plants by showcasing examples of TE-mediated evolutionary novelties.

Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
Davide Dalfovo ◽  
Samuel Valentini ◽  
Alessandro Romanel

Abstract Understanding the interaction between human genome regulatory elements and transcription factors is fundamental to elucidate the structure of gene regulatory networks. Here we present CONREL, a web application that allows for the exploration of functionally annotated transcriptional ‘consensus’ regulatory elements at different levels of abstraction. CONREL provides an extensive collection of consensus promoters, enhancers and active enhancers for 198 cell-lines across 38 tissue types, which are also combined to provide global consensuses. In addition, 1000 Genomes Project genotype data and the ‘total binding affinity’ of thousands of transcription factor binding motifs at genomic regulatory elements is fully combined and exploited to characterize and annotate functional properties of our collection. Comparison with other available resources highlights the strengths and advantages of CONREL. CONREL can be used to explore genomic loci, specific genes or genomic regions of interest across different cell lines and tissue types. The resource is freely available at https://bcglab.cibio.unitn.it/conrel.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Julius Judd ◽  
Hayley Sanderson ◽  
Cédric Feschotte

Abstract Background Transposable elements are increasingly recognized as a source of cis-regulatory variation. Previous studies have revealed that transposons are often bound by transcription factors and some have been co-opted into functional enhancers regulating host gene expression. However, the process by which transposons mature into complex regulatory elements, like enhancers, remains poorly understood. To investigate this process, we examined the contribution of transposons to the cis-regulatory network controlling circadian gene expression in the mouse liver, a well-characterized network serving an important physiological function. Results ChIP-seq analyses reveal that transposons and other repeats contribute ~ 14% of the binding sites for core circadian regulators (CRs) including BMAL1, CLOCK, PER1/2, and CRY1/2, in the mouse liver. RSINE1, an abundant murine-specific SINE, is the only transposon family enriched for CR binding sites across all datasets. Sequence analyses and reporter assays reveal that the circadian regulatory activity of RSINE1 stems from the presence of imperfect CR binding motifs in the ancestral RSINE1 sequence. These motifs matured into canonical motifs through point mutations after transposition. Furthermore, maturation occurred preferentially within elements inserted in the proximity of ancestral CR binding sites. RSINE1 also acquired motifs that recruit nuclear receptors known to cooperate with CRs to regulate circadian gene expression specifically in the liver. Conclusions Our results suggest that the birth of enhancers from transposons is predicated both by the sequence of the transposon and by the cis-regulatory landscape surrounding their genomic integration site.


2021 ◽  
pp. 1-13
Author(s):  
Francis Poulat

In vertebrates, gonadal sex determination is the process by which transcription factors drive the choice between the testicular and ovarian identity of undifferentiated somatic progenitors through activation of 2 different transcriptional programs. Studies in animal models suggest that sex determination always involves sex-specific transcription factors that activate or repress sex-specific genes. These transcription factors control their target genes by recognizing their regulatory elements in the non-coding genome and their binding motifs within their DNA sequence. In the last 20 years, the development of genomic approaches that allow identifying all the genomic targets of a transcription factor in eukaryotic cells gave the opportunity to globally understand the function of the nuclear proteins that control complex genetic programs. Here, the major transcription factors involved in male and female vertebrate sex determination and the genomic profiling data of mouse gonads that contributed to deciphering their transcriptional regulation role will be reviewed.


2020 ◽  
Vol 223 (14) ◽  
pp. jeb221622
Author(s):  
Sarah M. Ryan ◽  
Kaitie Wildman ◽  
Briseida Oceguera-Perez ◽  
Scott Barbee ◽  
Nathan T. Mortimer ◽  
...  

ABSTRACTAs organisms are constantly exposed to the damaging effects of oxidative stress through both environmental exposure and internal metabolic processes, they have evolved a variety of mechanisms to cope with this stress. One such mechanism is the highly conserved p38 MAPK (p38K) pathway, which is known to be post-translationally activated in response to oxidative stress, resulting in the activation of downstream antioxidant targets. However, little is known about the role of p38K transcriptional regulation in response to oxidative stress. Therefore, we analyzed the p38K gene family across the genus Drosophila to identify conserved regulatory elements. We found that oxidative stress exposure results in increased p38K protein levels in multiple Drosophila species and is associated with increased oxidative stress resistance. We also found that the p38Kb genomic locus includes conserved AP-1 and lola-PT transcription factor consensus binding sites. Accordingly, over-expression of these transcription factors in D. melanogaster is sufficient to induce transcription of p38Kb and enhances resistance to oxidative stress. We further found that the presence of a putative lola-PT binding site in the p38Kb locus of a given species is predictive of the species' survival in response to oxidative stress. Through our comparative genomics approach, we have identified biologically relevant putative transcription factor binding sites that regulate the expression of p38Kb and are associated with resistance to oxidative stress. These findings reveal a novel mode of regulation for p38K genes and suggest that transcription may play as important a role in p38K-mediated stress responses as post-translational modifications.


2019 ◽  
Vol 18 (5) ◽  
pp. 290-301 ◽  
Author(s):  
Christa G Toenhake ◽  
Richárd Bártfai

Abstract Malaria parasites are characterized by a complex life cycle that is accompanied by dynamic gene expression patterns. The factors and mechanisms that regulate gene expression in these parasites have been searched for even before the advent of next generation sequencing technologies. Functional genomics approaches have substantially boosted this area of research and have yielded significant insights into the interplay between epigenetic, transcriptional and post-transcriptional mechanisms. Recently, considerable progress has been made in identifying sequence-specific transcription factors and DNA-encoded regulatory elements. Here, we review the insights obtained from these efforts including the characterization of core promoters, the involvement of sequence-specific transcription factors in life cycle progression and the mapping of gene regulatory elements. Furthermore, we discuss recent developments in the field of functional genomics and how they might contribute to further characterization of this complex gene regulatory network.


2020 ◽  
Author(s):  
Kasum Azim ◽  
Filippo Calzolari ◽  
Martina Cantone ◽  
Rainer Akkermann ◽  
Julio Vera ◽  
...  

AbstractThe subventricular zone (SVZ) is the largest neurogenic niche in the adult forebrain. Notably, neural stem cells (NSCs) of the SVZ generate not only neurons, but also oligodendrocytes, the myelin-forming cells of the central nervous system. Transcriptomic studies have provided detailed knowledge of the molecular events that regulate neurogenesis, but little is understood about adult oligodendrogenesis from SVZ-NSCs. To address this, we performed in-depth single-cell transcriptomic analyses to resolve the major differences in neuronal and oligodendroglial lineages derived from the adult SVZ. A hallmark of adult oligodendrogenesis was the stage-specific expression of transcriptional modulators that regulate developmental oligodendrogenesis. Notably, divergence of the oligodendroglial lineage was distinguished by Wnt-Notch and angiogenesis-related signaling, whereas G-protein-coupled receptor signaling pathways were the major signature observed in the neurogenic lineage. Moreover, in-depth gene regulatory network analysis identified key stage-specific master regulators of the oligodendrocyte lineage and revealed new mechanisms by which signaling pathways interact with transcriptional networks to control lineage progression. Our work provides an integrated view of the multi-step differentiation process leading from NSCs to mature oligodendrocytes, by linking environmental signals to known and novel transcriptional mechanisms orchestrating oligodendrogenesis.Main pointsDistinct adult NSC populations giving rise to either oligodendrocytes or neurons can be identified by the expression of transcription factors.Gene regulatory control of oligodendrogenesis is a major fate-determinant for their generation.


Author(s):  
Alexandra Maslova ◽  
Ricardo N. Ramirez ◽  
Ke Ma ◽  
Hugo Schmutz ◽  
Chendi Wang ◽  
...  

SUMMARYThe mammalian genome contains several million cis-regulatory elements, whose differential activity marked by open chromatin determines organogenesis and differentiation. This activity is itself embedded in the DNA sequence, decoded by sequence-specific transcription factors. Leveraging a granular ATAC-seq atlas of chromatin activity across 81 immune cell-types we show that a convolutional neural network (“AI-TAC”) can learn to infer cell-type-specific chromatin activity solely from the DNA sequence. AI-TAC does so by rediscovering, with astonishing precision, binding motifs for known regulators, and some unknown ones, mapping them with high concordance to positions validated by ChIP-seq data. AI-TAC also uncovers combinatorial influences, establishing a hierarchy of transcription factors (TFs) and their interactions involved in immunocyte specification, with intriguingly different strategies between lineages. Mouse-trained AI-TAC can parse human DNA, revealing a strikingly similar ranking of influential TFs. Thus, Deep Learning can reveal the regulatory syntax that drives the full differentiative complexity of the immune system.


2021 ◽  
Author(s):  
Valérie Schreiber ◽  
Reuben Mercier ◽  
Sara Jímenez ◽  
Tao Ye ◽  
Emmanuel García-Sánchez ◽  
...  

Objective: Mice lacking the bHLH transcription factor (TF) Neurog3 do not form pancreatic islet cells, including insulin secreting beta cells, causing diabetes. In human, homozygous mutations of NEUROG3 manifest with neonatal or childhood diabetes. Despite this critical role in islet cell development, the precise function and downstream genetic programs regulated directly by NEUROG3 remain elusive. We therefore mapped genome-wide NEUROG3 occupancy in human induced pluripotent stem cell (iPSC)-derived endocrine progenitors and determined NEUROG3 dependency of associated genes to uncover direct targets. Methods: We generated a novel hiPSC line (NEUROG3-HA-P2A-Venus), where NEUROG3 is HA-tagged and fused to a self-cleaving fluorescent VENUS reporter. We used the CUT&RUN technique to map NEUROG3 occupancy and epigenetic marks in pancreatic endocrine progenitors (PEP) differentiated from this hiPSC line. We integrated NEUROG3 occupancy data with chromatin status and gene expression in PEPs and their NEUROG3-dependence. In addition, we searched whether NEUROG3 binds type 2 diabetes mellitus (T2DM)-associated variants at the PEP stage. Results: CUT&RUN revealed a total of 863 NEUROG3 binding sites assigned to 1268 unique genes. NEUROG3 occupancy was found at promoters as well as at distant cis-regulatory elements frequently overlapping within PEP active enhancers. De novo motif analyses defined a NEUROG3 consensus binding motif and suggested potential co-regulation of NEUROG3 target genes by FOXA, RFX or PBX transcription factors. Moreover, we found that 22% of the genes downregulated in NEUROG3−/− hESC-derived PEPs are bound by NEUROG3 and thus likely to be directly regulated. NEUROG3 targets include transcription factors known to have important roles in islet cell development or function, such as NEUROD1, PAX4, NKX2-2, SOX4, MLXIPL, LMX1B, RFX3, and NEUROG3 itself. Remarkably, we uncovered that NEUROG3 binds transcriptional regulator genes with enriched expression in human fetal pancreatic alpha (e.g., IRX1, IRX2), beta (e.g., NKX6-1, SMAD9, ISX, TFCP2L1) and delta cells (ERBB4) suggesting that NEUROG3 could control islets subtype programs. Moreover, NEUROG3 targets genes critical for insulin secretion in beta cells (e.g., GCK, ABCC8/KCNJ11, CACNA1A, CHGA, SCG2, SLC30A8 and PCSK1). In addition, we unveiled a panel of ncRNA potentially regulated by NEUROG3. Lastly, we identified several T2DM risk SNPs within NEUROG3 peaks suggesting a possible developmental role of NEUROG3 in T2DM susceptibility. Conclusion: Mapping of NEUROG3 genome occupancy in PEPs uncovers an unexpectedly broad, direct control of the endocrine gene regulatory network (GRN) and raises novel hypotheses on how this master regulator controls islet and beta cell differentiation.


Author(s):  
Hidenori Nishihara

Abstract Acquisition of cis-elements is a major driving force for rewiring a gene regulatory network. Several kinds of transposable elements (TEs), mostly retrotransposons that propagate via a copy-and-paste mechanism, are known to possess transcription factor binding motifs and have provided source sequences for enhancers/promoters. However, it remains largely unknown whether retrotransposons have spread the binding sites of master regulators of morphogenesis and accelerated cis-regulatory expansion involved in common mammalian morphological features during evolution. Here, I demonstrate that thousands of binding sites for estrogen receptor α (ERα) and three related pioneer factors (FoxA1, GATA3 and AP2γ) that are essential regulators of mammary gland development arose from a spreading of the binding motifs by retrotransposons. The TE-derived functional elements serve primarily as distal enhancers and are enriched around genes associated with mammary gland morphogenesis. The source TEs occurred via a two-phased expansion consisting of mainly L2/MIR in a eutherian ancestor and endogenous retrovirus 1 (ERV1) in simian primates and murines. Thus the build-up of potential sources for cis-elements by retrotransposons followed by their frequent utilization by the host (co-option/exaptation) may have a general accelerating effect on both establishing and diversifying a gene regulatory network, leading to morphological innovation.


2020 ◽  
Vol 48 (8) ◽  
pp. 4195-4213 ◽  
Author(s):  
Allison N Catizone ◽  
Gizem Karsli Uzunbas ◽  
Petra Celadova ◽  
Sylvia Kuang ◽  
Daniel Bose ◽  
...  

Abstract The master tumor suppressor p53 controls transcription of a wide-ranging gene network involved in apoptosis, cell cycle arrest, DNA damage repair, and senescence. Recent studies revealed pervasive binding of p53 to cis-regulatory elements (CREs), which are non-coding segments of DNA that spatially and temporally control transcription through the combinatorial binding of local transcription factors. Although the role of p53 as a strong trans-activator of gene expression is well known, the co-regulatory factors and local sequences acting at p53-bound CREs are comparatively understudied. We designed and executed a massively parallel reporter assay (MPRA) to investigate the effect of transcription factor binding motifs and local sequence context on p53-bound CRE activity. Our data indicate that p53-bound CREs are both positively and negatively affected by alterations in local sequence context and changes to co-regulatory TF motifs. Our data suggest p53 has the flexibility to cooperate with a variety of transcription factors in order to regulate CRE activity. By utilizing different sets of co-factors across CREs, we hypothesize that global p53 activity is guarded against loss of any one regulatory partner, allowing for dynamic and redundant control of p53-mediated transcription.


Sign in / Sign up

Export Citation Format

Share Document