scholarly journals Determinants of Transcription Initiation Directionality in Metazoans

2017 ◽  
Author(s):  
Mahmoud M. Ibrahim ◽  
Aslihan Karabacak ◽  
Alexander Glahs ◽  
Ena Kolundzic ◽  
Antje Hirsekorn ◽  
...  

AbstractDivergent transcription from promoters and enhancers is pervasive in many species, but it remains unclear if it is a general and passive feature of all eukaryotic cis regulatory elements. To address this, we define promoters and enhancers in C. elegans, D. melanogaster and H. sapiens using ATAC-Seq and investigate the determinants of their transcription initiation directionalities by analyzing genome-wide nascent, cap-selected, polymerase run-on assays. All three species initiate divergent transcription from separate core promoter sequences. Sequence asymmetry downstream of forward and reverse initiation sites, known to be important for termination and stability in H. sapiens, is unique in each species. Chromatin states of divergent promoters are not entirely conserved, but in all three species, the levels of histone modifications on the +1 nucleosome are independent from those on the -1 nucleosome, arguing for independent initiation events. This is supported by an integrative model of H3K4me3 levels and core promoter sequence that is highly predictive of promoter directionality and of two types of promoters: those with balanced initiation directionality and those with skewed directionality. Lastly, D. melanogaster enhancers display variation in chromatin architecture depending on enhancer location, and D. melanogaster promoter regions with dual enhancer/promoter potential are enriched for divergent transcription. Our results point to a high degree of variation in regulatory element transcription initiation directionality within and between metazoans, and to non-passive regulatory mechanisms of transcription initiation directionality in those species.

2017 ◽  
Author(s):  
Sarah Rennie ◽  
Maria Dalby ◽  
Marta Lloret-Llinares ◽  
Stylianos Bakoulis ◽  
Christian Dalager Vaagensø ◽  
...  

ABSTRACTMammalian gene promoters and enhancers share many properties. They are composed of a unified promoter architecture of divergent transcripton initiation and gene promoters may exhibit enhancer function. However, it is currently unclear how expression strength of a regulatory element relates to its enhancer strength and if the unifying architecture is conserved across Metazoa. Here we investigate the transcription initiation landscape and its associated RNA decay in D. melanogaster. Surprisingly, we find that the majority of active gene-distal enhancers and a considerable fraction of gene promoters are divergently transcribed. We observe quantitative relationships between enhancer potential, expression level and core promoter strength, providing an explanation for indirectly related histone modifications that are reflecting expression levels. Lowly abundant unstable RNAs initiated from weak core promoters are key characteristics of gene-distal developmental enhancers, while the housekeeping enhancer strengths of gene promoters reflect their expression strengths. The different layers of regulation mediated by gene-distal enhancers and gene promoters are also reflected in chromatin interaction data. Our results suggest a unified promoter architecture of many D. melanogaster regulatory elements, that is universal across Metazoa, whose regulatory functions seem to be related to their core promoter elements.


Animals ◽  
2019 ◽  
Vol 9 (10) ◽  
pp. 816
Author(s):  
Shuang Yang ◽  
Yan Deng ◽  
Da Chen ◽  
Shenqiang Hu ◽  
Yingying Zhang ◽  
...  

Anti-Müllerian hormone (AMH) is recognized as a reliable marker of ovarian reserve. However, the regulatory mechanism of goose AMH gene remains poorly understood. In the present study, both the full-length coding sequence (CDS) and promoter sequence of goose AMH have been cloned. Its CDS consisted of 2013 nucleotides encoding 670 amino acids and the amino acid sequence contained two structural domain: AMH-N and transforming growth factor beta (TGF-β) domain. The obtained promoter sequence spanned from the −2386 bp to its transcription start site (ATG). Core promoter regions and regulatory elements were identified as well as transcription factors were predicted in its promoter sequence. The luciferase activity was the highest spanning from the −331 to −1 bp by constructing deletion promoter reporter vectors. In CHO cells, the luciferase activity significantly increased by co-expression of AMH and GATA binding protein 4 (GATA-4), while that significantly decreased by mutating the binding sites of GATA-4 located in the −778 and −1477 bp. Results from quantitative real-time polymerase chain reaction (qPCR) indicated that levels of AMH mRNA in geese granulosa layers decreased gradually with the increasing follicular diameter. Taken together, it could be concluded that the transcriptional activity of AMH was activated by GATA-4 to inhibit the development of small follicles in goose.


2021 ◽  
Author(s):  
Dohoon Lee ◽  
Jeewon Yang ◽  
Sun Kim

The quantitative characterization of the transcriptional control by histone modifications (HMs) has been challenged by many computational studies, but still most of them exploit only partial aspects of intricate mechanisms involved in gene regulation, leaving a room for improvement. We present Chromoformer, a new transformer-based deep learning architecture that achieves the state-of-the-art performance in the quantitative deciphering of the histone codes of gene regulation. The core essence of Chromoformer architecture lies in the three variants of attention operation, each specialized to model individual hierarchy of three-dimensional (3D) transcriptional regulation including (1) histone codes at core promoters, (2) pairwise interaction between a core promoter and a distal cis-regulatory element mediated by 3D chromatin interactions, and (3) the collective effect of the pairwise cis-regulations. In-depth interpretation of the trained model behavior based on attention scores suggests that Chromoformer adaptively exploits the distant dependencies between HMs associated with transcription initiation and elongation. We also demonstrate that the quantitative kinetics of transcription factories and polycomb group bodies, in which the coordinated gene regulation occurs through spatial sequestration of genes with regulatory elements, can be captured by Chromoformer. Together, our study shows the great power of attention-based deep learning as a versatile modeling approach for the complex epigenetic landscape of gene regulation and highlights its potential as an effective toolkit that facilitates scientific discoveries in computational epigenetics.


2004 ◽  
Vol 186 (12) ◽  
pp. 3738-3748 ◽  
Author(s):  
Jinxin Gao ◽  
George C. Stewart

ABSTRACT Staphylococcal protein A (Spa) is an important virulence factor of Staphylococcus aureus. Transcription of the spa determinant occurs during the exponential growth phase and is repressed when the cells enter the postexponential growth phase. Regulation of spa expression has been found to be complicated, with regulation involving multiple factors, including Agr, SarA, SarS, SarT, Rot, and MgrA. Our understanding of how these factors work on the spa promoter to regulate spa expression is incomplete. To identify regulatory sites within the spa promoter, analysis of deletion derivatives of the promoter in host strains deficient in one or more of the regulatory factors was undertaken, and several critical features of spa regulation were revealed. The transcriptional start sites of spa were determined by primer extension. The spa promoter sequences were subcloned in front of a promoterless chloramphenicol acetyltransferase reporter gene. Various lengths of spa truncations with the same 3′ end were constructed, and the resultant plasmids were transduced into strains with different regulatory genetic backgrounds. Our results identified upstream promoter sequences necessary for Agr system regulation of spa expression. The cis elements for SarS activity, an activator of spa expression, and for SarA activity, a repressor of spa expression, were identified. The well-characterized SarA consensus sequence on the spa promoter was found to be insufficient for SarA repression of the spa promoter. Full repression required the presence of a second consensus site adjacent to the SarS binding site. Sequences directly upstream of the core promoter sequence were found to stimulate transcription.


2016 ◽  
Author(s):  
Jenna E Gallegos ◽  
Alan B Rose

AbstractIn diverse eukaryotes, certain introns increase mRNA accumulation through the poorly understood mechanism of intron-mediated enhancement (IME). A distinguishing feature of IME is that these introns have no effect from upstream or more than 1 Kb downstream of the transcription start site (TSS). To more precisely define the intron position requirements for IME in Arabidopsis, we tested the effect of the UBQ10 intron on gene expression from 6 different positions surrounding the TSS of a TRP1:GUS fusion. The intron strongly increased expression from all transcribed positions, but had no effect when 204 nt or more upstream of the 5’-most TSS. When the intron was located in the 5’ UTR, the TSS unexpectedly changed, resulting in longer transcripts. Remarkably, deleting 303 nt of the core promoter, including all known TSS’s and all but 18 nt of the 5’ UTR, had virtually no effect on the level of gene expression as long as a stimulating intron was included in the gene. When the core promoter was deleted, transcription initiated in normally untranscribed sequences the same distance upstream of the intron as when the promoter was intact. Together, these results suggest that certain introns play unexpectedly large roles in directing transcription initiation and represent a previously unrecognized type of downstream regulatory elements for genes transcribed by RNA polymerase II. This study also demonstrates considerable flexibility in the sequences surrounding the TSS, indicating that the TSS is not determined by promoter sequences alone. These findings are relevant in practical applications where introns are used to increase gene expression and contribute to our general understanding of gene structure and regulation in eukaryotes.


1991 ◽  
Vol 11 (2) ◽  
pp. 641-654
Author(s):  
C Hinkley ◽  
M Perry

Xenopus oocytes, arrested in G2 before the first meiotic division, accumulate histone mRNA and protein in the absence of chromosomal DNA replication and therefore represent an attractive biological system in which to examine histone gene expression uncoupled from the cell cycle. Previous studies have shown that sequences necessary for maximal levels of transcription in oocytes are present within 200 bp at the 5' end of the transcription initiation site for genes encoding each of the five major Xenopus histone classes. We have defined by site-directed mutagenesis individual regulatory sequences and characterized DNA-binding proteins required for histone H2B gene transcription in injected oocytes. The Xenopus H2B gene has a relatively simple promoter containing several transcriptional regulatory elements, including TFIID, CBP, and ATF/CREB binding sites, required for maximal transcription. A sequence (CTTTACAT) in the H2B promoter resembling the conserved octamer motif (ATTTGCAT), the target for cell-cycle regulation of a human H2B gene, is not required for transcription in oocytes. Nonetheless, substitution of a consensus octamer motif for the variant octamer element activates H2B transcription. Oocyte factors, presumably including the ubiquitous Oct-1 factor, specifically bind to the consensus octamer motif but not to the variant sequence. Our results demonstrate that a transcriptional regulatory element involved in lymphoid-specific expression of immunoglobulin genes and in S-phase-specific activation of mammalian H2B histone genes can activate transcription in nondividing amphibian oocytes.


1988 ◽  
Vol 8 (7) ◽  
pp. 2896-2909 ◽  
Author(s):  
E A Sternberg ◽  
G Spizz ◽  
W M Perry ◽  
D Vizard ◽  
T Weil ◽  
...  

Terminal differentiation of skeletal myoblasts is accompanied by induction of a series of tissue-specific gene products, which includes the muscle isoenzyme of creatine kinase (MCK). To begin to define the sequences and signals involved in MCK regulation in developing muscle cells, the mouse MCK gene has been isolated. Sequence analysis of 4,147 bases of DNA surrounding the transcription initiation site revealed several interesting structural features, some of which are common to other muscle-specific genes and to cellular and viral enhancers. To test for sequences required for regulated expression, a region upstream of the MCK gene from -4800 to +1 base pairs, relative to the transcription initiation site, was linked to the coding sequences of the bacterial chloramphenicol acetyltransferase (CAT) gene. Introduction of this MCK-CAT fusion gene into C2 muscle cells resulted in high-level expression of CAT activity in differentiated myotubes and no detectable expression in proliferating undifferentiated myoblasts or in nonmyogenic cell lines. Deletion mutagenesis of sequences between -4800 and the transcription start site showed that the region between -1351 and -1050 was sufficient to confer cell type-specific and developmentally regulated expression on the MCK promoter. This upstream regulatory element functioned independently of position, orientation, or distance from the promoter and therefore exhibited the properties of a classical enhancer. This upstream enhancer also was able to confer muscle-specific regulation on the simian virus 40 promoter, although it exhibited a 3- to 5-fold preference for its own promoter. In contrast to the cell type- and differentiation-specific expression of the upstream enhancer, the MCK promoter was able to function in myoblasts and myotubes and in nonmyogenic cell lines when combined with the simian virus 40 enhancer. An additional positive regulatory element was identified within the first intron of the MCK gene. Like the upstream enhancer, this intragenic element functioned independently of position, orientation, and distance with respect to the MCK promoter and was active in differentiated myotubes but not in myoblasts. These results demonstrate that expression of the MCK gene in developing muscle cells is controlled by complex interactions among multiple upstream and intragenic regulatory elements that are functional only in the appropriate cellular context.


2003 ◽  
Vol 370 (3) ◽  
pp. 771-784 ◽  
Author(s):  
Cristina PÉREZ-GÓMEZ ◽  
José M. MATÉS ◽  
Pedro M. GÓMEZ-FABRE ◽  
Antonio del CASTILLO-OLIVARES ◽  
Francisco J. ALONSO ◽  
...  

In mammals, glutaminase (GA) is expressed in most tissues, but the regulation of organ-specific expression is largely unknown. Therefore, as an essential step towards studying the regulation of GA expression, the human liver-type GA (hLGA) gene has been characterized. LGA genomic sequences were isolated using the genome walking technique. Analysis and comparison of these sequences with two LGA cDNA clones and the Human Genome Project database, allowed the determination of the genomic organization of the LGA gene. The gene has 18 exons and is approx. 18kb long. All exon/intron junction sequences conform to the GT/AG rule. Progressive deletion analysis of LGA promoter—luciferase constructs indicated that the core promoter is located between nt −141 and +410, with several potential regulatory elements: CAAT, GC, TATA-like, Ras-responsive element binding protein and specificity protein 1 (Sp1) sites. The minimal promoter was mapped within +107 and +410, where only an Sp1 binding site is present. Mutation experiments suggested that two CAAT recognition elements near the transcription-initiation site (-138 and −87), play a crucial role for optimal promoter activity. Electrophoretic mobility-shift assays confirmed the importance of CAAT- and TATA-like boxes to enhance basal transcription, and demonstrated that HNF-1 motif is a significant distal element for transcriptional regulation of the hLGA gene.


2018 ◽  
Author(s):  
Ni Huang ◽  
Wei Qiang Seow ◽  
Julie Ahringer

AbstractInteractions between cis-regulatory elements such as promoters and enhancers are important for transcription but global identification of these interactions remains a major challenge. Leveraging the chromatin accessiblity of regulatory elements, we developed ARC-C (accessible region chromosome conformation capture), which profiles chromatin regulatory interactions genome-wide at high resolution. Applying ARC-C to C. elegans, we identify ~15,000 significant interactions at 500bp resolution. Regions bound by transcription factors and chromatin regulators such as cohesin and condensin II are enriched for interactions, and we use ARC-C to show that the BLMP-1 transcription factor mediates interactions between its targets. Investigating domain level architecture, we find that C. elegans chromatin domains defined by either active or repressive modifications form topologically associating domains (TADs) and that these domains interact to form A/B (active/inactive) compartment structure. ARC-C is a powerful new tool to interrogate genome architecture and regulatory interactions at high resolution.


2021 ◽  
Author(s):  
Li Ziyun ◽  
Li Biao ◽  
Zhao yichen ◽  
Zhao Degang

Abstract The lignans of Eucommia ulmoides have been extensively studied and shown to have a dual mechanism of regulating blood pressure. Studies have shown that DIR1 is a key gene in the biosynthetic pathway of lignans in Eucommia ulmoides Oliv. In this study, a 2000 bp upstream promoter sequence was cloned, and part of the sequence (1495 bp) and its 5'-end truncated segment were constructed into the pCAMBIA1391Z expression plasmid upstream of ß-glucuronidase (GUS). Agrobacterium-mediated genetic transformation produced stable transgenic tobacco lines. The results showed that although both full-length and truncated promoters could initiate GUS expression, their levels were influenced by the degree of deletion at the 5' end. GUS histochemical staining showed that the core promoter region was located in the region containing the transcription initiation site (TIS) within 212 bp. In addition, the DIR1 promoter responded to environmental and hormonal stressors, such as jasmonic acid (MeJA), abscisic acid (ABA), D-mannitol (drought mimic), and high concentrations of NaCl. In transgenic tobacco seedlings, MeJA, D-mannitol, and ABA could activate the DIR1 promoter, whereas high concentrations of NaCl could inhibit it. In E. ulmoides Oliv seedlings, MeJA, NaCl, and D-mannitol activated the DIR1 promoter, whereas ABA had an inhibitory effect. In summary, our findings provide a theoretical basis for the use of the DIR1 promoter in plant genetic engineering, indicating its potential. Our study also presents novel insights for lignan biosynthesis and sheds light on the mechanisms of E. ulmoides Oliv in response to stress.


Sign in / Sign up

Export Citation Format

Share Document