scholarly journals Benchmark and integration of resources for the estimation of human transcription factor activities

2018 ◽  
Author(s):  
Luz Garcia-Alonso ◽  
Mahmoud M Ibrahim ◽  
Denes Turei ◽  
Julio Saez-Rodriguez

ABSTRACTPrediction of transcription factor (TF) activities from the gene expression of their targets (i.e. TF regulon) is becoming a widely-used approach to characterize the functional status of transcriptional regulatory circuits. Several strategies and datasets have been proposed to link the target genes likely regulated by a TF, each one providing a different level of evidence. The most established ones are: (i) manually curated repositories, (ii) interactions derived from ChIP-seq binding data, (iii) in silico prediction of TF binding on gene promoters, and (iv) reverse-engineered regulons from large gene expression datasets. However, it is not known how these different sources of regulons affect the TF activity estimations, and thereby downstream analysis and interpretation. Here we compared the accuracy and biases of these strategies to define human TF regulons by means of their ability to predict changes in TF activities in three reference benchmark datasets. We assembled a collection of TF-target interactions among 1,541 TFs, and evaluated how the different molecular and regulatory properties of the TFs, such as the DNA-binding domain, specificities or mode of interaction with the chromatin, affect the predictions of TF activity changes. We assessed their coverage and found little overlap on the regulons derived from each strategy and better performance by literature-curated information followed by ChIP-seq data. We provide an integrated resource of all TF-target interactions derived through these strategies with a confidence score, as a resource for enhanced prediction of TF activities.

2016 ◽  
Author(s):  
Ian K Quigley ◽  
Chris Kintner

AbstractCooperative transcription factor binding at cis-regulatory sites in the genome drives robust eukaryotic gene expression, and many such sites must be coordinated to produce coherent transcriptional programs. The transcriptional program leading to motile cilia formation requires members of the DNA-binding forkhead (Fox) and Rfx transcription factor families and these factors co-localize to cilia gene promoters, but it is not clear how many cilia genes are regulated by these two factors, whether these factors act directly or indirectly, or how these factors act with specificity in the context of a 3-dimensional genome. Here, we use genome-wide approaches to show that cilia genes reside at the boundaries of topological domains and that these areas have low enhancer density. We show that the transcription factors Foxj1 and Rfx2 binding occurs in the promoters of more cilia genes than other known cilia transcription factors and that while Rfx2 binds directly to promoters and enhancers equally, Foxj1 prefers direct binding to enhancers and is stabilized at promoters by Rfx2. Finally, we show that Rfx2 and Foxj1 lie at the anchor endpoints of chromatin loops, suggesting that target genes are activated when Foxj1 bound at distal sites is recruited via a loop created by Rfx2 binding at both sites. We speculate that the primary function of Rfx2 is to stabilize distal enhancers with proximal promoters by operating as a scaffolding factor, bringing key regulatory domains bound by Foxj1 into close physical proximity and enabling coordinated cilia gene expression.Author SummaryThe multiciliated cell extends hundreds of motile cilia to produce fluid flow in the airways and other organ systems. The formation of this specialized cell type requires the coordinated expression of hundreds of genes in order to produce all the protein parts motile cilia require. While a relatively small number of transcription factors has been identified that promote gene expression during multiciliate cell differentiation, it is not clear how they work together to coordinate the expression of genes required for multiple motile ciliation. Here, we show that two transcription factors known to drive cilia formation, Foxj1 and Rfx2, play complementary roles wherein Foxj1 activates target genes but tends not to bind near them in the genome, whereas Rfx2 can’t activate target genes by itself but instead acts as a scaffold by localizing Foxj1 to the proper targets. These results suggest not only a mechanism by which complex gene expression is coordinated in multiciliated cells, but also how transcriptional programs in general could be modular and deployed across different cellular contexts with the same basic promoter configuration.


Author(s):  
Helen Ray-Jones ◽  
Mikhail Spivakov

AbstractTranscriptional enhancers play a key role in the initiation and maintenance of gene expression programmes, particularly in metazoa. How these elements control their target genes in the right place and time is one of the most pertinent questions in functional genomics, with wide implications for most areas of biology. Here, we synthesise classic and recent evidence on the regulatory logic of enhancers, including the principles of enhancer organisation, factors that facilitate and delimit enhancer–promoter communication, and the joint effects of multiple enhancers. We show how modern approaches building on classic insights have begun to unravel the complexity of enhancer–promoter relationships, paving the way towards a quantitative understanding of gene control.


2019 ◽  
Author(s):  
Joanna Mitchelmore ◽  
Nastasiya Grinberg ◽  
Chris Wallace ◽  
Mikhail Spivakov

AbstractIdentifying DNA cis-regulatory modules (CRMs) that control the expression of specific genes is crucial for deciphering the logic of transcriptional control. Natural genetic variation can point to the possible gene regulatory function of specific sequences through their allelic associations with gene expression. However, comprehensive identification of causal regulatory sequences in brute-force association testing without incorporating prior knowledge is challenging due to limited statistical power and effects of linkage disequilibrium. Sequence variants affecting transcription factor (TF) binding at CRMs have a strong potential to influence gene regulatory function, which provides a motivation for prioritising such variants in association testing. Here, we generate an atlas of CRMs showing predicted allelic variation in TF binding affinity in human lymphoblastoid cell lines (LCLs) and test their association with the expression of their putative target genes inferred from Promoter Capture Hi-C and immediate linear proximity. We reveal over 1300 CRM TF-binding variants associated with target gene expression, the majority of them undetected with standard association testing. A large proportion of CRMs showing associations with the expression of genes they contact in 3D localise to the promoter regions of other genes, supporting the notion of ‘epromoters’: dual-action CRMs with promoter and distal enhancer activity.


F1000Research ◽  
2018 ◽  
Vol 7 ◽  
pp. 1459
Author(s):  
Shalem Raju Modi ◽  
Tarja Kokkola

GR24 is a synthetic strigolactone analog, demonstrated to regulate the development of plants and arbuscular mycorrhizal fungi. GR24 possesses anti-cancer and anti-apoptotic properties, enhances insulin sensitivity and mitochondrial biogenesis in skeletal myotubes, inhibits adipogenesis, decreases inflammation in adipocytes and macrophages and downregulates the expression of hepatic gluconeogenic enzymes. Transcription factor Nrf2 (Nuclear factor (erythroid-derived 2)-like 2) is a master regulator of antioxidant response, regulating a multitude of genes involved in cellular stress responses and anti-inflammatory pathways, thus maintaining cellular redox homeostasis. Nrf2 activation reduces the deleterious effects of mitochondrial toxins and has multiple roles in promoting mitochondrial function and dynamics. We studied the role of GR24 on gene expression in rat L6 skeletal muscle cells which were differentiated into myotubes. The myotubes were treated with GR24 and analyzed by microarray gene expression profiling. GR24 upregulated the cytoprotective transcription factor Nrf2 and its target genes, activating antioxidant defences, suggesting that GR24 may protect skeletal muscle from the toxic effects of oxidative stress.


2010 ◽  
Vol 9 (4) ◽  
pp. 514-531 ◽  
Author(s):  
Barbara Heise ◽  
Julia van der Felden ◽  
Sandra Kern ◽  
Mario Malcher ◽  
Stefan Brückner ◽  
...  

ABSTRACT In Saccharomyces cerevisiae, the TEA transcription factor Tec1 is known to regulate target genes together with a second transcription factor, Ste12. Tec1-Ste12 complexes can activate transcription through Tec1 binding sites (TCSs), which can be further combined with Ste12 binding sites (PREs) for cooperative DNA binding. However, previous studies have hinted that Tec1 might regulate transcription also without Ste12. Here, we show that in vivo, physiological amounts of Tec1 are sufficient to stimulate TCS-mediated gene expression and transcription of the FLO11 gene in the absence of Ste12. In vitro, Tec1 is able to bind TCS elements with high affinity and specificity without Ste12. Furthermore, Tec1 contains a C-terminal transcriptional activation domain that confers Ste12-independent activation of TCS-regulated gene expression. On a genome-wide scale, we identified 302 Tec1 target genes that constitute two distinct classes. A first class of 254 genes is regulated by Tec1 in a Ste12-dependent manner and is enriched for genes that are bound by Tec1 and Ste12 in vivo. In contrast, a second class of 48 genes can be regulated by Tec1 independently of Ste12 and is enriched for genes that are bound by the stress transcription factors Yap6, Nrg1, Cin5, Skn7, Hsf1, and Msn4. Finally, we find that combinatorial control by Tec1-Ste12 complexes stabilizes Tec1 against degradation. Our study suggests that Tec1 is able to regulate TCS-mediated gene expression by Ste12-dependent and Ste12-independent mechanisms that enable promoter-specific transcriptional control.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 372 ◽  
Author(s):  
Delasa Aghamirzaie ◽  
Karthik Raja Velmurugan ◽  
Shuchi Wu ◽  
Doaa Altarawy ◽  
Lenwood S. Heath ◽  
...  

Motivation: The increasing availability of chromatin immunoprecipitation sequencing (ChIP-Seq) data enables us to learn more about the action of transcription factors in the regulation of gene expression. Even though in vivo transcriptional regulation often involves the concerted action of more than one transcription factor, the format of each individual ChIP-Seq dataset usually represents the action of a single transcription factor. Therefore, a relational database in which available ChIP-Seq datasets are curated is essential. Results: We present Expresso (database and webserver) as a tool for the collection and integration of available Arabidopsis ChIP-Seq peak data, which in turn can be linked to a user’s gene expression data. Known target genes of transcription factors were identified by motif analysis of publicly available GEO ChIP-Seq data sets. Expresso currently provides three services: 1) Identification of target genes of a given transcription factor; 2) Identification of transcription factors that regulate a gene of interest; 3) Computation of correlation between the gene expression of transcription factors and their target genes. Availability: Expresso is freely available at http://bioinformatics.cs.vt.edu/expresso/


Author(s):  
Liis Kolberg ◽  
Nurlan Kerimov ◽  
Hedi Peterson ◽  
Kaur Alasoo

AbstractBackgroundDeveloping novel therapies for complex disease requires better understanding of the causal processes that contribute to disease onset and progression. Although trans-acting gene expression quantitative trait loci (trans-eQTLs) can be a powerful approach to directly reveal cellular processes modulated by disease variants, detecting trans-eQTLs remains challenging due to their small effect sizes and large number of genes tested. However, if a single trans-eQTL controls a group of co-regulated genes, then multiple testing burden can be greatly reduced by summarising gene expression at the level of co-expression modules prior to trans-eQTL analysis.ResultsWe analysed gene expression and genotype data from six blood cell types from 226 to 710 individuals. We inferred gene co-expression modules with five methods on the full dataset, as well as in each cell type separately. We detected a number of established co-expression module trans-eQTLs, such as the monocyte-specific associations at the IFNB1 and LYZ loci, as well as a platelet-specific ARHGEF3 locus associated with mean platelet volume. We also discovered a novel trans association near the SLC39A8 gene in LPS-stimulated monocytes. Here, we linked an early-response cis-eQTL of the SLC39A8 gene to a module of co-expressed metallothionein genes upregulated more than 20 hours later and used motif analysis to identify zinc-induced activation of the MTF1 transcription factor as a likely mediator of this effect.ConclusionsOur analysis provides a rare detailed characterisation of a trans-eQTL effect cascade from a proximal cis effect to the affected signalling pathway, transcription factor, and target genes. This highlights how co-expression analysis combined with functional enrichment analysis can greatly improve the identification and prioritisation of trans-eQTLs when applied to emerging cell-type specific datasets.


2020 ◽  
Author(s):  
Maud Fagny ◽  
Marieke Lydia Kuijjer ◽  
Maike Stam ◽  
Johann Joets ◽  
Olivier Turc ◽  
...  

AbstractEnhancers are important regulators of gene expression during numerous crucial processes including tissue differentiation across development. In plants, their recent molecular characterization revealed their capacity to activate the expression of several target genes through the binding of transcription factors. Nevertheless, identifying these target genes at a genome-wide level remains a challenge, in particular in species with large genomes, where enhancers and target genes can be hundreds of kilobases away. Therefore, the contribution of enhancers to regulatory network is still poorly understood in plants. In this study, we investigate the enhancer-driven regulatory network of two maize tissues at different stages: leaves at seedling stage and husks (bracts) at flowering. Using a systems biology approach, we integrate genomic, epigenomic and transcriptomic data to model the regulatory relationship between transcription factors and their potential target genes. We identify regulatory modules specific to husk and V2-IST, and show that they are involved in distinct functions related to the biology of each tissue. We evidence enhancers exhibiting binding sites for two distinct transcription factor families (DOF and AP2/ERF) that drive the tissue-specificity of gene expression in seedling immature leaf and husk. Analysis of the corresponding enhancer sequences reveals that two different transposable element families (TIR transposon Mutator and MITE Pif/Harbinger) have shaped the regulatory network in each tissue, and that MITEs have provided new transcription factor binding sites that are involved in husk tissue-specificity.SignificanceEnhancers play a major role in regulating tissue-specific gene expression in higher eukaryotes, including angiosperms. While molecular characterization of enhancers has improved over the past years, identifying their target genes at the genome-wide scale remains challenging. Here, we integrate genomic, epigenomic and transcriptomic data to decipher the tissue-specific gene regulatory network controlled by enhancers at two different stages of maize leaf development. Using a systems biology approach, we identify transcription factor families regulating gene tissue-specific expression in husk and seedling leaves, and characterize the enhancers likely to be involved. We show that a large part of maize enhancers is derived from transposable elements, which can provide novel transcription factor binding sites crucial to the regulation of tissue-specific biological functions.


2020 ◽  
Author(s):  
Nadezda A. Fursova ◽  
Anne H. Turberfield ◽  
Neil P. Blackledge ◽  
Emma L. Findlater ◽  
Anna Lastuvkova ◽  
...  

AbstractHistone-modifying systems play fundamental roles in gene regulation and the development of multicellular organisms. Histone modifications that are enriched at gene regulatory elements have been heavily studied, but the function of modifications that are found more broadly throughout the genome remains poorly understood. This is exemplified by histone H2A mono-ubiquitylation (H2AK119ub1) which is enriched at Polycomb-repressed gene promoters, but also covers the genome at lower levels. Here, using inducible genetic perturbations and quantitative genomics, we discover that the BAP1 deubiquitylase plays an essential role in constraining H2AK119ub1 throughout the genome. Removal of BAP1 leads to pervasive accumulation of H2AK119ub1, which causes widespread reductions in gene expression. We show that elevated H2AK119ub1 represses gene expression by counteracting transcription initiation from gene regulatory elements, causing reductions in transcription-associated histone modifications. Furthermore, failure to constrain pervasive H2AK119ub1 compromises Polycomb complex occupancy at a subset of Polycomb target genes leading to their derepression, therefore explaining the original genetic characterisation of BAP1 as a Polycomb group gene. Together, these observations reveal that the transcriptional potential of the genome can be modulated by regulating the levels of a pervasive histone modification, without the need for elaborate gene-specific targeting mechanisms.


Sign in / Sign up

Export Citation Format

Share Document