scholarly journals WAPL maintains dynamic cohesin to preserve lineage specific distal gene regulation

2019 ◽  
Author(s):  
Ning Qing Liu ◽  
Michela Maresca ◽  
Teun van den Brand ◽  
Luca Braccioli ◽  
Marijne M.G.A. Schijns ◽  
...  

SUMMARYThe cohesin complex plays essential roles in sister chromatin cohesin, chromosome organization and gene expression. The role of cohesin in gene regulation is incompletely understood. Here, we report that the cohesin release factor WAPL is crucial for maintaining a pool of dynamic cohesin bound to regions that are associated with lineage specific genes in mouse embryonic stem cells. These regulatory regions are enriched for active enhancer marks and transcription factor binding sites, but largely devoid of CTCF binding sites. Stabilization of cohesin, which leads to a loss of dynamic cohesin from these regions, does not affect transcription factor binding or active enhancer marks, but does result in changes in promoter-enhancer interactions and downregulation of genes. Acute cohesin depletion can phenocopy the effect of WAPL depletion, showing that cohesin plays a crucial role in maintaining expression of lineage specific genes. The binding of dynamic cohesin to chromatin is dependent on the pluripotency transcription factor OCT4, but not NANOG. Finally, dynamic cohesin binding sites are also found in differentiated cells, suggesting that they represent a general regulatory principle. We propose that cohesin dynamically binding to regulatory sites creates a favorable spatial environment in which promoters and enhancers can communicate to ensure proper gene expression.HIGHLIGHTSThe cohesin release factor WAPL is crucial for maintaining a pluripotency-specific phenotype.Dynamic cohesin is enriched at lineage specific loci and overlaps with binding sites of pluripotency transcription factors.Expression of lineage specific genes is maintained by dynamic cohesin binding through the formation of promoter-enhancer associated self-interaction domains.CTCF-independent cohesin binding to chromatin is controlled by the pioneer factor OCT4.

Genes ◽  
2018 ◽  
Vol 9 (9) ◽  
pp. 446 ◽  
Author(s):  
Shijie Xin ◽  
Xiaohui Wang ◽  
Guojun Dai ◽  
Jingjing Zhang ◽  
Tingting An ◽  
...  

The proinflammatory cytokine, interleukin-6 (IL-6), plays a critical role in many chronic inflammatory diseases, particularly inflammatory bowel disease. To investigate the regulation of IL-6 gene expression at the molecular level, genomic DNA sequencing of Jinghai yellow chickens (Gallus gallus) was performed to detect single-nucleotide polymorphisms (SNPs) in the region −2200 base pairs (bp) upstream to 500 bp downstream of IL-6. Transcription factor binding sites and CpG islands in the IL-6 promoter region were predicted using bioinformatics software. Twenty-eight SNP sites were identified in IL-6. Four of these 28 SNPs, three [−357 (G > A), −447 (C > G), and −663 (A > G)] in the 5′ regulatory region and one in the 3′ non-coding region [3177 (C > T)] are not labelled in GenBank. Bioinformatics analysis revealed 11 SNPs within the promoter region that altered putative transcription factor binding sites. Furthermore, the C-939G mutation in the promoter region may change the number of CpG islands, and SNPs in the 5′ regulatory region may influence IL-6 gene expression by altering transcription factor binding or CpG methylation status. Genetic diversity analysis revealed that the newly discovered A-663G site significantly deviated from Hardy-Weinberg equilibrium. These results provide a basis for further exploration of the promoter function of the IL-6 gene and the relationships of these SNPs to intestinal inflammation resistance in chickens.


PLoS Genetics ◽  
2020 ◽  
Vol 16 (11) ◽  
pp. e1009189
Author(s):  
Alejandro Martin-Trujillo ◽  
Nihir Patel ◽  
Felix Richter ◽  
Bharati Jadhav ◽  
Paras Garg ◽  
...  

Although DNA methylation is the best characterized epigenetic mark, the mechanism by which it is targeted to specific regions in the genome remains unclear. Recent studies have revealed that local DNA methylation profiles might be dictated by cis-regulatory DNA sequences that mainly operate via DNA-binding factors. Consistent with this finding, we have recently shown that disruption of CTCF-binding sites by rare single nucleotide variants (SNVs) can underlie cis-linked DNA methylation changes in patients with congenital anomalies. These data raise the hypothesis that rare genetic variation at transcription factor binding sites (TFBSs) might contribute to local DNA methylation patterning. In this work, by combining blood genome-wide DNA methylation profiles, whole genome sequencing-derived SNVs from 247 unrelated individuals along with 133 predicted TFBS motifs derived from ENCODE ChIP-Seq data, we observed an association between the disruption of binding sites for multiple TFs by rare SNVs and extreme DNA methylation values at both local and, to a lesser extent, distant CpGs. While the majority of these changes affected only single CpGs, 24% were associated with multiple outlier CpGs within ±1kb of the disrupted TFBS. Interestingly, disruption of functionally constrained sites within TF motifs lead to larger DNA methylation changes at nearby CpG sites. Altogether, these findings suggest that rare SNVs at TFBS negatively influence TF-DNA binding, which can lead to an altered local DNA methylation profile. Furthermore, subsequent integration of DNA methylation and RNA-Seq profiles from cardiac tissues enabled us to observe an association between rare SNV-directed DNA methylation and outlier expression of nearby genes. In conclusion, our findings not only provide insights into the effect of rare genetic variation at TFBS on shaping local DNA methylation and its consequences on genome regulation, but also provide a rationale to incorporate DNA methylation data to interpret the functional role of rare variants.


BMC Biology ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Lianggang Huang ◽  
Xuejie Li ◽  
Liangbo Dong ◽  
Bin Wang ◽  
Li Pan

Abstract Background The identification of open chromatin regions and transcription factor binding sites (TFBs) is an important step in understanding the regulation of gene expression in diverse species. ATAC-seq is a technique used for such purpose by providing high-resolution measurements of chromatin accessibility revealed through integration of Tn5 transposase. However, the existence of cell walls in filamentous fungi and associated difficulty in purifying nuclei have precluded the routine application of this technique, leading to a lack of experimentally determined and computationally inferred data on the identity of genome-wide cis-regulatory elements (CREs) and TFBs. In this study, we constructed an ATAC-seq platform suitable for filamentous fungi and generated ATAC-seq libraries of Aspergillus niger and Aspergillus oryzae grown under a variety of conditions. Results We applied the ATAC-seq assay for filamentous fungi to delineate the syntenic orthologue and differentially changed chromatin accessibility regions among different Aspergillus species, during different culture conditions, and among specific TF-deleted strains. The syntenic orthologues of accessible regions were responsible for the conservative functions across Aspergillus species, while regions differentially changed between culture conditions and TFs mutants drove differential gene expression programs. Importantly, we suggest criteria to determine TFBs through the analysis of unbalanced cleavage of distinct TF-bound DNA strands by Tn5 transposase. Based on this criterion, we constructed data libraries of the in vivo genomic footprint of A. niger under distinct conditions, and generated a database of novel transcription factor binding motifs through comparison of footprints in TF-deleted strains. Furthermore, we validated the novel TFBs in vivo through an artificial synthetic minimal promoter system. Conclusions We characterized the chromatin accessibility regions of filamentous fungi species, and identified a complete TFBs map by ATAC-seq, which provides valuable data for future analyses of transcriptional regulation in filamentous fungi.


2017 ◽  
Author(s):  
Ella Preger-Ben Noon ◽  
Gonzalo Sabarís ◽  
Daniela Ortiz ◽  
Jonathan Sager ◽  
Anna Liebowitz ◽  
...  

AbstractDevelopmental genes can have complex c/s-regulatory regions, with multiple enhancers scattered across stretches of DNA spanning tens or hundreds of kilobases. Early work revealed remarkable modularity of enhancers, where distinct regions of DNA, bound by combinations of transcription factors, drive gene expression in defined spatio-temporal domains. Nevertheless, a few reports have shown that enhancer function may be required in multiple developmental stages, implying that regulatory elements can be pleiotropic. In these cases, it is not clear whether the pleiotropic enhancers employ the same transcription factor binding sites to drive expression at multiple developmental stages or whether enhancers function as chromatin scaffolds, where independent sets of transcription factor binding sites act at different stages. In this work we have studied the activity of the enhancers of the shavenbaby gene throughout D. melanogaster development. We found that all seven shavenbaby enhancers drive gene expression in multiple tissues and developmental stages at varying levels of redundancy. We have explored how this pleiotropy is encoded in two of these enhancers. In one enhancer, the same transcription factor binding sites contribute to embryonic and pupal expression, whereas for a second enhancer, these roles are largely encoded by distinct transcription factor binding sites. Our data suggest that enhancer pleiotropy might be a common feature of c/s-regulatory regions of developmental genes and that this pleiotropy can be encoded through multiple genetic architectures.


2005 ◽  
Vol 03 (02) ◽  
pp. 281-301 ◽  
Author(s):  
PATRICK C. H. MA ◽  
KEITH C. C. CHAN ◽  
DAVID K. Y. CHIU

The combined interpretation of gene expression data and gene sequences is important for the investigation of the intricate relationships of gene expression at the transcription level. The expression data produced by microarray hybridization experiments can lead to the identification of clusters of co-expressed genes that are likely co-regulated by the same regulatory mechanisms. By analyzing the promoter regions of co-expressed genes, the common regulatory patterns characterized by transcription factor binding sites can be revealed. Many clustering algorithms have been used to uncover inherent clusters in gene expression data. In this paper, based on experiments using simulated and real data, we show that the performance of these algorithms could be further improved. For the clustering of expression data typically characterized by a lot of noise, we propose to use a two-phase clustering algorithm consisting of an initial clustering phase and a second re-clustering phase. The proposed algorithm has several desirable features: (i) it utilizes both local and global information by computing both a "local" pairwise distance between two gene expression profiles in Phase 1 and a "global" probabilistic measure of interestingness of cluster patterns in Phase 2, (ii) it distinguishes between relevant and irrelevant expression values when performing re-clustering, and (iii) it makes explicit the patterns discovered in each cluster for possible interpretations. Experimental results show that the proposed algorithm can be an effective algorithm for discovering clusters in the presence of very noisy data. The patterns that are discovered in each cluster are found to be meaningful and statistically significant, and cannot otherwise be easily discovered. Based on these discovered patterns, genes co-expressed under the same experimental conditions and range of expression levels have been identified and evaluated. When identifying regulatory patterns at the promoter regions of the co-expressed genes, we also discovered well-known transcription factor binding sites in them. These binding sites can provide explanations for the co-expressed patterns.


2021 ◽  
Vol 25 (1) ◽  
pp. 18-29
Author(s):  
E. V. Ignatieva ◽  
E. A. Matrosova

Whole genome and whole exome sequencing technologies play a very important role in the studies of the genetic aspects of the pathogenesis of various diseases. The ample use of genome-wide and exome-wide association study methodology (GWAS and EWAS) made it possible to identify a large number of genetic variants associated with diseases. This information is accumulated in the databases like GWAS central, GWAS catalog, OMIM, ClinVar, etc. Most of the variants identified by the GWAS technique are located in the noncoding regions of the human genome. According to the ENCODE project, the fraction of regions in the human genome potentially involved in transcriptional control is many times greater than the fraction of coding regions. Thus, genetic variation in noncoding regions of the genome can increase the susceptibility to diseases by disrupting various regulatory elements (promoters, enhancers, silencers, insulator regions, etc.). However, identification of the mechanisms of influence of pathogenic genetic variants on the diseases risk is difficult due to a wide variety of regulatory elements. The present review focuses on the molecular genetic mechanisms by which pathogenic genetic variants affect gene expression. At the same time, attention is concentrated on the transcriptional level of regulation as an initial step in the expression of any gene. A triggering event mediating the effect of a pathogenic genetic variant on the level of gene expression can be, for example, a change in the functional activity of transcription factor binding sites (TFBSs) or DNA methylation change, which, in turn, affects the functional activity of promoters or enhancers. Dissecting the regulatory roles of polymorphic loci have been impossible without close integration of modern experimental approaches with computer analysis of a growing wealth of genetic and biological data obtained using omics technologies. The review provides a brief description of a number of the most well-known public genomic information resources containing data obtained using omics technologies, including (1) resources that accumulate data on the chromatin states and the regions of transcription factor binding derived from ChIP-seq experiments; (2) resources containing data on genomic loci, for which allele-specific transcription factor binding was revealed based on ChIP-seq technology; (3) resources containing in silico predicted data on the potential impact of genetic variants on the transcription factor binding sites.


Sign in / Sign up

Export Citation Format

Share Document