scholarly journals Predicting CTCF-mediated chromatin interactions by integrating genomic and epigenomic features

2017 ◽  
Author(s):  
Yan Kai ◽  
Jaclyn Andricovich ◽  
Zhouhao Zeng ◽  
Jun Zhu ◽  
Alexandros Tzatsos ◽  
...  

AbstractThe CCCTC-binding zinc finger protein (CTCF)-mediated network of long-range chromatin interactions is important for genome organization and function. Although this network has been considered largely invariant, we found that it exhibits extensive cell-type-specific interactions that contribute to cell identity. Here we present Lollipop—a machine-learning framework—which predicts CTCF-mediated long-range interactions using genomic and epigenomic features. Using ChIA-PET data as benchmark, we demonstrated that Lollipop accurately predicts CTCF-mediated chromatin interactions both within and across cell-types, and outperforms other methods based only on CTCF motif orientation. Predictions were confirmed computationally and experimentally by Chromatin Conformation Capture (3C). Moreover, our approach reveals novel determinants of CTCF-mediated chromatin wiring, such as gene expression within the loops. Our study contributes to a better understanding about the underlying principles of CTCF-mediated chromatin interactions and their impact on gene expression.

2019 ◽  
Author(s):  
Tom Aharon Hait ◽  
Ran Elkon ◽  
Ron Shamir

AbstractSpatiotemporal gene expression patterns are governed to a large extent by enhancer elements, typically located distally from their target genes. Identification of enhancer-promoter (EP) links that are specific and functional in individual cell types is a key challenge in understanding gene regulation. We introduce CT-FOCS, a new statistical inference method that utilizes multiple replicates per cell type to infer cell type-specific EP links. Computationally predicted EP links are usually benchmarked against experimentally determined chromatin interactions measured by ChIA-PET and promoter-capture HiC techniques. We expand this validation scheme by using also loops that overlap in their anchor sites. In analyzing 1,366 samples from ENCODE, Roadmap epigenomics and FANTOM5, CT-FOCS inferred highly cell type-specific EP links more accurately than state-of-the-art methods. We illustrate how our inferred EP links drive cell type-specific gene expression and regulation.


Genes ◽  
2019 ◽  
Vol 10 (5) ◽  
pp. 377 ◽  
Author(s):  
Marta Zuzic ◽  
Jesus Eduardo Rojo Arias ◽  
Stefanie Gabriele Wohl ◽  
Volker Busskamp

The health and function of our visual system relies on accurate gene expression. While many genetic mutations are associated with visual impairment and blindness, we are just beginning to understand the complex interplay between gene regulation and retinal pathologies. MicroRNAs (miRNAs), a class of non-coding RNAs, are important regulators of gene expression that exert their function through post-transcriptional silencing of complementary mRNA targets. According to recent transcriptomic analyses, certain miRNA species are expressed in all retinal cell types, while others are cell type-specific. As miRNAs play important roles in homeostasis, cellular function, and survival of differentiated retinal cell types, their dysregulation is associated with retinal degenerative diseases. Thus, advancing our understanding of the genetic networks modulated by miRNAs is central to harnessing their potential as therapeutic agents to overcome visual impairment. In this review, we summarize the role of distinct miRNAs in specific retinal cell types, the current knowledge on their implication in inherited retinal disorders, and their potential as therapeutic agents.


2021 ◽  
Vol 18 (10) ◽  
pp. 1196-1203 ◽  
Author(s):  
Žiga Avsec ◽  
Vikram Agarwal ◽  
Daniel Visentin ◽  
Joseph R. Ledsam ◽  
Agnieszka Grabska-Barwinska ◽  
...  

AbstractHow noncoding DNA determines gene expression in different cell types is a major unsolved problem, and critical downstream applications in human genetics depend on improved solutions. Here, we report substantially improved gene expression prediction accuracy from DNA sequences through the use of a deep learning architecture, called Enformer, that is able to integrate information from long-range interactions (up to 100 kb away) in the genome. This improvement yielded more accurate variant effect predictions on gene expression for both natural genetic variants and saturation mutagenesis measured by massively parallel reporter assays. Furthermore, Enformer learned to predict enhancer–promoter interactions directly from the DNA sequence competitively with methods that take direct experimental data as input. We expect that these advances will enable more effective fine-mapping of human disease associations and provide a framework to interpret cis-regulatory evolution.


eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Timothy P O'Leary ◽  
Kaitlin E Sullivan ◽  
Lihua Wang ◽  
Jody Clements ◽  
Andrew L Lemire ◽  
...  

The basolateral amygdala complex (BLA), extensively connected with both local amygdalar nuclei as well as long-range circuits, is involved in a diverse array of functional roles. Understanding the mechanisms of such functional diversity will be greatly informed by understanding the cell-type-specific landscape of the BLA. Here, beginning with single-cell RNA sequencing, we identified both discrete and graded continuous gene-expression differences within the mouse BLA. Via in situ hybridization, we next mapped this discrete transcriptomic heterogeneity onto a sharp spatial border between the basal and lateral amygdala nuclei, and identified continuous spatial gene-expression gradients within each of these regions. These discrete and continuous spatial transformations of transcriptomic cell-type identity were recapitulated by local morphology as well as long-range connectivity. Thus, BLA excitatory neurons are a highly heterogenous collection of neurons that spatially covary in molecular, cellular, and circuit properties. This heterogeneity likely drives pronounced spatial variation in BLA computation and function.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
John A. Halsall ◽  
Simon Andrews ◽  
Felix Krueger ◽  
Charlotte E. Rutledge ◽  
Gabriella Ficz ◽  
...  

AbstractChromatin configuration influences gene expression in eukaryotes at multiple levels, from individual nucleosomes to chromatin domains several Mb long. Post-translational modifications (PTM) of core histones seem to be involved in chromatin structural transitions, but how remains unclear. To explore this, we used ChIP-seq and two cell types, HeLa and lymphoblastoid (LCL), to define how changes in chromatin packaging through the cell cycle influence the distributions of three transcription-associated histone modifications, H3K9ac, H3K4me3 and H3K27me3. We show that chromosome regions (bands) of 10–50 Mb, detectable by immunofluorescence microscopy of metaphase (M) chromosomes, are also present in G1 and G2. They comprise 1–5 Mb sub-bands that differ between HeLa and LCL but remain consistent through the cell cycle. The same sub-bands are defined by H3K9ac and H3K4me3, while H3K27me3 spreads more widely. We found little change between cell cycle phases, whether compared by 5 Kb rolling windows or when analysis was restricted to functional elements such as transcription start sites and topologically associating domains. Only a small number of genes showed cell-cycle related changes: at genes encoding proteins involved in mitosis, H3K9 became highly acetylated in G2M, possibly because of ongoing transcription. In conclusion, modified histone isoforms H3K9ac, H3K4me3 and H3K27me3 exhibit a characteristic genomic distribution at resolutions of 1 Mb and below that differs between HeLa and lymphoblastoid cells but remains remarkably consistent through the cell cycle. We suggest that this cell-type-specific chromosomal bar-code is part of a homeostatic mechanism by which cells retain their characteristic gene expression patterns, and hence their identity, through multiple mitoses.


2021 ◽  
Vol 4 (3) ◽  
pp. 49
Author(s):  
Tomas Zelenka ◽  
Charalampos Spilianakis

The functional implications of the three-dimensional genome organization are becoming increasingly recognized. The Hi-C and HiChIP research approaches belong among the most popular choices for probing long-range chromatin interactions. A few methodical protocols have been published so far, yet their reproducibility and efficiency may vary. Most importantly, the high frequency of the dangling ends may dramatically affect the number of usable reads mapped to valid interaction pairs. Additionally, more obstacles arise from the chromatin compactness of certain investigated cell types, such as primary T cells, which due to their small and compact nuclei, impede limitations for their use in various genomic approaches. Here we systematically optimized all the major steps of the HiChIP protocol in T cells. As a result, we reduced the number of dangling ends to nearly zero and increased the proportion of long-range interaction pairs. Moreover, using three different mouse genotypes and multiple biological replicates, we demonstrated the high reproducibility of the optimized protocol. Although our primary goal was to optimize HiChIP, we also successfully applied the optimized steps to Hi-C, given their significant protocol overlap. Overall, we describe the rationale behind every optimization step, followed by a detailed protocol for both HiChIP and Hi-C experiments.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Sinisa Hrvatin ◽  
Christopher P Tzeng ◽  
M Aurel Nagy ◽  
Hume Stroud ◽  
Charalampia Koutsioumpa ◽  
...  

Enhancers are the primary DNA regulatory elements that confer cell type specificity of gene expression. Recent studies characterizing individual enhancers have revealed their potential to direct heterologous gene expression in a highly cell-type-specific manner. However, it has not yet been possible to systematically identify and test the function of enhancers for each of the many cell types in an organism. We have developed PESCA, a scalable and generalizable method that leverages ATAC- and single-cell RNA-sequencing protocols, to characterize cell-type-specific enhancers that should enable genetic access and perturbation of gene function across mammalian cell types. Focusing on the highly heterogeneous mammalian cerebral cortex, we apply PESCA to find enhancers and generate viral reagents capable of accessing and manipulating a subset of somatostatin-expressing cortical interneurons with high specificity. This study demonstrates the utility of this platform for developing new cell-type-specific viral reagents, with significant implications for both basic and translational research.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Aminah T Ali ◽  
Lena Boehme ◽  
Guillermo Carbajosa ◽  
Vlad C Seitan ◽  
Kerrin S Small ◽  
...  

Mitochondria play important roles in cellular processes and disease, yet little is known about how the transcriptional regime of the mitochondrial genome varies across individuals and tissues. By analyzing >11,000 RNA-sequencing libraries across 36 tissue/cell types, we find considerable variation in mitochondrial-encoded gene expression along the mitochondrial transcriptome, across tissues and between individuals, highlighting the importance of cell-type specific and post-transcriptional processes in shaping mitochondrial-encoded RNA levels. Using whole-genome genetic data we identify 64 nuclear loci associated with expression levels of 14 genes encoded in the mitochondrial genome, including missense variants within genes involved in mitochondrial function (TBRG4, MTPAP and LONP1), implicating genetic mechanisms that act in trans across the two genomes. We replicate ~21% of associations with independent tissue-matched datasets and find genetic variants linked to these nuclear loci that are associated with cardio-metabolic phenotypes and Vitiligo, supporting a potential role for variable mitochondrial-encoded gene expression in complex disease.


2018 ◽  
Author(s):  
Caroline Fecher ◽  
Laura Trovò ◽  
Stephan A. Müller ◽  
Nicolas Snaidero ◽  
Jennifer Wettmarshausen ◽  
...  

AbstractMitochondria vary in morphology and function in different tissues, however little is known about their molecular diversity among cell types. To investigate mitochondrial diversity in vivo, we developed an efficient protocol to isolate cell type-specific mitochondria based on a new MitoTag mouse. We profiled the mitochondrial proteome of three major neural cell types in cerebellum and identified a substantial number of differential mitochondrial markers for these cell types in mice and humans. Based on predictions from these proteomes, we demonstrate that astrocytic mitochondria metabolize long-chain fatty acids more efficiently than neurons. Moreover, we identified Rmdn3 as a major determinant of ER-mitochondria proximity in Purkinje cells. Our novel approach enables exploring mitochondrial diversity on the functional and molecular level in many in vivo contexts.


Author(s):  
Nurlan Kerimov ◽  
James D Hayhurst ◽  
Kateryna Peikova ◽  
Jonathan R Manning ◽  
Peter Walter ◽  
...  

An increasing number of gene expression quantitative trait locus (eQTL) studies have made summary statistics publicly available, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and colocalisation. However, differences between these datasets, in their variants tested, allele codings, and in the transcriptional features quantified, are a barrier to their widespread use. Consequently, target genes for most GWAS signals have still not been identified. Here, we present the eQTL Catalogue (https://www.ebi.ac.uk/eqtl/), a resource which contains quality controlled, uniformly re-computed QTLs from 21 eQTL studies. We find that for matching cell types and tissues, the eQTL effect sizes are highly reproducible between studies, enabling the integrative analysis of these data. Although most cis-eQTLs were shared between most bulk tissues, the analysis of purified cell types identified a greater diversity of cell-type-specific eQTLs, a subset of which also manifested as novel disease colocalisations. Our summary statistics can be downloaded by FTP, accessed via a REST API, and visualised on the Ensembl genome browser. New datasets will continuously be added to the eQTL Catalogue, enabling the systematic interpretation of human GWAS associations across many cell types and tissues.


Sign in / Sign up

Export Citation Format

Share Document