ALTRE: workflow for defining ALTered Regulatory Elements using chromatin accessibility data

2016 ◽  
Author(s):  
Elizabeth Baskin ◽  
Rick Farouni ◽  
Ewy A. Mathe

Abstract Summary: Regulatory elements regulate gene transcription, and their location and accessibility are cell-type specific, particularly for enhancers. Mapping and comparing chromatin accessibility between different cell types may identify mechanisms involved in cellular development and disease progression. To streamline and simplify differential analysis of regulatory elements genome-wide using chromatin accessibility data, such as DNase-seq and ATAC-seq, we developed ALTRE (ALTered Regulatory Elements), an R package and associated R Shiny web app. ALTRE makes such analysis accessible to a wide range of users – from novice to practiced computational biologists. Availability: https://github.com/Mathelab/[email protected]

2021 ◽  
Author(s):  
Sneha Gopalan ◽  
Yuqing Wang ◽  
Nicholas W. Harper ◽  
Manuel Garber ◽  
Thomas G Fazzio

Methods derived from CUT&RUN and CUT&Tag enable genome-wide mapping of the localization of proteins on chromatin from as few as one cell. These and other mapping approaches focus on one protein at a time, preventing direct measurements of co-localization of different chromatin proteins in the same cells and requiring prioritization of targets where samples are limiting. Here we describe multi-CUT&Tag, an adaptation of CUT&Tag that overcomes these hurdles by using antibody-specific barcodes to simultaneously map multiple proteins in the same cells. Highly specific multi-CUT&Tag maps of histone marks and RNA Polymerase II uncovered sites of co-localization in the same cells, active and repressed genes, and candidate cis-regulatory elements. Single-cell multi-CUT&Tag profiling facilitated identification of distinct cell types from a mixed population and characterization of cell type-specific chromatin architecture. In sum, multi-CUT&Tag increases the information content per cell of epigenomic maps, facilitating direct analysis of the interplay of different proteins on chromatin.


2020 ◽  
Author(s):  
SK Reilly ◽  
SJ Gosai ◽  
A Gutierrez ◽  
JC Ulirsch ◽  
M Kanai ◽  
...  

Abstract CRISPR screens for cis-regulatory elements (CREs) have shown unprecedented power to endogenously characterize the non-coding genome. To characterize CREs, we developed HCR-FlowFISH (Hybridization Chain Reaction Fluorescent In-Situ Hybridization coupled with Flow Cytometry), which directly quantifies native transcripts within their endogenous loci following CRISPR perturbations of regulatory elements, eliminating the need for restrictive phenotypic assays such as growth or transcript-tagging. HCR-FlowFISH accurately quantifies gene expression across a wide range of transcript levels and cell types. We also developed CASA (CRISPR Activity Screen Analysis), a hierarchical Bayesian model to identify and quantify CRE activity. Using >270,000 perturbations, we identified CREs for GATA1, HDAC6, ERP29, LMO2, MEF2C, CD164, NMU, FEN1 and the FADS gene cluster. Our methods detect subtle gene expression changes and identify CREs regulating multiple genes, sometimes at different magnitudes and directions. We demonstrate the power of HCR-FlowFISH to parse genome-wide association signals by nominating causal variants and target genes.


2019 ◽  
Vol 35 (20) ◽  
pp. 3898-3905 ◽  
Author(s):  
Ziyi Li ◽  
Zhijin Wu ◽  
Peng Jin ◽  
Hao Wu

Abstract Motivation: Samples from clinical practices are often mixtures of different cell types. The high-throughput data obtained from these samples are thus mixed signals. The cell mixture brings complications to data analysis, and will lead to biased results if not properly accounted for. Results: We develop a method to model the high-throughput data from mixed, heterogeneous samples, and to detect differential signals. Our method allows flexible statistical inference for detecting a variety of cell-type specific changes. Extensive simulation studies and analyses of two real datasets demonstrate the favorable performance of our proposed method compared with existing ones serving a similar purpose. Availability and implementation: The proposed method is implemented as an R package and is freely available on GitHub (https://github.com/ziyili20/TOAST). Supplementary information: Supplementary data are available at Bioinformatics online.


1985 ◽  
Vol 101 (4) ◽  
pp. 1442-1454 ◽  
Author(s):  
P Cowin ◽  
H P Kapprell ◽  
W W Franke

Desmosomal plaque proteins have been identified in immunoblotting and immunolocalization experiments on a wide range of cell types from several species, using a panel of monoclonal murine antibodies to desmoplakins I and II and a guinea pig antiserum to desmosomal band 5 protein. Specifically, we have taken advantage of the fact that certain antibodies react with both desmoplakins I and II, whereas others react only with desmoplakin I, indicating that desmoplakin I contains unique regions not present on the closely related desmoplakin II. While some of these antibodies recognize epitopes conserved between chick and man, others display a narrow species specificity. The results show that proteins whose size, charge, and biochemical behavior are very similar to those of desmoplakin I and band 5 protein of cow snout epidermis are present in all desmosomes examined. These include examples of simple and pseudostratified epithelia and myocardial tissue, in addition to those of stratified epithelia. In contrast, in immunoblotting experiments, we have detected desmoplakin II only among cells of stratified and pseudostratified epithelial tissues. This suggests that the desmosomal plaque structure varies in its complement of polypeptides in a cell-type specific manner. We conclude that the obligatory desmosomal plaque proteins, desmoplakin I and band 5 protein, are expressed in a coordinate fashion but independently from other differentiation programs of expression such as those specific for either epithelial or cardiac cells.


2018 ◽  
Author(s):  
George E. Gentsch ◽  
Thomas Spruce ◽  
Nick D. L. Owens ◽  
James C. Smith

Abstract Embryonic development yields many different cell types in response to just a few families of inductive signals. The property of a signal-receiving cell that determines how it responds to such signals, including the activation of cell type-specific genes, is known as its competence. Here, we show how maternal factors modify chromatin to specify initial competence in the frog Xenopus tropicalis. We identified the earliest engaged regulatory DNA sequences, and inferred from them critical activators of the zygotic genome. Of these, we showed that the pioneering activity of the maternal pluripotency factors Pou5f3 and Sox3 predefines competence for germ layer formation by extensively remodeling compacted chromatin before the onset of signaling. The remodeling includes the opening and marking of thousands of regulatory elements, extensive chromatin looping, and the co-recruitment of signal-mediating transcription factors. Our work identifies significant developmental principles that inform our understanding of how pluripotent stem cells interpret inductive signals.


Author(s):  
Tiit Örd ◽  
Kadri Õunap ◽  
Lindsey Stolze ◽  
Rédouane Aherrahrou ◽  
Valtteri Nurminen ◽  
...  

Rationale: Genome-wide association studies (GWAS) have identified hundreds of loci associated with coronary artery disease (CAD). Many of these loci are enriched in cis-regulatory elements (CREs) but are linked neither to cardiometabolic risk factors nor to candidate causal genes, complicating their functional interpretation. Objective: Single nucleus chromatin accessibility profiling of human atherosclerotic lesions was used to investigate cell type-specific patterns of CREs, to understand transcription factors establishing cell identity and to interpret CAD-relevant, non-coding genetic variation. Methods and Results: We used single nucleus ATAC-seq to generate DNA accessibility maps in >7,000 cells derived from human atherosclerotic lesions. We identified five major lesional cell types including endothelial cells, smooth muscle cells, monocyte/macrophages, NK/T-cells and B-cells and further investigated subtype characteristics of macrophages and smooth muscle cells transitioning into fibromyocytes. We demonstrated that CAD-associated genetic variants are particularly enriched in endothelial and smooth muscle cell-specific open chromatin. Using single cell co-accessibility and cis-eQTL information, we prioritized putative target genes and candidate regulatory elements for ~30% of all known CAD loci. Finally, we performed genome-wide experimental fine-mapping of the CAD GWAS variants using epigenetic QTL analysis in primary human aortic endothelial cells and a STARR-Seq massively parallel reporter assay in smooth muscle cells. This analysis identified potential causal SNP(s) and the associated target gene for over 30 CAD loci. We present several examples where the chromatin accessibility and gene expression could be assigned to one cell type, predicting the cell type of action for CAD loci.
Conclusions: These findings highlight the potential of applying snATAC-seq to human tissues in revealing relative contributions of distinct cell types to diseases and in identifying genes likely to be influenced by non-coding GWAS variants.


2019 ◽  
Author(s):  
Florian Schmidt ◽  
Alexander Marx ◽  
Marie Hebel ◽  
Martin Wegner ◽  
Nina Baumgarten ◽  
...  

Abstract Understanding the complexity of transcriptional regulation is a major goal of computational biology. Because experimental linkage of regulatory sites to genes is challenging, computational methods considering epigenomics data have been proposed to create tissue-specific regulatory maps. However, we showed that these approaches are not well suited to account for the variations of the regulatory landscape between cell-types. To overcome these drawbacks, we developed a new method called STITCHIT that identifies and links putative regulatory sites to genes. Within STITCHIT, we consider the chromatin accessibility signal of all samples jointly to identify regions exhibiting a signal variation related to the expression of a distinct gene. STITCHIT outperforms previous approaches in various validation experiments and was used with a genome-wide CRISPR-Cas9 screen to prioritize novel doxorubicin-resistance genes and their associated non-coding regulatory regions. We believe that our work paves the way for a more refined understanding of transcriptional regulation at the gene-level.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Jingxue Xin ◽  
Hui Zhang ◽  
Yaoxi He ◽  
Zhana Duren ◽  
Caijuan Bai ◽  
...  

Abstract High-altitude adaptation of Tibetans represents a remarkable case of natural selection during recent human evolution. Previous genome-wide scans found many non-coding variants under selection, suggesting a pressing need to understand the functional role of non-coding regulatory elements (REs). Here, we generate time courses of paired ATAC-seq and RNA-seq data on cultured HUVECs under hypoxic and normoxic conditions. We further develop a variant interpretation methodology (vPECA) to identify active selected REs (ASREs) and associated regulatory network. We discover three causal SNPs of EPAS1, the key adaptive gene for Tibetans. These SNPs decrease the accessibility of ASREs with weakened binding strength of relevant TFs, and cooperatively down-regulate EPAS1 expression. We further construct the downstream network of EPAS1, elucidating its roles in hypoxic response and angiogenesis. Collectively, we provide a systematic approach to interpret phenotype-associated noncoding variants in proper cell types and relevant dynamic conditions, to model their impact on gene regulation.


Author(s):  
Tianshun Gao ◽  
Jiang Qian

Abstract Enhancers are distal cis-regulatory elements that activate the transcription of their target genes. They regulate a wide range of important biological functions and processes, including embryogenesis, development, and homeostasis. As more and more large-scale technologies have been developed for enhancer identification, a comprehensive database is highly desirable for enhancer annotation based on various genome-wide profiling datasets across different species. Here, we present an updated database, EnhancerAtlas 2.0 (http://www.enhanceratlas.org/indexv2.php), covering 586 tissue/cell types that include a large number of normal tissues, cancer cell lines, and cells at different development stages across nine species. Overall, the database contains 13 494 603 enhancers, which were obtained from 16 055 datasets using 12 high-throughput experiment methods (e.g. H3K4me1/H3K27ac, DNase-seq/ATAC-seq, P300, POLR2A, CAGE, ChIA-PET, GRO-seq, STARR-seq and MPRA). The updated version is a huge expansion of the first version, which only contains the enhancers in human cells. In addition, we predicted enhancer–target gene relationships in human, mouse and fly. Finally, users can search enhancers and enhancer–target gene relationships through five user-friendly, interactive modules. We believe the new annotation of enhancers in EnhancerAtlas 2.0 will help users perform useful functional analyses of enhancers in various genomes.


Science ◽  
2020 ◽  
Vol 370 (6518) ◽  
pp. eaba7612 ◽  
Author(s):  
Silvia Domcke ◽  
Andrew J. Hill ◽  
Riza M. Daza ◽  
Junyue Cao ◽  
Diana R. O’Day ◽  
...  

The chromatin landscape underlying the specification of human cell types is of fundamental interest. We generated human cell atlases of chromatin accessibility and gene expression in fetal tissues. For chromatin accessibility, we devised a three-level combinatorial indexing assay and applied it to 53 samples representing 15 organs, profiling ~800,000 single cells. We leveraged cell types defined by gene expression to annotate these data and cataloged hundreds of thousands of candidate regulatory elements that exhibit cell type–specific chromatin accessibility. We investigated the properties of lineage-specific transcription factors (such as POU2F1 in neurons), organ-specific specializations of broadly distributed cell types (such as blood and endothelial), and cell type–specific enrichments of complex trait heritability. These data represent a rich resource for the exploration of in vivo human gene regulation in diverse tissues and cell types.

