scholarly journals Chromatin co-accessibility is highly structured, spans entire chromosomes, and mediates long range regulatory genetic effects

2019 ◽  
Author(s):  
William W. Young Greenwald ◽  
Agnieszka D’Antonio-Chronowska ◽  
Paola Benaglio ◽  
Hiroko Matsui ◽  
Erin N. Smith ◽  
...  

AbstractChromatin accessibility identifies active regions of the genome, often at transcription factor (TF) binding sites, enhancers, and promoters, and contains regulatory genetic variation. Functionally related accessible sites have been reported to be co-accessible; however, the prevalence and range of co-accessibility is unknown. We perform ATAC-seq in induced pluripotent stem cells from 134 individuals and integrate it with RNA-seq, WGS, and ChIP-seq, providing the first long-range chromosome-length analysis of co-accessibility. We show that co-accessibility is highly connected, with sites having a median of 24 co-accessible partners up to 250Mb away. We also show that co-accessibility can de novo identify known and novel co-expressed genes, and co-regulatory TFs and chromatin states. We perform a cis and trans-caQTL, a trans-eQTL, and examine allelic effects of co-accessibility, identifying tens of thousands of trans-caQTLs, and showing that trans genetic effects can be propagated through co-accessibility to gene expression for cell-type and disease relevant genes.

Blood ◽  
2019 ◽  
Vol 134 (Supplement_1) ◽  
pp. 3783-3783
Author(s):  
Alexia Katsarou ◽  
Nikolaos Trasanidis ◽  
Jaime Alvarez-Benayas ◽  
Foteini Papaleonidopoulou ◽  
Keren Keren ◽  
...  

Overexpression of the transcription factor MAF, as a result of its juxtaposition to the IgH enhancer [MAF-translocated t(14;16)], is a myeloma-initiating event in 3-5% of patients with multiple myeloma (MM) and confers a poor prognosis. MAF is also overexpressed in another 40% of cases, often in co-operation with the oncogene MMSET. The mechanisms by which MAF overexpression impacts on the regulatory genome to generate the MAF-driven oncogenic transcriptome and its direct targets are not known. To address this, we employed a multi-layer -omics approach using primary myeloma plasma cells (PC) as well as myeloma cell lines (MMCL). First, we determined the chromatin accessibility and transcriptome profiles of MAF-translocated myeloma by performing ATAC-seq and RNA-seq, respectively, in purified bone marrow CD138+ PC from two patients with t(14;16) and three healthy donors. We identified 6,640 differentially accessible regions, 87% of which displayed enhanced chromatin accessibility in MAF samples compared to normal PC. Secondary analysis comparing this with ATAC-seq data from a set of 28 other MM samples, including hyperdiploid, MMSET and CCND1-translocated MM, revealed 33% of those regions to be MAF subgroup specific (1,949 regions), with the rest shared between MAF and other cytogenetic groups. Gene annotation and pathway enrichment analysis using GREAT confirmed overrepresentation of the MF myeloma patient signature, as previously identified in microarray datasets. RNA-seq detected significant upregulation of approximately 900 genes in MAF samples compared to normal counterparts, including MAF itself (top 4th hit) as well as its presumed targets (CCND2, ITGB7 and NUAK1). Next, we obtained the MAF cistrome using ChIP-seq in the MAF-translocated MMCL MM1.S and integrated it with the primary PC ATAC-seq data. This revealed that 31% (618/1,949) of the differentially accessible regions in MAF-translocated MM PC are also MAF-bound. Additional overlay with ENCODE ChromHMM epigenome map showed that 47% of MAF binding sites are on active enhancers and 42% on active promoters signifying potential direct regulation of the corresponding genes. Next, we superimposed the accessible and MAF-bound loci on the epigenomic landscapes of normal PC and other B-cell types using their corresponding ChromHMM maps (Blueprint consortium data). Interestingly, 56% (345/618) of the MAF-specific regions were not active in any stage of B cell development. This suggests that aberrant MAF overexpression and chromatin binding in PC is associated with de novo activation of these chromatin regions, over half of which (200/345; 58%) are enhancers; we termed these 'neo-enhancers'. Upon de novo motif analysis of MAF ChIP-seq in MAF-translocated JJN3 and MM1.S MMCL, we confirmed MAF as the first and, interestingly, IRF4 as the second top hit, suggesting a possible MAF-IRF4 functional interaction in myelomagenesis. Indeed, overlay of the accessible MAF-bound loci with IRF4 ChIP-seq data in MM1.S revealed 63% co-occupancy (including 62% of "neo-enhancers"), proposing a novel and extensive co-operative chromatin-based network between the two transcription factors. Final integration of the accessible MAF-bound regions with the paired transcriptomes of primary myeloma PC revealed a set 206 candidate enhancer-gene pairs. Strikingly, we identified two IRF4-cobound "neo-enhancers" linked to overexpression of TLR4 and CCR1, two genes known for their roles in myeloma cell proliferation and migration. We confirmed significant downregulation of both genes upon shRNA-mediated knockdown of MAF in the two MAF-translocated MMCL, MM1.S and JJN3, as well as the lethality of MAF depletion. Further, MAF overexpression in MAF-negative myeloma backgrounds led to transcriptional upregulation of these genes, further validating them as MAF targets. While CRISPR/Cas9i experiments targeting TLR4 are ongoing, preliminary results validated the functional role of the "neo-enhancer" in CCR1 gene expression. In conclusion, we demonstrate for the first time an extensive re-organisation of the PC chromatin conferred by oncogenic MAF in MM; we reveal its extensive co-operation with IRF4 in this process; we validate the directly MAF-regulated genes and functionally characterise neo-enhancers of key MAF-dependent genes that in addition to MAF itself are also critical for myeloma biology. Disclosures Hatjiharissi: Janssen: Honoraria. Caputo:GSK: Research Funding. Karadimitris:GSK: Research Funding.


2020 ◽  
Author(s):  
Vinay S. Swamy ◽  
Temesgen D. Fufa ◽  
Robert B. Hufnagel ◽  
David M. McGaughey

AbstractDe novo transcriptome construction from short-read RNA-seq is a common method for reconstructing mRNA transcripts within a given sample. However, the precision of this process is unclear as it is difficult to obtain a ground-truth measure of transcript expression. With advances in third generation sequencing, full length transcripts of whole transcriptomes can be accurately sequenced to generate a ground-truth transcriptome. We generated long-read PacBio and short-read Illumina RNA-seq data from a human induced pluripotent stem cell- derived retinal pigmented epithelium (iPSC-RPE) cell line. We use long-read data to identify simple metrics for assessing de novo transcriptome construction and optimize a short-read based de novo transcriptome construction pipeline. We apply this this pipeline to construct transcriptomes for 340 short-read RNA-seq samples originating from healthy adult and fetal human retina, cornea, and RPE. We identify hundreds of novel gene isoforms and examine their significance in the context of ocular development and disease.


2022 ◽  
Author(s):  
Daniel F Kaemena ◽  
Masahito Yoshihara ◽  
James Ashmore ◽  
Meryam Beniazza ◽  
Suling Zhao ◽  
...  

Successful generation of induced pluripotent stem cells (iPSCs) via the overexpression of Oct4 (Pou5f1), Sox2, Klf4 and c-Myc (OSKM) highlights the power of transcription factor (TF)-mediated cellular conversions. Nevertheless, iPSC reprogramming is inherently inefficient and understanding the molecular mechanisms underlying this inefficiency holds the key to control cellular identity successfully. Here, we report 16 novel reprogramming roadblock genes identified by CRISPR/Cas9-mediated genome-wide knockout (KO) screening. Of these, disruption of KRAB zinc finger protein (KRAB-ZFP) Zfp266 strongly and consistently enhanced iPSC generation in several iPSC reprogramming settings, emerging as the most robust roadblock. Further analyses revealed that ZFP266 bound Short Interspersed Nuclear Elements (SINEs) adjacent to OSK binding sites and impedes chromatin opening. This work serves as a resource for better understanding reprogramming mechanisms and proposes SINEs as a critical genetic element that regulates chromatin accessibility at enhancers for efficient pluripotency induction.


2020 ◽  
Author(s):  
Bahareh Haddad Derafshi ◽  
Tamas Danko ◽  
Soham Chanda ◽  
Pedro Batista ◽  
Ulrike Litzenburger ◽  
...  

AbstractThe chromodomain helicase DNA-binding protein CHD8 is among the most frequently found de-novo mutations in autism (1–3). Unlike most other autism-risk genes, CHD8 mutations appear to be fully penetrant (4). Despite its prominent disease involvement, little known about its molecular function. Based on sequence homology, CHD8 is believed to be a chromatin regulator, but mechanisms for its genomic targeting and its role on chromatin are unclear. Here, we developed a human cell model carrying conditional CHD8 loss-of-function alleles. Full knockout CHD8 was required for the viability of undifferentiated human embryonic stem (ES) cells, whereas postmitotic neurons survived following CHD8 depletion. However, chromatin accessibility maps and transcriptional profiling revealed that CHD8 is a potent general chromatin activator, enhancing transcription of its direct target genes, including a large group of autism genes. CHD8’s genomic binding sites in human neurons were significantly enriched for ELK1 (ETS) motifs. Moreover, positive CHD8-dependent chromatin remodeling was enhanced at ELK1 motif-containing CHD8 binding sites. ELK1 was the most prominent ETS factor expressed in human neurons and was necessary for CHD8 to target the sites that contained the ELK1 motif, demonstrating a cooperative interaction between ELK1 and CHD8 on chromatin. We also observed potential role of CHD8 in ELK1 localization on nuclear compartments in a transcription-stage-dependent manner. Finally, inhibition of ELK1 activity or ELK1 knockdown that enhances the neurogenesis from embryonic stem cells (ES) was dependent on the presence of CHD8. In summary, our results establish that CHD8 is a strong activator of chromatin accessibility and transcription in neurons and reveals a role in regulating many high-risk autism genes. Additionally, we show there is molecular and functional interdependence of CHD8 and ELK1 in chromatin binding of CHD8, nuclear interaction of ELK1, and neurogenesis enhancement. These data imply the involvement of the MAPK/ERK pathway effector ELK1 in pathogenesis of autism caused by CHD8 mutations (5).


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Surajit Bhattacharya ◽  
Hayk Barseghyan ◽  
Emmanuèle C. Délot ◽  
Eric Vilain

Abstract Background Whole genome sequencing is effective at identification of small variants, but because it is based on short reads, assessment of structural variants (SVs) is limited. The advent of Optical Genome Mapping (OGM), which utilizes long fluorescently labeled DNA molecules for de novo genome assembly and SV calling, has allowed for increased sensitivity and specificity in SV detection. However, compared to small variant annotation tools, OGM-based SV annotation software has seen little development, and currently available SV annotation tools do not provide sufficient information for determination of variant pathogenicity. Results We developed an R-based package, nanotatoR, which provides comprehensive annotation as a tool for SV classification. nanotatoR uses both external (DGV; DECIPHER; Bionano Genomics BNDB) and internal (user-defined) databases to estimate SV frequency. Human genome reference GRCh37/38-based BED files are used to annotate SVs with overlapping, upstream, and downstream genes. Overlap percentages and distances for nearest genes are calculated and can be used for filtration. A primary gene list is extracted from public databases based on the patient’s phenotype and used to filter genes overlapping SVs, providing the analyst with an easy way to prioritize variants. If available, expression of overlapping or nearby genes of interest is extracted (e.g. from an RNA-Seq dataset, allowing the user to assess the effects of SVs on the transcriptome). Most quality-control filtration parameters are customizable by the user. The output is given in an Excel file format, subdivided into multiple sheets based on SV type and inheritance pattern (INDELs, inversions, translocations, de novo, etc.). nanotatoR passed all quality and run time criteria of Bioconductor, where it was accepted in the April 2019 release. We evaluated nanotatoR’s annotation capabilities using publicly available reference datasets: the singleton sample NA12878, mapped with two types of enzyme labeling, and the NA24143 trio. nanotatoR was also able to accurately filter the known pathogenic variants in a cohort of patients with Duchenne Muscular Dystrophy for which we had previously demonstrated the diagnostic ability of OGM. Conclusions The extensive annotation enables users to rapidly identify potential pathogenic SVs, a critical step toward use of OGM in the clinical setting.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Yapeng Li ◽  
Junfeng Gao ◽  
Mohammad Kamran ◽  
Laura Harmacek ◽  
Thomas Danhorn ◽  
...  

AbstractMast cells are critical effectors of allergic inflammation and protection against parasitic infections. We previously demonstrated that transcription factors GATA2 and MITF are the mast cell lineage-determining factors. However, it is unclear whether these lineage-determining factors regulate chromatin accessibility at mast cell enhancer regions. In this study, we demonstrate that GATA2 promotes chromatin accessibility at the super-enhancers of mast cell identity genes and primes both typical and super-enhancers at genes that respond to antigenic stimulation. We find that the number and densities of GATA2- but not MITF-bound sites at the super-enhancers are several folds higher than that at the typical enhancers. Our studies reveal that GATA2 promotes robust gene transcription to maintain mast cell identity and respond to antigenic stimulation by binding to super-enhancer regions with dense GATA2 binding sites available at key mast cell genes.


Plants ◽  
2021 ◽  
Vol 10 (7) ◽  
pp. 1465
Author(s):  
Ramon de Koning ◽  
Raphaël Kiekens ◽  
Mary Esther Muyoka Toili ◽  
Geert Angenon

Raffinose family oligosaccharides (RFO) play an important role in plants but are also considered to be antinutritional factors. A profound understanding of the galactinol and RFO biosynthetic gene families and the expression patterns of the individual genes is a prerequisite for the sustainable reduction of the RFO content in the seeds, without compromising normal plant development and functioning. In this paper, an overview of the annotation and genetic structure of all galactinol- and RFO biosynthesis genes is given for soybean and common bean. In common bean, three galactinol synthase genes, two raffinose synthase genes and one stachyose synthase gene were identified for the first time. To discover the expression patterns of these genes in different tissues, two expression atlases have been created through re-analysis of publicly available RNA-seq data. De novo expression analysis through an RNA-seq study during seed development of three varieties of common bean gave more insight into the expression patterns of these genes during the seed development. The results of the expression analysis suggest that different classes of galactinol- and RFO synthase genes have tissue-specific expression patterns in soybean and common bean. With the obtained knowledge, important galactinol- and RFO synthase genes that specifically play a key role in the accumulation of RFOs in the seeds are identified. These candidate genes may play a pivotal role in reducing the RFO content in the seeds of important legumes which could improve the nutritional quality of these beans and would solve the discomforts associated with their consumption.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Álvaro Figueroa ◽  
Antonio Brante ◽  
Leyla Cárdenas

AbstractThe polychaete Boccardia wellingtonensis is a poecilogonous species that produces different larval types. Females may lay Type I capsules, in which only planktotrophic larvae are present, or Type III capsules that contain planktotrophic and adelphophagic larvae as well as nurse eggs. While planktotrophic larvae do not feed during encapsulation, adelphophagic larvae develop by feeding on nurse eggs and on other larvae inside the capsules and hatch at the juvenile stage. Previous works have not found differences in the morphology between the two larval types; thus, the factors explaining contrasting feeding abilities in larvae of this species are still unknown. In this paper, we use a transcriptomic approach to study the cellular and genetic mechanisms underlying the different larval trophic modes of B. wellingtonensis. By using approximately 624 million high-quality reads, we assemble the de novo transcriptome with 133,314 contigs, coding 32,390 putative proteins. We identify 5221 genes that are up-regulated in larval stages compared to their expression in adult individuals. The genetic expression profile differed between larval trophic modes, with genes involved in lipid metabolism and chaetogenesis over expressed in planktotrophic larvae. In contrast, up-regulated genes in adelphophagic larvae were associated with DNA replication and mRNA synthesis.


Genes ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 320
Author(s):  
Lorissa I. McDougall ◽  
Ryan M. Powell ◽  
Magdalena Ratajska ◽  
Chi F. Lynch-Sutherland ◽  
Sultana Mehbuba Hossain ◽  
...  

Melanoma comprises <5% of cutaneous malignancies, yet it causes a significant proportion of skin cancer-related deaths worldwide. While new therapies for melanoma have been developed, not all patients respond well. Thus, further research is required to better predict patient outcomes. Using long-range nanopore sequencing, RT-qPCR, and RNA sequencing analyses, we examined the transcription of BARD1 splice isoforms in melanoma cell lines and patient tissue samples. Seventy-six BARD1 mRNA variants were identified in total, with several previously characterised isoforms (γ, φ, δ, ε, and η) contributing to a large proportion of the expressed transcripts. In addition, we identified four novel splice events, namely, Δ(E3_E9), ▼(i8), IVS10+131▼46, and IVS10▼176, occurring in various combinations in multiple transcripts. We found that short-read RNA-Seq analyses were limited in their ability to predict isoforms containing multiple non-contiguous splicing events, as compared to long-range nanopore sequencing. These studies suggest that further investigations into the functional significance of the identified BARD1 splice variants in melanoma are warranted.


Sign in / Sign up

Export Citation Format

Share Document