scholarly journals plot2DO: a tool to assess the quality and distribution of genomic data

2017 ◽  
Author(s):  
Răzvan V. Chereji

AbstractSummaryMicrococcal nuclease digestion followed by deep sequencing (MNase-seq) is the most used method to investigate nucleosome organization on a genome-wide scale. We present plot2DO, a software package for creating 2D occupancy plots, which allows biologists to evaluate the quality of MNase-seq data and to visualize the distribution of nucleosomes near the functional regions of the genome (e.g. gene promoters, origins of replication, etc.).Availability And ImplementationThe plot2DO open source package is freely available on GitHub at https://github.com/rchereji/plot2DO under the MIT [email protected] InformationSupplementary data are available at Bioinformatics online.

2017 ◽  
Author(s):  
Sheng’en Hu ◽  
Xiaolan Chen ◽  
Ji Liao ◽  
Yiqing Chen ◽  
Chengchen Zhao ◽  
...  

AbstractNucleosome organization affects the accessibility of cis-elements to trans-acting factors. Micrococcal nuclease digestion followed by high-throughput sequencing (MNase-seq) is the most popular technology used to profile nucleosome organization on a genome-wide scale. Evaluating the data quality of MNase-seq data remains challenging, especially in mammalian. There is a strong need for a convenient and comprehensive approach to obtain dedicated quality control (QC) for MNase-seq data analysis. Here we developed CAM, which is a comprehensive QC pipeline for MNase-seq data. The CAM pipeline provides multiple informative QC measurements and nucleosome organization profiles on different potentially functional regions for given MNase-seq data. CAM also includes 268 historical MNase-seq datasets from human and mouse as a reference atlas for unbiased assessment. CAM is freely available at: http://www.tongji.edu.cn/~zhanglab/CAM


2019 ◽  
Author(s):  
Maria Rojec ◽  
Antoine Hocher ◽  
Matthias Merkenschlager ◽  
Tobias Warnecke

ABSTRACTNucleosomes restrict DNA accessibility throughout eukaryotic genomes, with repercussions for replication, transcription, and other DNA-templated processes. How this globally restrictive organization emerged from a presumably more open ancestral state remains poorly understood. Here, to better understand the challenges associated with establishing globally restrictive chromatin, we express histones in a naïve bacterial system that has not evolved to deal with nucleosomal structures:Escherichia coli. We find that histone proteins from the archaeonMethanothermus fervidusassemble on theE. colichromosomein vivoand protect DNA from micrococcal nuclease digestion, allowing us to map binding footprints genome-wide. We provide evidence that nucleosome occupancy along theE. coligenome tracks intrinsic sequence preferences but is disturbed by ongoing transcription and replication. Notably, we show that higher nucleosome occupancy at promoters and across gene bodies is associated with lower transcript levels, consistent with local repressive effects. Surprisingly, however, this sudden enforced chromatinization has only mild repercussions for growth, suggesting that histones can become established as ubiquitous chromatin proteins without interfering critically with key DNA-templated processes. Our results have implications for the evolvability of transcriptional ground states and highlight chromatinization by archaeal histones as a potential avenue for controlling genome accessibility in synthetic prokaryotic systems.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Maria Rojec ◽  
Antoine Hocher ◽  
Kathryn M Stevens ◽  
Matthias Merkenschlager ◽  
Tobias Warnecke

Nucleosomes restrict DNA accessibility throughout eukaryotic genomes, with repercussions for replication, transcription, and other DNA-templated processes. How this globally restrictive organization emerged during evolution remains poorly understood. Here, to better understand the challenges associated with establishing globally restrictive chromatin, we express histones in a naive system that has not evolved to deal with nucleosomal structures: Escherichia coli. We find that histone proteins from the archaeon Methanothermus fervidus assemble on the E. coli chromosome in vivo and protect DNA from micrococcal nuclease digestion, allowing us to map binding footprints genome-wide. We show that higher nucleosome occupancy at promoters is associated with lower transcript levels, consistent with local repressive effects. Surprisingly, however, this sudden enforced chromatinization has only mild repercussions for growth unless cells experience topological stress. Our results suggest that histones can become established as ubiquitous chromatin proteins without interfering critically with key DNA-templated processes.


2019 ◽  
Vol 36 (5) ◽  
pp. 1509-1516
Author(s):  
Andrew W George ◽  
Arunas Verbyla ◽  
Joshua Bowden

Abstract Motivation We present Eagle, a new method for multi-locus association mapping. The motivation for developing Eagle was to make multi-locus association mapping ‘easy’ and the method-of-choice. Eagle’s strengths are that it (i) is considerably more powerful than single-locus association mapping, (ii) does not suffer from multiple testing issues, (iii) gives results that are immediately interpretable and (iv) has a computational footprint comparable to single-locus association mapping. Results By conducting a large simulation study, we will show that Eagle finds true and avoids false single-nucleotide polymorphism trait associations better than competing single- and multi-locus methods. We also analyze data from a published mouse study. Eagle found over 50% more validated findings than the state-of-the-art single-locus method. Availability and implementation Eagle has been implemented as an R package, with a browser-based Graphical User Interface for users less familiar with R. It is freely available via the CRAN website at https://cran.r-project.org. Videos, Quick Start guides, FAQs and Demos are available via the Eagle website http://eagle.r-forge.r-project.org. Supplementary information Supplementary data are available at Bioinformatics online.


2014 ◽  
Vol 369 (1652) ◽  
pp. 20130514 ◽  
Author(s):  
Erica Shen ◽  
Hennady Shulha ◽  
Zhiping Weng ◽  
Schahram Akbarian

The growing list of mutations implicated in monogenic disorders of the developing brain includes at least seven genes ( ARX, CUL4B, KDM5A, KDM5C, KMT2A, KMT2C, KMT2D ) with loss-of-function mutations affecting proper regulation of histone H3 lysine 4 methylation, a chromatin mark which on a genome-wide scale is broadly associated with active gene expression, with its mono-, di- and trimethylated forms differentially enriched at promoter and enhancer and other regulatory sequences. In addition to these rare genetic syndromes, dysregulated H3K4 methylation could also play a role in the pathophysiology of some cases diagnosed with autism or schizophrenia, two conditions which on a genome-wide scale are associated with H3K4 methylation changes at hundreds of loci in a subject-specific manner. Importantly, the reported alterations for some of the diseased brain specimens included a widespread broadening of H3K4 methylation profiles at gene promoters, a process that could be regulated by the UpSET(KMT2E/MLL5)-histone deacetylase complex. Furthermore, preclinical studies identified maternal immune activation, parental care and monoaminergic drugs as environmental determinants for brain-specific H3K4 methylation. These novel insights into the epigenetic risk architectures of neurodevelopmental disease will be highly relevant for efforts aimed at improved prevention and treatment of autism and psychosis spectrum disorders.


Genes ◽  
2020 ◽  
Vol 11 (10) ◽  
pp. 1154
Author(s):  
Min Jeong Hong ◽  
Jin-Baek Kim ◽  
Yong Weon Seo ◽  
Dae Yeon Kim

Genes of the F-box family play specific roles in protein degradation by post-translational modification in several biological processes, including flowering, the regulation of circadian rhythms, photomorphogenesis, seed development, leaf senescence, and hormone signaling. F-box genes have not been previously investigated on a genome-wide scale; however, the establishment of the wheat (Triticum aestivum L.) reference genome sequence enabled a genome-based examination of the F-box genes to be conducted in the present study. In total, 1796 F-box genes were detected in the wheat genome and classified into various subgroups based on their functional C-terminal domain. The F-box genes were distributed among 21 chromosomes and most showed high sequence homology with F-box genes located on the homoeologous chromosomes because of allohexaploidy in the wheat genome. Additionally, a synteny analysis of wheat F-box genes was conducted in rice and Brachypodium distachyon. Transcriptome analysis during various wheat developmental stages and expression analysis by quantitative real-time PCR revealed that some F-box genes were specifically expressed in the vegetative and/or seed developmental stages. A genome-based examination and classification of F-box genes provide an opportunity to elucidate the biological functions of F-box genes in wheat.


2014 ◽  
Vol 42 (15) ◽  
pp. 9838-9853 ◽  
Author(s):  
Saeed Kaboli ◽  
Takuya Yamakawa ◽  
Keisuke Sunada ◽  
Tao Takagaki ◽  
Yu Sasano ◽  
...  

Abstract Despite systematic approaches to mapping networks of genetic interactions in Saccharomyces cerevisiae, exploration of genetic interactions on a genome-wide scale has been limited. The S. cerevisiae haploid genome has 110 regions that are longer than 10 kb but harbor only non-essential genes. Here, we attempted to delete these regions by PCR-mediated chromosomal deletion technology (PCD), which enables chromosomal segments to be deleted by a one-step transformation. Thirty-three of the 110 regions could be deleted, but the remaining 77 regions could not. To determine whether the 77 undeletable regions are essential, we successfully converted 67 of them to mini-chromosomes marked with URA3 using PCR-mediated chromosome splitting technology and conducted a mitotic loss assay of the mini-chromosomes. Fifty-six of the 67 regions were found to be essential for cell growth, and 49 of these carried co-lethal gene pair(s) that were not previously been detected by synthetic genetic array analysis. This result implies that regions harboring only non-essential genes contain unidentified synthetic lethal combinations at an unexpectedly high frequency, revealing a novel landscape of genetic interactions in the S. cerevisiae genome. Furthermore, this study indicates that segmental deletion might be exploited for not only revealing genome function but also breeding stress-tolerant strains.


2022 ◽  
Vol 12 ◽  
Author(s):  
Inge Holm ◽  
Luisa Nardini ◽  
Adrien Pain ◽  
Emmanuel Bischoff ◽  
Cameron E. Anderson ◽  
...  

Almost all regulation of gene expression in eukaryotic genomes is mediated by the action of distant non-coding transcriptional enhancers upon proximal gene promoters. Enhancer locations cannot be accurately predicted bioinformatically because of the absence of a defined sequence code, and thus functional assays are required for their direct detection. Here we used a massively parallel reporter assay, Self-Transcribing Active Regulatory Region sequencing (STARR-seq), to generate the first comprehensive genome-wide map of enhancers in Anopheles coluzzii, a major African malaria vector in the Gambiae species complex. The screen was carried out by transfecting reporter libraries created from the genomic DNA of 60 wild A. coluzzii from Burkina Faso into A. coluzzii 4a3A cells, in order to functionally query enhancer activity of the natural population within the homologous cellular context. We report a catalog of 3,288 active genomic enhancers that were significant across three biological replicates, 74% of them located in intergenic and intronic regions. The STARR-seq enhancer screen is chromatin-free and thus detects inherent activity of a comprehensive catalog of enhancers that may be restricted in vivo to specific cell types or developmental stages. Testing of a validation panel of enhancer candidates using manual luciferase assays confirmed enhancer function in 26 of 28 (93%) of the candidates over a wide dynamic range of activity from two to at least 16-fold activity above baseline. The enhancers occupy only 0.7% of the genome, and display distinct composition features. The enhancer compartment is significantly enriched for 15 transcription factor binding site signatures, and displays divergence for specific dinucleotide repeats, as compared to matched non-enhancer genomic controls. The genome-wide catalog of A. coluzzii enhancers is publicly available in a simple searchable graphic format. This enhancer catalogue will be valuable in linking genetic and phenotypic variation, in identifying regulatory elements that could be employed in vector manipulation, and in better targeting of chromosome editing to minimize extraneous regulation influences on the introduced sequences.Importance: Understanding the role of the non-coding regulatory genome in complex disease phenotypes is essential, but even in well-characterized model organisms, identification of regulatory regions within the vast non-coding genome remains a challenge. We used a large-scale assay to generate a genome wide map of transcriptional enhancers. Such a catalogue for the important malaria vector, Anopheles coluzzii, will be an important research tool as the role of non-coding regulatory variation in differential susceptibility to malaria infection is explored and as a public resource for research on this important insect vector of disease.


Sign in / Sign up

Export Citation Format

Share Document