scholarly journals RAEdb: a database of enhancers identified by high-throughput reporter assays

Database ◽  
2019 ◽  
Vol 2019 ◽  
Author(s):  
Zena Cai ◽  
Ya Cui ◽  
Zhiying Tan ◽  
Gaihua Zhang ◽  
Zhongyang Tan ◽  
...  

Abstract High-throughput reporter assays have been recently developed to directly and quantitatively assess enhancer activity for thousands of regulatory elements. However, there is still no database to collect these enhancers. We developed RAEdb, the first database to collect enhancers identified by high-throughput reporter assays. RAEdb includes 538 320 enhancers derived from eight studies, most of which were from six human cell lines. An activity score was assigned to each enhancer based on reporter assays. Based on these enhancers, 7658 epromoters (promoters with enhancer activity) were identified and stored in the database. RAEdb provides two ways of searches: the first is to search studies by species and cell line; the other is to search enhancers or epromoters by position, activity score, sequence and gene. RAEdb also provides a genome browser to query, visualize and compare enhancers. All data in RAEdb is freely available for download.

2018 ◽  
Vol 5 (5) ◽  
pp. 172472 ◽  
Author(s):  
Setsuko Shioda ◽  
Fumio Kasai ◽  
Ken Watanabe ◽  
Kohei Kawakami ◽  
Azusa Ohtani ◽  
...  

Human cell lines have been used in a variety of research fields as an in vitro model. These cells are all derived from human tissue samples, thus there is a possibility of virus infection. Virus tests are routinely performed in clinical practice, but are limited in cell lines. In this study, we investigated 15 kinds of viruses in 844 human cell lines registered at the Japanese Collection of Research Bioresources (JCRB) Cell Bank. Our real-time PCR analysis revealed that six viruses, EBV, HTLV-1, HBV, B19V, HHV-6 and HHV-7, were detected in 43 cell lines. Of them, 20 cell lines were transformed by intentional infection in vitro with EBV or HTLV-1. Viruses in the other 23 cell lines and one EBV transformed cell line are derived from an in vivo infection, including five de novo identifications of EBV, B19V or HHV-7 carriers. Among them, 17 cell lines were established from patients diagnosed with virus-associated diseases. However, the other seven cell lines originated from in vivo cells unrelated to disease or cellular tropism. Our approach to screen for a set of 15 viruses in each cell line has worked efficiently to identify these rare cases. Virus tests in cell lines contribute not only to safety assessments but also to investigation of in vivo viral infection which can be a characteristic feature of cell lines.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 939 ◽  
Author(s):  
David Santiago-Algarra ◽  
Lan T.M. Dao ◽  
Lydie Pradel ◽  
Alexandre España ◽  
Salvatore Spicuglia

The regulation of gene transcription in higher eukaryotes is accomplished through the involvement of transcription start site (TSS)-proximal (promoters) and -distal (enhancers) regulatory elements. It is now well acknowledged that enhancer elements play an essential role during development and cell differentiation, while genetic alterations in these elements are a major cause of human disease. Many strategies have been developed to identify and characterize enhancers. Here, we discuss recent advances in high-throughput approaches to assess enhancer activity, from the well-established massively parallel reporter assays to the recent clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-based technologies. We highlight how these approaches contribute toward a better understanding of enhancer function, eventually leading to the discovery of new types of regulatory sequences, and how the alteration of enhancers can affect transcriptional regulation.


Endocrinology ◽  
2006 ◽  
Vol 147 (12) ◽  
pp. 5845-5854 ◽  
Author(s):  
Monique H. A. Kester ◽  
George G. J. M. Kuiper ◽  
Rogier Versteeg ◽  
Theo J. Visser

Type I iodothyronine deiodinase (D1) and type II iodothyronine deiodinase (D2) catalyze the activation of the prohormone T4 to the active hormone T3; type III iodothyronine deiodinase (D3) catalyzes the inactivation of T4 and T3. D3 is highly expressed in brain, placenta, pregnant uterus, and fetal tissues and plays an important role in regulating thyroid hormone bioavailability during fetal development. We examined the activity of the different deiodinases in human cell lines and investigated the regulation of D3 activity and mRNA expression in these cell lines, as well as its possible coexpression with neighboring genes Dlk1 and Dio3os, which may also be especially important during development. D1 activity and mRNA were only found in HepG2 hepatocarcinoma cells, and D2 activity was observed in none of the cell lines. D3 activity and mRNA was found in ECC-1 endometrium carcinoma cells, MCF-7 mammacarcinoma cells, WRL-68 embryonic liver cells, and SH-SY5Y neuroblastoma cells, but not in the HepG2 hepatocarcinoma cell line or in any choriocarcinoma or astrocytoma cell line. We demonstrated that the phorbol ester 12-O-tetradecanoylphorbol-13-acetate increased D3 activity 2- to 9-fold in ECC-1, MCF-7, WRL-68, and SH-SY5Y cells. Estradiol increased D3 activity 3-fold in ECC-1, but not in any other cells. Dexamethasone decreased D3 activity in WRL-68 cells only in the absence of fetal calf serum. Incubation with retinoids increased D3 activity 2- to 3-fold in ECC-1, WRL-68, and MCF-7 cells but decreased D3 activity in SH-SY5Y cells. D3 expression in the different cells was not affected by cAMP or thyroid hormone. Interestingly, D3 mRNA expression in the different cell lines strongly correlated with Dio3os mRNA expression and in a large set of neuroblastoma cell lines also with Dlk1 expression. In conclusion, we identified different human D3-expressing cell lines, in which the regulation of D3 expression is cell type-specific. Our data suggest that estradiol may be one of the factors contributing to the induction of D3 activity in the pregnant uterus and that in addition to gene-specific regulatory elements, more distant common regulatory elements also may be involved in the regulation of D3 expression.


Blood ◽  
2005 ◽  
Vol 106 (11) ◽  
pp. 358-358
Author(s):  
Jotin Marango ◽  
Manabu Shimoyama ◽  
Boris A. Leibovitch ◽  
Ming Ming Zhou ◽  
Yolanda Martinez ◽  
...  

Abstract Over 40% of cases of multiple myeloma (MM) are associated with translocations of the immunoglobulin heavy (IgH) chain gene gene. The t(4;14) translocation, present in ca. 20% of myeloma cases, results in the overexpression of two potential oncogenes, MMSET and FGFR3, via juxtaposition of their endogenous promoters to regulatory elements of the IgH locus. The presence of t(4;14), and MMSET overexpression, is an adverse prognostic factor in MM irrespective of FGFR3 expression. MMSET contains several conserved motifs found in proteins involved in chromatin function (PWWP, HMG, PHD domains) and in the epigenetic control of transcription (SET domain). Accordingly, we found that the two main isoforms of the MMSET protein exhibit exclusive nuclear localization in both transfected fibroblasts and myeloma cells carrying t(4;14). Towards our goal of defining the ability of MMSET to affect gene regulation and contribute to the disease pathogenesis, we found that the SET domain of MMSET possesses in vitro methyltransferase activity specific for core histones H3 and H4. Using a computational approach and theoretical extrapolation from the solved NMR structure of vSET, we identified residues in the active site of MMSET essential for catalysis, whose mutation drastically reduces enzymatic activity. Reporter assays using Gal4 fusion constructs showed that both the amino terminus of MMSET, containing the PWWP and HMG domains, as well as the SET-containing carboxy terminus act as transcriptional repressors. MMSET interacts physically and functionally with a number of known co-repressor molecules, such as HDAC1, HDAC2, Sin3a, and SIRT1, but not HDAC4 or HDAC6. As such, MMSET co-expression enhances HDAC1 and HDAC2-mediated repression in transcriptional reporter assays, and MMSET repression is partially relieved by the addition of an HDAC inhibitor. A yeast two hybrid screen identified a number of other functional partners of MMSET, including ZNF331/RITA (Rearranged in Thyroid Adenoma), a KRAB domain/zinc finger protein previously implicated in malignancy. MMSET and ZNF331 co-localize in the nuclei of transfected fibroblasts, co-immunoprecipitate, and display cooperative repression in reporter assays. Collectively, these data support the idea that MMSET is a biologically active, bifunctional transcriptional mediator acting as a HMT enzyme in chromatin remodeling and as a complex adaptor in the recruitment of repressor species. Presently we are modeling the biological effects of MMSET through a conditional overexpression system in a B cell line. While low levels of MMSET are ubiquitiously expressed, induction of high levels of MMSET expression in the B cell line is associated with growth suppression and G1 arrest. While paradoxical for a presumed oncoprotein, such actions have been observed for other disease-associated proteins such as Runx1/MTG8. In contrast, a myeloma cell line harboring t(4;14) proliferates in the presence of high level MMSET expression. RNAi-mediated knockdown of MMSET in these cells induces apoptotic cell death. This suggests that MMSET may be critical for growth and survival of myeloma cells. Profiling of gene expression changes in these systems should link the transcriptional and biological activities of MMSET.


Blood ◽  
2016 ◽  
Vol 128 (22) ◽  
pp. 954-954
Author(s):  
Kotaro Shide ◽  
Takuro Kameda ◽  
Masaaki Sekine ◽  
Ayako Kamiunten ◽  
Keiichi Akizuki ◽  
...  

Abstract Calreticulin (CALR) exon 9 mutations were reported in about two-thirds of JAK2 or MPL mutation negative ET and PMF patients. The mutations cause frameshifts that result in proteins with novel C-terminus.Retrovirus-mediated gene transfer into cell lines and mouse bone marrow (BM) cells is a common technique, but the expression level is very high compared to the physiological expression.We investigated the effects of physiological expression of mutant CALR using CRISPR/Cas9 gene editing techniques for cell lines, and as for the mouse model, we generated a transgenic mice (TG) expressing human CALR del52 mutant. We used two human cell lines expressing MPL: human acute megakaryoblastic leukemia cell line CMK11-5 which expressed endogenous MPL, and F-36P-MPL cell line which was generated by introducing MPL to GM-CSF-dependent erythroleukemia cell line F-36P. Plasmids coexpressing hCas9 and single-guide RNA were prepared by ligating oligonucleotides (5'-CACCGACAAGAAACGCAAAGAGGAGG-3', 5'-AAACCCTCCTCTTTGCGTTTCTTGTC-3') for the target sequence of human CALR exon 9 into pX330. The plasmids were introduced with a electroporator to each of the cell lines. After limiting dilution cloning, we identified cell lines which have indel mutation at the target site. We produced two types of CMK11-5 subline knocked in a CALR mutation, namely CALR del25 CMK cells and CALR del25/del17 CMK cells, respectively. The former lacks 25 bases in one CALR allele, causing a frameshift that results in a protein resembling human CALR mutant, while the latter lacks an additional 17 bases in another allele in CALR exon 9 and induces a frameshift that causes a deletion in CALR exon 9. Both kinds of CALR mutant CMK11-5 cells showed increased cell proliferation compared to WT cells. We also produced one type of F-36P-MPL subline, CALR del1/ins1 F-36P-MPL cells which had 1 base deletion in one CALR allele resembling human mutation and 1 base insertion in another allele. Though the growth of this subline in the presence of GM-CSF was comparable to WT cells, it showed GM-CSF independent autonomous cell growth. We generated TG mice expressing human CALR del52 mutant driven by the murine H2Kb promoter. We compared the expression level of human CALR mRNA in TG BM cells with the expression of endogenous WT CALR in human cell lines (CMK11-5, F-36P-MPL, CHRF288) using Rn18s as an endogenous control. The expression of human CALR in TG BM was approximately 0.6 times that of endogenous WT CALR in human cell lines, and the physiological expression level was obtained. They exhibited thrombocytosis, with platelet (PLT) counts as high as 2,000 x 109/L. Leukocyte number and the proportion of granulocytes and T and B lymphocytes, were comparable to WT mice. CALR mutation had no impact on Hb level or spleen weight. There was a striking difference in the number of megakaryocytes (Mgks), which was 2-fold higher in BM from TG mice than in WT mice. The TG Mgks were also more mature, with larger diameter, and contained higher number of alpha-granules compared to WT cells. In one year of observation, there is no fibrosis in BM. These observations showed that TG developed human ET-like disease. The survival of TG mice was comparable to that of WT mice. The disease phenotype was transplantable into WT recipient mice. To characterize in detail the impact of MPNs induced by the CALR del52 mutant, we evaluated the frequencies of HSCs and progenitors in BM. The frequency of both LT-HSC and ST-HSC in BM was higher inTG mice compared to WT mice. The frequencies of progenitors (CMP, MEP, and MKP) were also greater in BM from TG mice than from WT mice. However, BM cells did not have enhanced replating capacity. We next examined whether or not ruxolitinib (RUX) treatment ameliorated thrombocytosis in TG mice. Either 90 mg/kg bid of RUX or vehicle was administrated to TG mice for 4 weeks.TG mice treated with vehicle showed a mean 16% increase in PLT count during the treatment period, probably due to the disease progression. RUX treatment attenuated the increase in the number of PLTs in TG mice by a mean of 22%, but the overall count was still higher than that in WT mice. BM sections showed that RUX reduced the Mgks number in TG. In summary, physiological expression of CALR mutant increases cell growth and cytokine independency in human cell lines expressing MPL, and develops ET in mice. RUX therapy attenuated the increased numbers of peripheral blood PLTs and BM Mgks, and ameliorated CALR mutation-induced ET. Disclosures No relevant conflicts of interest to declare.


1985 ◽  
Vol 231 (2) ◽  
pp. 309-313 ◽  
Author(s):  
G Opdenakker ◽  
A Billiau ◽  
G Volckaert ◽  
P de Somer

A labelled cDNA clone was used in DNA-RNA hybridization on nitrocellulose filter paper (dot-blot technique) to detect and quantify mRNA for endogenous tissue plasminogen activator (PA) in cell extracts and samples of RNA purification runs. Although, for detection purposes, the assay was less sensitive than translation in Xenopus oocytes, it was at least as reliable and much more convenient for the purpose of quantitative determination. In particular, the technique was used to study the kinetics of PA mRNA formation in a human melanoma cell line (Bowes) after exposure to the tumour promoter 12-O-tetradecanoylphorbol 13-acetate (TPA). Incubation of the cells with TPA resulted in a 15-20-fold increase in cellular PA mRNA content. The effect was time- and dose-dependent: the increase in PA-specific mRNA was clearly visible as early as 4 h after initiation of TPA treatment in Bowes cells. It was blocked completely by pretreatment of the cells with actinomycin D, indicating that TPA caused enhancement of synthesis of PA mRNA rather than inhibition of PA mRNA degradation. The use of the nitrocellulose dot-blot technique also revealed that two non-human cell lines produce mRNAs which cross-react with the human PA mRNA, namely the mouse melanoma cell line B16 and the rat brain-tumour cell line, RT4-71-1. TPA was found to exert similar stimulatory effects on the synthesis of mRNAs in these cell lines as in Bowes cells.


2017 ◽  
Vol 114 (7) ◽  
pp. E1291-E1300 ◽  
Author(s):  
Sharon R. Grossman ◽  
Xiaolan Zhang ◽  
Li Wang ◽  
Jesse Engreitz ◽  
Alexandre Melnikov ◽  
...  

Enhancers regulate gene expression through the binding of sequence-specific transcription factors (TFs) to cognate motifs. Various features influence TF binding and enhancer function—including the chromatin state of the genomic locus, the affinities of the binding site, the activity of the bound TFs, and interactions among TFs. However, the precise nature and relative contributions of these features remain unclear. Here, we used massively parallel reporter assays (MPRAs) involving 32,115 natural and synthetic enhancers, together with high-throughput in vivo binding assays, to systematically dissect the contribution of each of these features to the binding and activity of genomic regulatory elements that contain motifs for PPARγ, a TF that serves as a key regulator of adipogenesis. We show that distinct sets of features govern PPARγ binding vs. enhancer activity. PPARγ binding is largely governed by the affinity of the specific motif site and higher-order features of the larger genomic locus, such as chromatin accessibility. In contrast, the enhancer activity of PPARγ binding sites depends on varying contributions from dozens of TFs in the immediate vicinity, including interactions between combinations of these TFs. Different pairs of motifs follow different interaction rules, including subadditive, additive, and superadditive interactions among specific classes of TFs, with both spatially constrained and flexible grammars. Our results provide a paradigm for the systematic characterization of the genomic features underlying regulatory elements, applicable to the design of synthetic regulatory elements or the interpretation of human genetic variation.


2021 ◽  
Author(s):  
Thomas Beder ◽  
Olufemi Aromolaran ◽  
Juergen Doenitz ◽  
Sofia Tapanelli ◽  
Eunice Oluwatobiloba Adedeji ◽  
...  

Identifying essential genes on a genome scale is resource intensive and has been performed for only a few eukaryotes. For less studied organisms essentiality might be predicted by gene homology. However, this approach cannot be applied to non-conserved genes. Additionally, divergent essentiality information is obtained from studying single cells or whole, multi-cellular organisms, and particularly when derived from human cell line screens and human population studies. We employed machine learning across six model eukaryotes and 60,381 genes, using 41,635 features derived from sequence, gene functions and network topology. Within a leave-one-organism-out cross-validation, the classifiers showed a high generalizability with an average accuracy close to 80% in the left-out species. As a case study, we applied the method to Tribolium castaneum and validated predictions experimentally yielding similar performance. Finally, using the classifier based on the studied model organisms enabled linking the essentiality information of human cell line screens and population studies.


2016 ◽  
Author(s):  
Maxwell W. Libbrecht ◽  
Oscar Rodriguez ◽  
Zhiping Weng ◽  
Jeffrey A. Bilmes ◽  
Michael M. Hoffman ◽  
...  

AbstractSemi-automated genome annotation methods such as Segway enable understanding of chromatin activity. Here we present chromatin state annotations of 164 human cell types using 1,615 genomics data sets. To produce these annotations, we developed a fully-automated annotation strategy in which we train separate unsupervised annotation models on each cell type and use a machine learning classifier to automate the state interpretation step. Using these annotations, we developed a measure of the importance of each genomic position called the “conservation-associated activity score,” which we use to aggregate information across cell types into a multi-cell type view. The aggregated conservation-associated activity score provides a measure of importance directly attributable to a specific activity in a specific set of cell types. In contrast to evolutionary conservation, this measure is not biased to detect only elements shared with related species. Using the conservation-associated activity score, we combined all our annotations into a single, cell type-agnostic encyclopedia that catalogs all human transcriptional and regulatory elements, enabling easy and intuitive interpretation of the effect of genome variants on phenotype, such as in disease-associated, evolutionarily conserved or positively selected loci. These resources, including cell type-specific annotations, encyclopedia, and a visualization server, are available at http://noble.gs.washington.edu/proj/encyclopedia.Author SummaryGenome annotation algorithms are an effective class of tools for understanding the function of the genome. These algorithms take as input a set of genome-wide measurements about the activity at each base pair in a given tissue, such as where a given protein is binding or how accessible the DNA is to being read by a protein. The genome is then partitioned and each segment is assigned a label such that positions with the same label exhibit similar patterns in the input data. Such annotations are widely used for many applications, such as to understand the mechanism of impact of a given genetic variant. Here we present, to our knowledge, the most comprehensive set of genome annotations created so far, encompassing 164 human cell types and including 1,615 genomics data sets. These comprehensive annotations are made possible by a strategy that automates the previous interpretation step. Furthermore, we present several methodological innovations that make these genome annotations more useful.


Sign in / Sign up

Export Citation Format

Share Document