scholarly journals The intersectional genetics landscape for human

2019 ◽  
Author(s):  
Andre Macedo ◽  
Alisson M. Gontijo

The human body is made up of hundreds, perhaps thousands of cell types and states, most of which are currently inaccessible genetically. Genetic accessibility carries significant diagnostic and therapeutic potential by allowing the selective delivery of genetic messages or cures to cells. Research in model organisms has shown that single regulatory element (RE) activities are seldom cell type specific, limiting their usage in genetic systems designed to restrict gene expression posteriorly to their delivery to cells. Intersectional genetic approaches can increase the number of genetically accessible cells. A typical intersectional method acts like an AND logic gate by converting the input of two or more active REs into a single synthetic output, which becomes unique for that cell. Here, we systematically assessed the intersectional genetics landscape of human using a curated subset of cells from a large RE usage atlas obtained by Cap Analysis of Gene Expression Sequencing (CAGE-Seq) of thousands of primary and cancer cells (the FANTOM5 consortium atlas). We developed the heuristics and algorithms to retrieve and quality rank AND gate intersections intra- and inter-individually. We find that >90% of the 154 primary cell types surveyed can be distinguished from each other with as little as 3 to 4 active REs, with quantifiable safety and robustness. We call these minimal intersections of active REs with cell-type diagnostic potential “Versatile Entry Codes” (VEnCodes). We show that VEnCodes could be found for 100% of the 158 cancer cell types surveyed, and that most of these are highly robust to intra- and interindividual variation. Our tools for generating and quality-ranking VEnCodes can be adapted to other RE usage databases and to other intersectional methods using alternative Boolean logic operations. Our work demonstrate the potential of intersectional approaches for future gene delivery technologies in human.

GigaScience ◽  
2020 ◽  
Vol 9 (8) ◽  
Author(s):  
Andre Macedo ◽  
Alisson M Gontijo

ABSTRACT Background The human body is made up of hundreds—perhaps thousands—of cell types and states, most of which are currently inaccessible genetically. Intersectional genetic approaches can increase the number of genetically accessible cells, but the scope and safety of these approaches have not been systematically assessed. A typical intersectional method acts like an “AND" logic gate by converting the input of 2 or more active, yet unspecific, regulatory elements (REs) into a single cell type specific synthetic output. Results Here, we systematically assessed the intersectional genetics landscape of the human genome using a subset of cells from a large RE usage atlas (Functional ANnoTation Of the Mammalian genome 5 consortium, FANTOM5) obtained by cap analysis of gene expression sequencing (CAGE-seq). We developed the heuristics and algorithms to retrieve and quality-rank “AND" gate intersections. Of the 154 primary cell types surveyed, >90% can be distinguished from each other with as few as 3 to 4 active REs, with quantifiable safety and robustness. We call these minimal intersections of active REs with cell-type diagnostic potential “versatile entry codes" (VEnCodes). Each of the 158 cancer cell types surveyed could also be distinguished from the healthy primary cell types with small VEnCodes, most of which were robust to intra- and interindividual variation. Methods for the cross-validation of CAGE-seq–derived VEnCodes and for the extraction of VEnCodes from pooled single-cell sequencing data are also presented. Conclusions Our work provides a systematic view of the intersectional genetics landscape in humans and demonstrates the potential of these approaches for future gene delivery technologies.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
John A. Halsall ◽  
Simon Andrews ◽  
Felix Krueger ◽  
Charlotte E. Rutledge ◽  
Gabriella Ficz ◽  
...  

AbstractChromatin configuration influences gene expression in eukaryotes at multiple levels, from individual nucleosomes to chromatin domains several Mb long. Post-translational modifications (PTM) of core histones seem to be involved in chromatin structural transitions, but how remains unclear. To explore this, we used ChIP-seq and two cell types, HeLa and lymphoblastoid (LCL), to define how changes in chromatin packaging through the cell cycle influence the distributions of three transcription-associated histone modifications, H3K9ac, H3K4me3 and H3K27me3. We show that chromosome regions (bands) of 10–50 Mb, detectable by immunofluorescence microscopy of metaphase (M) chromosomes, are also present in G1 and G2. They comprise 1–5 Mb sub-bands that differ between HeLa and LCL but remain consistent through the cell cycle. The same sub-bands are defined by H3K9ac and H3K4me3, while H3K27me3 spreads more widely. We found little change between cell cycle phases, whether compared by 5 Kb rolling windows or when analysis was restricted to functional elements such as transcription start sites and topologically associating domains. Only a small number of genes showed cell-cycle related changes: at genes encoding proteins involved in mitosis, H3K9 became highly acetylated in G2M, possibly because of ongoing transcription. In conclusion, modified histone isoforms H3K9ac, H3K4me3 and H3K27me3 exhibit a characteristic genomic distribution at resolutions of 1 Mb and below that differs between HeLa and lymphoblastoid cells but remains remarkably consistent through the cell cycle. We suggest that this cell-type-specific chromosomal bar-code is part of a homeostatic mechanism by which cells retain their characteristic gene expression patterns, and hence their identity, through multiple mitoses.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Bing He ◽  
Ping Chen ◽  
Sonia Zambrano ◽  
Dina Dabaghie ◽  
Yizhou Hu ◽  
...  

AbstractMolecular characterization of the individual cell types in human kidney as well as model organisms are critical in defining organ function and understanding translational aspects of biomedical research. Previous studies have uncovered gene expression profiles of several kidney glomerular cell types, however, important cells, including mesangial (MCs) and glomerular parietal epithelial cells (PECs), are missing or incompletely described, and a systematic comparison between mouse and human kidney is lacking. To this end, we use Smart-seq2 to profile 4332 individual glomerulus-associated cells isolated from human living donor renal biopsies and mouse kidney. The analysis reveals genetic programs for all four glomerular cell types (podocytes, glomerular endothelial cells, MCs and PECs) as well as rare glomerulus-associated macula densa cells. Importantly, we detect heterogeneity in glomerulus-associated Pdgfrb-expressing cells, including bona fide intraglomerular MCs with the functionally active phagocytic molecular machinery, as well as a unique mural cell type located in the central stalk region of the glomerulus tuft. Furthermore, we observe remarkable species differences in the individual gene expression profiles of defined glomerular cell types that highlight translational challenges in the field and provide a guide to design translational studies.


1985 ◽  
Vol 5 (2) ◽  
pp. 419-421
Author(s):  
K M Zezulak ◽  
H Green

During the differentiation of preadipose 3T3 cells into adipose cells, the mRNAs for three proteins increase strikingly in abundance. To determine the degree of cell-type specificity in the expression of these mRNAs, we estimated their abundances in several nonadipose tissues of the mouse. None of these mRNAs was strictly confined to adipocytes, but the ensemble of three mRNAs was rather specific to adipocytes. Insofar as is revealed by these three markers, the distinctive phenotype of adipocytes is the result of the enhanced expression of a number of genes, none of which is completely silent in all other cell types.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Sinisa Hrvatin ◽  
Christopher P Tzeng ◽  
M Aurel Nagy ◽  
Hume Stroud ◽  
Charalampia Koutsioumpa ◽  
...  

Enhancers are the primary DNA regulatory elements that confer cell type specificity of gene expression. Recent studies characterizing individual enhancers have revealed their potential to direct heterologous gene expression in a highly cell-type-specific manner. However, it has not yet been possible to systematically identify and test the function of enhancers for each of the many cell types in an organism. We have developed PESCA, a scalable and generalizable method that leverages ATAC- and single-cell RNA-sequencing protocols, to characterize cell-type-specific enhancers that should enable genetic access and perturbation of gene function across mammalian cell types. Focusing on the highly heterogeneous mammalian cerebral cortex, we apply PESCA to find enhancers and generate viral reagents capable of accessing and manipulating a subset of somatostatin-expressing cortical interneurons with high specificity. This study demonstrates the utility of this platform for developing new cell-type-specific viral reagents, with significant implications for both basic and translational research.


2012 ◽  
Vol 93 (5) ◽  
pp. 1046-1058 ◽  
Author(s):  
James C. Towler ◽  
Bahram Ebrahimi ◽  
Brian Lane ◽  
Andrew J. Davison ◽  
Derrick J. Dargan

Broad cell tropism contributes to the pathogenesis of human cytomegalovirus (HCMV), but the extent to which cell type influences HCMV gene expression is unclear. A bespoke HCMV DNA microarray was used to monitor the transcriptome activity of the low passage Merlin strain of HCMV at 12, 24, 48 and 72 h post-infection, during a single round of replication in human fetal foreskin fibroblast cells (HFFF-2s), human retinal pigmented epithelial cells (RPE-1s) and human astrocytoma cells (U373MGs). In order to correlate transcriptome activity with concurrent biological responses, viral cytopathic effect, growth kinetics and genomic loads were examined in the three cell types. The temporal expression pattern of viral genes was broadly similar in HFFF-2s and RPE-1s, but dramatically different in U373MGs. Of the 165 known HCMV protein-coding genes, 41 and 48 were differentially regulated in RPE-1s and U373MGs, respectively, compared with HFFF-2s, and 22 of these were differentially regulated in both RPE-1s and U373MGs. In RPE-1s, all differentially regulated genes were downregulated, but, in U373MGs, some were down- and others upregulated. Differentially regulated genes were identified among the immediate-early, early, early late and true-late viral gene classes. Grouping of downregulated genes according to function at landmark stages of the replication cycle led to the identification of potential bottleneck stages (genome replication, virion assembly, and virion maturation and release) that may account for cell type-dependent viral growth kinetics. The possibility that cell type-specific differences in expressed cellular factors are responsible for modulation of viral gene expression is discussed.


2022 ◽  
Author(s):  
Takaho Tsuchiya ◽  
Hiroki Hori ◽  
Haruka Ozaki

Motivation: Cell-cell communications regulate internal cellular states of the cell, e.g., gene expression and cell functions, and play pivotal roles in normal development and disease states. Furthermore, single-cell RNA sequencing methods have revealed cell-to-cell expression variability of highly variable genes (HVGs), which is also crucial. Nevertheless, the regulation on cell-to-cell expression variability of HVGs via cell-cell communications is still unexplored. The recent advent of spatial transcriptome measurement methods has linked gene expression profiles to the spatial context of single cells, which has provided opportunities to reveal those regulations. The existing computational methods extract genes with expression levels that are influenced by neighboring cell types based on the spatial transcriptome data. However, limitations remain in the quantitativeness and interpretability: it neither focuses on HVGs, considers cooperation of neighboring cell types, nor quantifies the degree of regulation with each neighboring cell type. Results: Here, we propose CCPLS (Cell-Cell communications analysis by Partial Least Square regression modeling), which is a statistical framework for identifying cell-cell communications as the effects of multiple neighboring cell types on cell-to-cell expression variability of HVGs, based on the spatial transcriptome data. For each cell type, CCPLS performs PLS regression modeling and reports coefficients as the quantitative index of the cell-cell communications. Evaluation using simulated data showed our method accurately estimated effects of multiple neighboring cell types on HVGs. Furthermore, by applying CCPLS to the two real datasets, we demonstrate CCPLS can be used to extract biologically interpretable insights from the inferred cell-cell communications.


2020 ◽  
Author(s):  
Devanshi Patel ◽  
Xiaoling Zhang ◽  
John J. Farrell ◽  
Jaeyoon Chung ◽  
Thor D. Stein ◽  
...  

ABSTRACTBecause regulation of gene expression is heritable and context-dependent, we investigated AD-related gene expression patterns in cell-types in blood and brain. Cis-expression quantitative trait locus (eQTL) mapping was performed genome-wide in blood from 5,257 Framingham Heart Study (FHS) participants and in brain donated by 475 Religious Orders Study/Memory & Aging Project (ROSMAP) participants. The association of gene expression with genotypes for all cis SNPs within 1Mb of genes was evaluated using linear regression models for unrelated subjects and linear mixed models for related subjects. Cell type-specific eQTL (ct-eQTL) models included an interaction term for expression of “proxy” genes that discriminate particular cell type. Ct-eQTL analysis identified 11,649 and 2,533 additional significant gene-SNP eQTL pairs in brain and blood, respectively, that were not detected in generic eQTL analysis. Of note, 386 unique target eGenes of significant eQTLs shared between blood and brain were enriched in apoptosis and Wnt signaling pathways. Five of these shared genes are established AD loci. The potential importance and relevance to AD of significant results in myeloid cell-types is supported by the observation that a large portion of GWS ct-eQTLs map within 1Mb of established AD loci and 58% (23/40) of the most significant eGenes in these eQTLs have previously been implicated in AD. This study identified cell-type specific expression patterns for established and potentially novel AD genes, found additional evidence for the role of myeloid cells in AD risk, and discovered potential novel blood and brain AD biomarkers that highlight the importance of cell-type specific analysis.


Genetics ◽  
2020 ◽  
Vol 216 (4) ◽  
pp. 891-903
Author(s):  
Ishara S. Ariyapala ◽  
Jessica M. Holsopple ◽  
Ellen M. Popodi ◽  
Dalton G. Hartwick ◽  
Lily Kahsai ◽  
...  

The Drosophila adult midgut is a model epithelial tissue composed of a few major cell types with distinct regional identities. One of the limitations to its analysis is the lack of tools to manipulate gene expression based on these regional identities. To overcome this obstacle, we applied the intersectional split-GAL4 system to the adult midgut and report 653 driver combinations that label cells by region and cell type. We first identified 424 split-GAL4 drivers with midgut expression from ∼7300 drivers screened, and then evaluated the expression patterns of each of these 424 when paired with three reference drivers that report activity specifically in progenitor cells, enteroendocrine cells, or enterocytes. We also evaluated a subset of the drivers expressed in progenitor cells for expression in enteroblasts using another reference driver. We show that driver combinations can define novel cell populations by identifying a driver that marks a distinct subset of enteroendocrine cells expressing genes usually associated with progenitor cells. The regional cell type patterns associated with the entire set of driver combinations are documented in a freely available website, providing information for the design of thousands of additional driver combinations to experimentally manipulate small subsets of intestinal cells. In addition, we show that intestinal enhancers identified with the split-GAL4 system can confer equivalent expression patterns on other transgenic reporters. Altogether, the resource reported here will enable more precisely targeted gene expression for studying intestinal processes, epithelial cell functions, and diseases affecting self-renewing tissues.


eLife ◽  
2017 ◽  
Vol 6 ◽  
Author(s):  
Julien Racle ◽  
Kaat de Jonge ◽  
Petra Baumgaertner ◽  
Daniel E Speiser ◽  
David Gfeller

Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).


Sign in / Sign up

Export Citation Format

Share Document