Cell-type-specific resolution epigenetics without the need for cell sorting or single-cell biology

Mapping Intimacies ◽

10.1101/437368 ◽

2018 ◽

Author(s):

Elior Rahmani ◽

Regev Schweiger ◽

Brooke Rhead ◽

Lindsey A. Criswell ◽

Lisa F. Barcellos ◽

...

Keyword(s):

Single Cell ◽

Cell Sorting ◽

Cell Biology ◽

Large Scale ◽

Tissue Level ◽

Cell Type ◽

Validation Data ◽

Cell Type Specific ◽

Disease Associated Genes ◽

The Individual

Abstract. High costs and technical limitations of cell sorting and single-cell techniques currently restrict the collection of large-scale, cell-type-specific DNA methylation data. This, in turn, impedes our ability to tackle key biological questions that pertain to variation within a population, such as identification of disease-associated genes at a cell-type-specific resolution. Here, we show mathematically and empirically that cell-type-specific methylation levels of an individual can be learned from its tissue-level bulk data, conceptually emulating the case where the individual has been profiled at single-cell resolution and the signals were then aggregated in each cell population separately. Provided with this unprecedented way to perform powerful large-scale epigenetic studies with cell-type-specific resolution, we revisit previous studies with tissue-level bulk methylation and reveal novel associations with leukocyte composition in blood and with rheumatoid arthritis. For the latter, we further show consistency with validation data collected from sorted leukocyte sub-types. Corresponding software is available from: https://github.com/cozygene/TCA.
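The relationship between bulk and cell-type-specific signals described in this abstract can be illustrated with a small simulation. The sketch below is not the authors' method and does not use the TCA software: it assumes known cell-type proportions, invents all values (`W`, `mu`, the noise level), and recovers only population-level mean methylation per cell type with ordinary least squares, whereas the paper targets individual-level values.

```python
import numpy as np

rng = np.random.default_rng(0)

n_individuals = 200

# Hypothetical cell-type proportions per individual (each row sums to 1).
W = rng.dirichlet(alpha=[4.0, 2.0, 1.0], size=n_individuals)

# Hypothetical mean methylation level of one CpG site in each cell type.
mu = np.array([0.2, 0.5, 0.8])

# Bulk signal: proportion-weighted mixture plus small measurement noise.
bulk = W @ mu + rng.normal(scale=0.01, size=n_individuals)

# Least-squares estimate of the cell-type means from bulk data and proportions.
mu_hat, *_ = np.linalg.lstsq(W, bulk, rcond=None)

print("true means:     ", mu)
print("estimated means:", np.round(mu_hat, 3))
```

The estimate is only as good as the variation in proportions across individuals: if every individual had the same cell-type composition, the columns of `W` would be collinear and the means could not be separated.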


Cell-type-specific resolution epigenetics without the need for cell sorting or single-cell biology

Nature Communications ◽

10.1038/s41467-019-11052-9 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 14

Author(s):

Elior Rahmani ◽

Regev Schweiger ◽

Brooke Rhead ◽

Lindsey A. Criswell ◽

Lisa F. Barcellos ◽

...

Keyword(s):

Single Cell ◽

Cell Sorting ◽

Cell Biology ◽

Cell Type ◽

Cell Type Specific


Automated population identification and sorting algorithms for high-dimensional single-cell data

10.1101/046664 ◽

2016 ◽

Cited By ~ 1

Author(s):

Benedict Anchang ◽

Sylvia K. Plevritis

Keyword(s):

Single Cell ◽

Cell Sorting ◽

Intracellular Signaling ◽

Expert Knowledge ◽

Single Cells ◽

Cell Type ◽

Experimental Conditions ◽

Cell Type Specific ◽

Cell Subpopulations ◽

Cell Data

Abstract. Cell sorting or gating homogeneous subpopulations from single-cell data enables cell-type-specific characterization, such as cell-type genomic profiling as well as the study of tumor progression. This highlight summarizes recently developed automated gating algorithms that are optimized for both population identification and sorting homogeneous single cells in heterogeneous single-cell data. Data-driven gating strategies identify and/or sort homogeneous subpopulations from a heterogeneous population without relying on expert knowledge, thereby removing human bias and variability. We further describe an optimized cell sorting strategy called CCAST, based on Clustering, Classification and Sorting Trees, which identifies the relevant gating markers, gating hierarchy and partitions that define underlying cell subpopulations. CCAST identifies more homogeneous subpopulations in several applications compared to prior sorting strategies and reveals simultaneous intracellular signaling across different lineage subtypes under different experimental conditions.


Co-expression enrichment analysis at the single-cell level reveals convergent defects in neural progenitor cells and their cell-type transitions in neurodevelopmental disorders

10.1101/2020.02.13.948315 ◽

2020 ◽

Author(s):

Kaifang Pang ◽

Li Wang ◽

Wei Wang ◽

Jian Zhou ◽

Chao Cheng ◽

...

Keyword(s):

Single Cell ◽

Neurodevelopmental Disorders ◽

Progenitor Cell ◽

Large Scale ◽

Radial Glia ◽

Neural Progenitor ◽

Autism Spectrum ◽

Cell Type ◽

Sequencing Data ◽

Cell Type Specific

Abstract. Recent large-scale sequencing studies have identified a great number of genes whose disruptions cause neurodevelopmental disorders (NDDs). However, cell-type-specific functions of NDD genes and their contributions to NDD pathology are unclear. Here, we integrated NDD genetics with single-cell RNA sequencing data to identify cell-type and temporal convergence of genes involved in different NDDs. By assessing the co-expression enrichment pattern of various NDD gene sets, we identified mid-fetal cortical neural progenitor cell development (more specifically, the ventricular radial glia-to-intermediate progenitor cell transition at gestational week 10) as a key convergent point in autism spectrum disorder (ASD) and epilepsy. Integrated gene ontology-based analyses further revealed that ASD genes function as upstream regulators to activate neural differentiation and inhibit cell cycle during the transition, whereas epilepsy genes function as downstream effectors in the same processes, offering a potential explanation for the high comorbidity rate of the two disorders. Together, our study provides a framework for investigating the cell-type-specific pathophysiology of NDDs.


Large-scale genetic association and single cell accessible chromatin mapping defines cell type-specific mechanisms of type 1 diabetes risk

10.1101/2021.01.13.426472 ◽

2021 ◽

Author(s):

Joshua Chiou ◽

Ryan J Geusz ◽

Mei-Lin Okino ◽

Jee Yun Han ◽

Michael Miller ◽

...

Keyword(s):

Single Cell ◽

Large Scale ◽

Complex Disease ◽

Cell Types ◽

Causal Variant ◽

Cell Type ◽

Ductal Cells ◽

Risk Variants ◽

Cell Type Specific

Abstract. Translating genome-wide association studies (GWAS) of complex disease into mechanistic insight requires a comprehensive understanding of risk variant effects on disease-relevant cell types. To uncover cell type-specific mechanisms of type 1 diabetes (T1D) risk, we combined genetic association mapping and single cell epigenomics. We performed the largest to-date GWAS of T1D in 489,679 samples imputed into 59.2M variants, which identified 74 novel association signals including several large-effect rare variants. Fine-mapping of 141 total signals substantially improved resolution of causal variant credible sets, which primarily mapped to non-coding sequence. To annotate cell type-specific regulatory mechanisms of T1D risk variants, we mapped 448,142 candidate cis-regulatory elements (cCREs) in pancreas and peripheral blood mononuclear cell types using snATAC-seq of 131,554 nuclei. T1D risk variants were enriched in cCREs active in CD4+ T cells as well as several additional cell types including pancreatic exocrine acinar and ductal cells. High-probability T1D risk variants at multiple signals mapped to exocrine-specific cCREs including novel loci near CEL, GP2 and CFTR. At the CFTR locus, the likely causal variant rs7795896 mapped to a ductal-specific distal cCRE that regulated CFTR, and the risk allele reduced transcription factor binding, enhancer activity and CFTR expression in ductal cells. These findings support a role for the exocrine pancreas in T1D pathogenesis and highlight the power of combining large-scale GWAS and single cell epigenomics to provide insight into the cellular origins of complex disease.


Cnidarian cell type diversity revealed by whole-organism single-cell RNA-seq analysis

10.1101/201103 ◽

2017 ◽

Author(s):

Arnau Sebé-Pedrós ◽

Elad Chomsky ◽

Baptiste Saudememont ◽

Marie-Pierre Mailhe ◽

Flora Pleisser ◽

...

Keyword(s):

Single Cell ◽

Cell Types ◽

Tissue Level ◽

Rna Seq ◽

Cell Type ◽

Specific Expression ◽

Animal Evolution ◽

Neuronal Markers ◽

Cell Type Specific Expression ◽

Cell Type Specific

Abstract. A hallmark of animal evolution is the emergence and diversification of cell type-specific transcriptional states. But systematic and unbiased characterization of differentiated gene regulatory programs has so far been limited to specific tissues in a few model species. Here, we perform whole-organism single cell transcriptomics to map cell types in the cnidarian Nematostella vectensis, a non-bilaterian animal that displays complex tissue-level bodyplan organization. We uncover high diversity of transcriptional states in Nematostella, demonstrating cell type-specific expression for 35% of the genes and 51% of the transcription factors (TFs) detected. We identify eight broad cell clusters corresponding to cell classes such as neurons, muscles, cnidocytes, or digestive cells. These clusters comprise multiple cell modules expressing diverse and specific markers, uncovering in particular a rich repertoire of cells associated with neuronal markers. TF expression and sequence analysis defines the combinatorial code that underlies this cell-specific expression. It also reveals the existence of a complex regulatory lexicon of TF binding motifs encoded at both enhancers and promoters of Nematostella tissue-specific genes. Whole organism single cell RNA-seq is thereby established as a tool for comprehensive study of genome regulation and cell type evolution.


Comprehensive Cell Type Specific Transcriptomics of the Human Kidney

10.1101/238063 ◽

2017 ◽

Cited By ~ 7

Author(s):

V Sivakamasundari ◽

Mohan Bolisetty ◽

Santhosh Sivajothi ◽

Shannon Bessonett ◽

Diane Ruan ◽

...

Keyword(s):

Human Kidney ◽

Cell Types ◽

Cell Type ◽

Vascular Integrity ◽

Normal Human Kidney ◽

Normal Human ◽

Cell Type Specific ◽

Disease Associated Genes ◽

Transcriptome Resource ◽

The Individual

Abstract. The human kidney is a complex organ composed of specialized cell types. To better define this cellular complexity, we profiled the individual transcriptomes of 22,469 normal human kidney cells, identifying 27 cell types. We describe three distinct endothelial cell populations, a novel subset of intercalated cells, interstitial macrophages and dendritic cells, and identify numerous novel cell-type-specific markers, many validated using imaging mass cytometry and immunohistochemistry. Receptor-ligand analysis revealed previously unknown intercalated-endothelial and intercalated-distal nephron interactions, suggesting a role in maintenance of vascular integrity and intercalated cell survival. Notably, kidney disease-associated genes were largely expressed in proximal tubules, podocytes, endothelial and myeloid cells, highlighting an underappreciated role for endothelial cells in kidney pathologies. Our analysis also provides a resource of cell type enriched markers, solute carriers, channels and lncRNAs. In summary, this cell-type-specific transcriptome resource provides the foundation for a comprehensive understanding of kidney function and dysfunction at single cell resolution.


Conditional out-of-distribution generation for unpaired data using transfer VAE

Bioinformatics ◽

10.1093/bioinformatics/btaa800 ◽

2020 ◽

Vol 36 (Supplement_2) ◽

pp. i610-i617

Author(s):

Mohammad Lotfollahi ◽

Mohsen Naghipourfar ◽

Fabian J Theis ◽

F Alexander Wolf

Keyword(s):

Single Cell ◽

Generative Models ◽

Response To Treatment ◽

High Dimensional ◽

Compact Representation ◽

Hair Color ◽

Great Success ◽

Cell Type ◽

Style Transfer ◽

Cell Type Specific

Abstract. Motivation: While generative models have shown great success in sampling high-dimensional samples conditional on low-dimensional descriptors (stroke thickness in MNIST, hair color in CelebA, speaker identity in WaveNet), their generation out-of-distribution poses fundamental problems due to the difficulty of learning a compact joint distribution across conditions. The canonical example of the conditional variational autoencoder (CVAE), for instance, does not explicitly relate conditions during training and, hence, has no explicit incentive to learn such a compact representation. Results: We overcome the limitation of the CVAE by matching distributions across conditions using maximum mean discrepancy in the decoder layer that follows the bottleneck. This introduces a strong regularization both for reconstructing samples within the same condition and for transforming samples across conditions, resulting in much improved generalization. As this amounts to solving a style-transfer problem, we refer to the model as transfer VAE (trVAE). Benchmarking trVAE on high-dimensional image and single-cell RNA-seq data, we demonstrate higher robustness and higher accuracy than existing approaches. We also show qualitatively improved predictions by tackling previously problematic minority classes and multiple conditions in the context of cellular perturbation response to treatment and disease based on high-dimensional single-cell gene expression data. For generic tasks, we improve Pearson correlations of high-dimensional estimated means and variances with their ground truths from 0.89 to 0.97 and 0.75 to 0.87, respectively. We further demonstrate that trVAE learns cell-type-specific responses after perturbation and improves the prediction of most cell-type-specific genes by 65%. Availability and implementation: The trVAE implementation is available via github.com/theislab/trvae. The results of this article can be reproduced via github.com/theislab/trvae_reproducibility.
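The maximum mean discrepancy (MMD) named in this abstract measures how far apart two sample distributions are. The sketch below is a generic illustration, not code from the trVAE repository: it assumes a Gaussian (RBF) kernel with an arbitrary bandwidth `gamma`, uses the simple biased estimator, and runs on synthetic two-dimensional data. The function and variable names are hypothetical.

```python
import numpy as np

def rbf_kernel(a, b, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def mmd2(x, y, gamma=1.0):
    """Biased estimate of the squared maximum mean discrepancy."""
    return (
        rbf_kernel(x, x, gamma).mean()
        + rbf_kernel(y, y, gamma).mean()
        - 2.0 * rbf_kernel(x, y, gamma).mean()
    )

rng = np.random.default_rng(0)
same_a = rng.normal(size=(300, 2))            # condition A
same_b = rng.normal(size=(300, 2))            # same distribution as A
shifted = rng.normal(loc=2.0, size=(300, 2))  # shifted distribution

mmd_same = mmd2(same_a, same_b)
mmd_shifted = mmd2(same_a, shifted)
print("MMD^2, same distribution:   ", mmd_same)
print("MMD^2, shifted distribution:", mmd_shifted)
```

Used as a penalty on a network layer, a term like `mmd2` pushes the representations of different conditions toward a shared distribution; where that penalty is applied in trVAE is described in the abstract, while the kernel and bandwidth here are assumptions.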


Comprehensive analysis of single cell ATAC-seq data with SnapATAC

Nature Communications ◽

10.1038/s41467-021-21583-9 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Rongxin Fang ◽

Sebastian Preissl ◽

Yang Li ◽

Xiaomeng Hou ◽

Jacinta Lucero ◽

...

Keyword(s):

Single Cell ◽

Single Cell Analysis ◽

Expression Patterns ◽

Regulatory Elements ◽

Cellular Heterogeneity ◽

Specific Gene ◽

Open Chromatin ◽

Cell Type ◽

Process Data ◽

Cell Type Specific

Abstract. Identification of the cis-regulatory elements controlling cell-type-specific gene expression patterns is essential for understanding the origin of cellular diversity. Conventional assays that map regulatory elements via open chromatin analysis of primary tissues are hindered by sample heterogeneity. Single cell analysis of accessible chromatin (scATAC-seq) can overcome this limitation. However, the high level of noise in each single cell profile and the large volume of data pose unique computational challenges. Here, we introduce SnapATAC, a software package for analyzing scATAC-seq datasets. SnapATAC dissects cellular heterogeneity in an unbiased manner and maps the trajectories of cellular states. Using the Nyström method, SnapATAC can process data from up to a million cells. Furthermore, SnapATAC incorporates existing tools into a comprehensive package for analyzing single cell ATAC-seq datasets. As a demonstration of its utility, SnapATAC is applied to 55,592 single-nucleus ATAC-seq profiles from the mouse secondary motor cortex. The analysis reveals ~370,000 candidate regulatory elements in 31 distinct cell populations in this brain region and infers candidate cell-type-specific transcriptional regulators.
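The Nyström method mentioned in this abstract approximates a large pairwise similarity matrix from a small set of landmark points, which is what makes very large cell counts tractable. The sketch below shows the generic technique on synthetic data and is not taken from SnapATAC: the RBF kernel, the bandwidth `gamma`, the number of landmarks and the pseudo-inverse cutoff are all assumptions chosen for illustration.

```python
import numpy as np

def rbf_kernel(a, b, gamma=0.5):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def nystrom_approximation(x, landmark_idx, gamma=0.5):
    """Approximate the full n x n kernel matrix as C @ pinv(W) @ C.T."""
    landmarks = x[landmark_idx]
    c = rbf_kernel(x, landmarks, gamma)          # n x m: all points vs landmarks
    w = rbf_kernel(landmarks, landmarks, gamma)  # m x m: landmarks vs landmarks
    # Second positional argument is the cutoff for small singular values.
    return c @ np.linalg.pinv(w, 1e-8) @ c.T

rng = np.random.default_rng(0)
x = rng.normal(size=(500, 2))  # synthetic stand-in for cell embeddings

k_full = rbf_kernel(x, x)
landmark_idx = rng.choice(len(x), size=50, replace=False)
k_approx = nystrom_approximation(x, landmark_idx)

rel_error = np.linalg.norm(k_full - k_approx) / np.linalg.norm(k_full)
print("relative Frobenius error:", rel_error)
```

In practice the full matrix `k_full` is never formed; it is computed here only to measure the approximation error on a problem small enough to check.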


An analytical method for the identification of cell type-specific disease gene modules

Journal of Translational Medicine ◽

10.1186/s12967-020-02690-5 ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Jinting Guan ◽

Yiping Lin ◽

Yang Wang ◽

Junchao Gao ◽

Guoli Ji

Keyword(s):

Disease Gene ◽

Gene Interaction ◽

Cell Types ◽

Autism Spectrum ◽

Specific Gene ◽

Cell Type ◽

Specific Disease ◽

Cell Type Specific ◽

Gene Modules ◽

Disease Associated Genes

Abstract. Background: Genome-wide association studies have identified genetic variants associated with the risk of brain-related diseases, such as neurological and psychiatric disorders, while the causal variants and the specific vulnerable cell types often remain to be studied. Many disease-associated genes are expressed in multiple cell types of human brains, while the pathologic variants affect primarily specific cell types. We hypothesize a model in which what determines the manifestation of a disease in a cell type is the presence of a disease module comprised of disease-associated genes, instead of individual genes. Therefore, it is essential to identify the presence/absence of disease gene modules in cells. Methods: To characterize the cell type-specificity of brain-related diseases, we construct human brain cell type-specific gene interaction networks integrating human brain nucleus gene expression data with a referenced tissue-specific gene interaction network. Then from the cell type-specific gene interaction networks, we identify significant cell type-specific disease gene modules by performing statistical tests. Results: Between neurons and glial cells, the constructed cell type-specific gene networks and their gene functions are distinct. Then we identify cell type-specific disease gene modules associated with autism spectrum disorder and find that different gene modules are formed and distinct gene functions may be dysregulated in different cells. We also study the similarity and dissimilarity in cell type-specific disease gene modules among autism spectrum disorder, schizophrenia and bipolar disorder. The functions of neuron-specific disease gene modules are associated with synapse for all three diseases, while those in glial cells are different. To facilitate the use of our method, we develop an R package, CtsDGM, for the identification of cell type-specific disease gene modules. Conclusions: The results support our hypothesis that a disease manifests itself in a cell type through forming a statistically significant disease gene module. The identification of cell type-specific disease gene modules can promote the development of more targeted biomarkers and treatments for the disease. Our method can be applied for depicting the cell type heterogeneity of a given disease, and also for studying the similarity and dissimilarity between different disorders, providing new insights into the molecular mechanisms underlying the pathogenesis and progression of diseases.


Single-cell RNA sequencing of the mammalian pineal gland identifies two pinealocyte subtypes and cell type-specific daily patterns of gene expression

PLoS ONE ◽

10.1371/journal.pone.0205883 ◽

2018 ◽

Vol 13 (10) ◽

pp. e0205883 ◽

Cited By ~ 9

Author(s):

Joseph C. Mays ◽

Michael C. Kelly ◽

Steven L. Coon ◽

Lynne Holtzclaw ◽

Martin F. Rath ◽

...

Keyword(s):

Gene Expression ◽

Pineal Gland ◽

Single Cell ◽

Rna Sequencing ◽

Cell Type ◽

Single Cell Rna Sequencing ◽

Cell Type Specific ◽

Mammalian Pineal Gland ◽

Daily Patterns
