Parallelized Inference for Single Cell Transcriptomic Clustering with Split Merge Sampling on DPMM Model

Mapping Intimacies ◽

10.1101/271163 ◽

2018 ◽

Author(s):

Tiehang Duan ◽

José P. Pinto ◽

Xiaohui Xie

Keyword(s):

Single Cell ◽

High Performance ◽

Clustering Methods ◽

Single Data Point ◽

Computational Speed ◽

Clustering Quality ◽

Single Data ◽

Cell Transcriptome ◽

Single Cell Transcriptome ◽

Mean Time

Motivation: With the development of droplet based systems, massive single cell transcriptome data has become available, which enables analysis of cellular and molecular processes at single cell resolution and is instrumental to understanding many biological processes. While state-of-the-art clustering methods have been applied to the data, they face challenges in the following aspects: (1) the clustering quality still needs to be improved; (2) most models need prior knowledge on number of clusters, which is not always available; (3) there is a demand for faster computational speed. Results: We propose to tackle these challenges with Parallelized Split Merge Sampling on Dirichlet Process Mixture Model (the Para-DPMM model). Unlike classic DPMM methods that perform sampling on each single data point, the split merge mechanism samples on the cluster level, which significantly improves convergence and optimality of the result. The model is highly parallelized and can utilize the computing power of high performance computing (HPC) clusters, enabling massive inference on huge datasets. Experiment results show the model achieves about 7% improvement in clustering accuracy for small datasets and more than 20% improvement for large challenging datasets compared with current widely used models. At the same time, the model's computing speed is significantly faster. Availability: Source code is publicly available on https://github.com/tiehangd/Para_DPMM/tree/master/Para_DPMM_package

Parallel clustering of single cell transcriptomic data with split-merge sampling on Dirichlet process mixtures

Bioinformatics ◽

10.1093/bioinformatics/bty702 ◽

2018 ◽

Vol 35 (6) ◽

pp. 953-961 ◽

Cited By ~ 3

Author(s):

Tiehang Duan ◽

José P Pinto ◽

Xiaohui Xie

Keyword(s):

Single Cell ◽

Dirichlet Process ◽

High Performance ◽

Supplementary Information ◽

Clustering Methods ◽

Dirichlet Process Mixture ◽

Computational Speed ◽

Clustering Quality ◽

Single Data ◽

Cell Transcriptome

Motivation: With the development of droplet based systems, massive single cell transcriptome data has become available, which enables analysis of cellular and molecular processes at single cell resolution and is instrumental to understanding many biological processes. While state-of-the-art clustering methods have been applied to the data, they face challenges in the following aspects: (i) the clustering quality still needs to be improved; (ii) most models need prior knowledge on number of clusters, which is not always available; (iii) there is a demand for faster computational speed. Results: We propose to tackle these challenges with Parallelized Split Merge Sampling on Dirichlet Process Mixture Model (the Para-DPMM model). Unlike classic DPMM methods that perform sampling on each single data point, the split merge mechanism samples on the cluster level, which significantly improves convergence and optimality of the result. The model is highly parallelized and can utilize the computing power of high performance computing (HPC) clusters, enabling massive inference on huge datasets. Experiment results show the model outperforms current widely used models in both clustering quality and computational speed. Availability and implementation: Source code is publicly available on https://github.com/tiehangd/Para_DPMM/tree/master/Para_DPMM_package. Supplementary information: Supplementary data are available at Bioinformatics online.
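
The abstract's point that a Dirichlet process mixture does not need the number of clusters fixed in advance can be illustrated with the Chinese restaurant process prior. The following is a minimal sketch only, not the Para-DPMM implementation; the function name `sample_crp` and the concentration parameter `alpha` are illustrative assumptions that do not come from the paper or its source code.

```python
import random


def sample_crp(n_points, alpha, seed=0):
    """Draw cluster assignments from a Chinese restaurant process prior.

    Hypothetical helper for illustration; alpha must be positive.
    """
    rng = random.Random(seed)
    assignments = []
    sizes = []  # sizes[k] = number of points currently in cluster k
    for _ in range(n_points):
        # An existing cluster k is chosen with weight sizes[k];
        # a brand-new cluster is opened with weight alpha.
        weights = sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(sizes):
            sizes.append(1)   # open a new cluster
        else:
            sizes[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments, sizes
```

Calling `sample_crp(1000, alpha=1.0)` returns a partition whose number of clusters comes out of the draw rather than being supplied by the user; larger values of `alpha` tend to yield more clusters. Split-merge moves, as described in the abstract, would propose changes to whole clusters instead of reassigning one point at a time; that step is not shown here.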

BAMM-SC: A Bayesian mixture model for clustering droplet-based single cell transcriptomic data from population studies

10.1101/392662 ◽

2018 ◽

Author(s):

Zhe Sun ◽

Li Chen ◽

Hongyi Xin ◽

Qianhui Huang ◽

Anthony R Cillo ◽

...

Keyword(s):

Single Cell ◽

Single Cells ◽

R Package ◽

Clustering Methods ◽

Model Framework ◽

Bayesian Hierarchical ◽

Bayesian Mixture ◽

Population Scale ◽

Cell Transcriptome ◽

Single Cell Transcriptome

The recently developed droplet-based single cell transcriptome sequencing (scRNA-seq) technology makes it feasible to perform a population-scale scRNA-seq study, in which the transcriptome is measured for tens of thousands of single cells from multiple individuals. Despite the advances of many clustering methods, there are few tailored methods for population-scale scRNA-seq studies. Here, we have developed a BAyesian Mixture Model for Single Cell sequencing (BAMM-SC) method to cluster scRNA-seq data from multiple individuals simultaneously. Specifically, BAMM-SC takes raw data as input and can account for data heterogeneity and batch effect among multiple individuals in a unified Bayesian hierarchical model framework. Results from extensive simulations and application of BAMM-SC to in-house scRNA-seq datasets using blood, lung and skin cells from humans or mice demonstrated that BAMM-SC outperformed existing clustering methods with improved clustering accuracy and reduced impact from batch effects. BAMM-SC has been implemented in a user-friendly R package with a detailed tutorial available on www.pitt.edu/~Cwec47/singlecell.html.

Time to map single cell transcriptome for a whole organism

Blood Science ◽

10.2478/bls-2018-0003 ◽

2018 ◽

Vol 0 (0) ◽

Author(s):

Ping Zhu ◽

Tao Cheng

Keyword(s):

Single Cell ◽

Cell Transcriptome ◽

Single Cell Transcriptome

Faculty Opinions recommendation of Pooled CRISPR screening with single-cell transcriptome readout.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.727218622.793528198 ◽

2017 ◽

Author(s):

Vijay Tiwari ◽

Hyobin Jeong

Keyword(s):

Single Cell ◽

Cell Transcriptome ◽

Single Cell Transcriptome

Single cell transcriptome atlas of mouse mammary epithelial cells across development

Breast Cancer Research ◽

10.1186/s13058-021-01445-4 ◽

2021 ◽

Vol 23 (1) ◽

Author(s):

Bhupinder Pal ◽

Yunshun Chen ◽

Michael J. G. Milevskiy ◽

François Vaillant ◽

Lexie Prokopuk ◽

...

Keyword(s):

Epithelial Cells ◽

Single Cell ◽

Developmental Stages ◽

Mammary Epithelial Cells ◽

Expression Profiles ◽

Single Cells ◽

Chromatin Accessibility ◽

Mammary Epithelial ◽

Cell Transcriptome ◽

Single Cell Transcriptome

Background: Heterogeneity within the mouse mammary epithelium and potential lineage relationships have been recently explored by single-cell RNA profiling. To further understand how cellular diversity changes during mammary ontogeny, we profiled single cells from nine different developmental stages spanning late embryogenesis, early postnatal, prepuberty, adult, mid-pregnancy, late-pregnancy, and post-involution, as well as the transcriptomes of micro-dissected terminal end buds (TEBs) and subtending ducts during puberty. Methods: The single cell transcriptomes of 132,599 mammary epithelial cells from 9 different developmental stages were determined on the 10x Genomics Chromium platform, and integrative analyses were performed to compare specific time points. Results: The mammary rudiment at E18.5 closely aligned with the basal lineage, while prepubertal epithelial cells exhibited lineage segregation but to a less differentiated state than their adult counterparts. Comparison of micro-dissected TEBs versus ducts showed that luminal cells within TEBs harbored intermediate expression profiles. Ductal basal cells exhibited increased chromatin accessibility of luminal genes compared to their TEB counterparts, suggesting that lineage-specific chromatin is established within the subtending ducts during puberty. An integrative analysis of five stages spanning the pregnancy cycle revealed distinct stage-specific profiles and the presence of cycling basal, mixed-lineage, and 'late' alveolar intermediates in pregnancy. Moreover, a number of intermediates were uncovered along the basal-luminal progenitor cell axis, suggesting a continuum of alveolar-restricted progenitor states. Conclusions: This extended single cell transcriptome atlas of mouse mammary epithelial cells provides the most complete coverage for mammary epithelial cells during morphogenesis to date. Together with chromatin accessibility analysis of TEB structures, it represents a valuable framework for understanding developmental decisions within the mouse mammary gland.

Single-cell transcriptome sequencing analysis reveals gynaecomastia to breast cancer transition

The Breast ◽

10.1016/s0960-9776(21)00109-0 ◽

2021 ◽

Vol 56 ◽

pp. S26

Author(s):

H. Sun ◽

C. Yang ◽

Z. Wang ◽

L. Shi ◽

T. Xia ◽

...

Keyword(s):

Breast Cancer ◽

Single Cell ◽

Transcriptome Sequencing ◽

Sequencing Analysis ◽

Cell Transcriptome ◽

Single Cell Transcriptome

Single‐Cell Transcriptome Analysis Uncovers Intratumoral Heterogeneity and Underlying Mechanisms for Drug Resistance in Hepatobiliary Tumor Organoids

Advanced Science ◽

10.1002/advs.202003897 ◽

2021 ◽

pp. 2003897

Author(s):

Yan Zhao ◽

Zhi‐Xuan Li ◽

Yan‐Jing Zhu ◽

Jing Fu ◽

Xiao‐Fang Zhao ◽

...

Keyword(s):

Drug Resistance ◽

Single Cell ◽

Transcriptome Analysis ◽

Intratumoral Heterogeneity ◽

Underlying Mechanisms ◽

Cell Transcriptome ◽

Single Cell Transcriptome ◽

Tumor Organoids

Single-Cell Transcriptome Analyses Reveal Taxol Resistant Subpopulations in Esophageal Squamous Cancer Cells

International Journal of Radiation Oncology*Biology*Physics ◽

10.1016/j.ijrobp.2017.06.2115 ◽

2017 ◽

Vol 99 (2) ◽

pp. E627-E628

Author(s):

H. Wu ◽

J. Yu ◽

S. Chen ◽

X. Zhang ◽

L. Yang ◽

...

Keyword(s):

Single Cell ◽

Cancer Cells ◽

Squamous Cancer ◽

Transcriptome Analyses ◽

Cell Transcriptome ◽

Single Cell Transcriptome ◽

Esophageal Squamous Cancer

Single-Cell Transcriptome Analyses Reveal Signals to Activate Dormant Neural Stem Cells

Cell ◽

10.1016/j.cell.2015.04.001 ◽

2015 ◽

Vol 161 (5) ◽

pp. 1175-1186 ◽

Cited By ~ 140

Author(s):

Yuping Luo ◽

Volkan Coskun ◽

Aibing Liang ◽

Juehua Yu ◽

Liming Cheng ◽

...

Keyword(s):

Stem Cells ◽

Neural Stem Cells ◽

Single Cell ◽

Transcriptome Analyses ◽

Cell Transcriptome ◽

Single Cell Transcriptome

Abstract 93: Single cell transcriptome analysis maps dynamic epithelial mesenchymal transition and immunological remodeling in thyroid cancer progression

10.1158/1538-7445.am2021-93 ◽

2021 ◽

Author(s):

Xi Chen ◽

Jennifer Rui Wang ◽

Ying Henderson ◽

Yuanqing Yan ◽

Shanshan Bai ◽

...

Keyword(s):

Thyroid Cancer ◽

Single Cell ◽

Cancer Progression ◽

Transcriptome Analysis ◽

Epithelial Mesenchymal Transition ◽

Mesenchymal Transition ◽

Cell Transcriptome ◽

Single Cell Transcriptome
