scholarly journals Predicting Cell Populations in Single Cell Mass Cytometry Data

2019 ◽  
Vol 95 (7) ◽  
pp. 769-781 ◽  
Author(s):  
Tamim Abdelaal ◽  
Vincent Unen ◽  
Thomas Höllt ◽  
Frits Koning ◽  
Marcel J.T. Reinders ◽  
...  
2018 ◽  
Author(s):  
Tamim Abdelaal ◽  
Vincent van Unen ◽  
Thomas Höllt ◽  
Frits Koning ◽  
Marcel J.T. Reinders ◽  
...  

AbstractMotivationMass cytometry (CyTOF) is a valuable technology for high-dimensional analysis at the single cell level. Identification of different cell populations is an important task during the data analysis. Many clustering tools can perform this task, however, they are time consuming, often involve a manual step, and lack reproducibility when new data is included in the analysis. Learning cell types from an annotated set of cells solves these problems. However, currently available mass cytometry classifiers are either complex, dependent on prior knowledge of the cell type markers during the learning process, or can only identify canonical cell types.ResultsWe propose to use a Linear Discriminant Analysis (LDA) classifier to automatically identify cell populations in CyTOF data. LDA shows comparable results with two state-of-the-art algorithms on four benchmark datasets and also outperforms a non-linear classifier such as the k-nearest neighbour classifier. To illustrate its scalability to large datasets with deeply annotated cell subtypes, we apply LDA to a dataset of ~3.5 million cells representing 57 cell types. LDA has high performance on abundant cell types as well as the majority of rare cell types, and provides accurate estimates of cell type frequencies. Further incorporating a rejection option, based on the estimated posterior probabilities, allows LDA to identify cell types that were not encountered during training. Altogether, reproducible prediction of cell type compositions using LDA opens up possibilities to analyse large cohort studies based on mass cytometry data.AvailabilityImplementation is available on GitHub (https://github.com/tabdelaal/CyTOF-Linear-Classifier)[email protected]


2021 ◽  
Author(s):  
Lijun Cheng ◽  
Pratik Karkhanis ◽  
Birkan Gokbag ◽  
Lang Li

Background :  Single-cell mass cytometry, also known as cytometry by time of flight (CyTOF) is a powerful high-throughput technology that allows analysis of up to 50 protein markers per cell for the quantification and classification of single cells. Traditional manual gating utilized to identify new cell populations has been inadequate, inefficient, unreliable, and difficult to use, and no algorithms to identify both calibration and new cell populations has been well established. Methods :   A deep learning with graphic cluster (DGCyTOF) visualization is developed as a new integrated embedding visualization approach in identifying canonical and new cell types. The DGCyTOF combines deep-learning classification and hierarchical stable-clustering methods to sequentially build a tri-layer construct for known cell types and the identification of new cell types. First, deep classification learning is constructed to distinguish calibration cell populations from all cells by softmax classification assignment under a probability threshold, and graph embedding clustering is then used to identify new cell populations sequentially. In the middle of two-layer, cell labels are automatically adjusted between new and unknown cell populations via a feedback loop using an iteration calibration system to reduce the rate of error in the identification of cell types, and a 3-dimensional (3D) visualization platform is finally developed to display the cell clusters with all cell-population types annotated. Results : Utilizing two benchmark CyTOF databases comprising up to 43 million cells, we compared accuracy and speed in the identification of cell types among DGCyTOF, DeepCyTOF, and other technologies including dimension reduction with clustering, including Principal Component Analysis ( PCA ) , Factor Analysis ( FA ), Independent Component Analysis ( ICA ), Isometric Feature Mapping ( Isomap ), t-distributed Stochastic Neighbor Embedding ( t-SNE ), and Uniform Manifold Approximation and Projection ( UMAP ) with k -means clustering and Gaussian mixture clustering. We observed the DGCyTOF represents a robust complete learning system with high accuracy, speed and visualization by eight measurement criteria. The DGCyTOF displayed F-scores of 0.9921 for CyTOF1 and 0.9992 for CyTOF2 datasets, whereas those scores were only 0.507 and 0.529 for the t-SNE + k-means ; 0.565 and 0.59, for UMAP + k-means . Comparison of DGCyTOF with t-SNE and UMAP visualization in accuracy demonstrated its approximately 35% superiority in predicting cell types. In addition, observation of cell-population distribution was more intuitive in the 3D visualization in DGCyTOF than t-SNE and UMAP visualization. Conclusions :  The DGCyTOF model can automatically assign known labels to single cells with high accuracy using deep-learning classification assembling with traditional graph-clustering and dimension-reduction strategies. Guided by a calibration system, the model seeks optimal accuracy balance among calibration cell populations and unknown cell types, yielding a complete and robust learning system that is highly accurate in the identification of cell populations compared to results using other methods in the analysis of single-cell CyTOF data. Application of the DGCyTOF method to identify cell populations could be extended to the analysis of single-cell RNASeq data and other omics data.


2021 ◽  
Author(s):  
Ju Hee Lee ◽  
Kafi N. Ealey ◽  
Yash Patel ◽  
Navkiran Verma ◽  
Nikita Thakkar ◽  
...  

2021 ◽  
Vol 18 (1) ◽  
Author(s):  
Christos Nikolaou ◽  
Kerstin Muehle ◽  
Stephan Schlickeiser ◽  
Alberto Sada Japp ◽  
Nadine Matzmohr ◽  
...  

An amendment to this paper has been published and can be accessed via the original article.


2018 ◽  
Vol 22 (1) ◽  
pp. 78-90 ◽  
Author(s):  
Chotima Böttcher ◽  
◽  
Stephan Schlickeiser ◽  
Marjolein A. M. Sneeboer ◽  
Desiree Kunkel ◽  
...  

2019 ◽  
Vol 200 ◽  
pp. 24-30 ◽  
Author(s):  
Min Sun Shin ◽  
Kristina Yim ◽  
Kevin Moon ◽  
Hong-Jai Park ◽  
Subhasis Mohanty ◽  
...  

2020 ◽  
Vol 30 (6) ◽  
pp. 1178-1191 ◽  
Author(s):  
Camila Fernández‐Zapata ◽  
Julia K. H. Leman ◽  
Josef Priller ◽  
Chotima Böttcher

Sign in / Sign up

Export Citation Format

Share Document