scholarly journals Machine and deep learning single-cell segmentation and quantification of multi-dimensional tissue images

2019 ◽  
Author(s):  
Eliot T McKinley ◽  
Joseph T Roland ◽  
Jeffrey L Franklin ◽  
Mary Catherine Macedonia ◽  
Paige N Vega ◽  
...  

AbstractIncreasingly, highly multiplexed in situ tissue imaging methods are used to profile protein expression at the single-cell level. However, a critical limitation is a lack of robust cell segmentation tools applicable for sections of tissues with a complex architecture and multiple cell types. Using human colorectal adenomas, we present a pipeline for cell segmentation and quantification that utilizes machine learning-based pixel classification to define cellular compartments, a novel method for extending incomplete cell membranes, quantification of antibody staining, and a deep learning-based cell shape descriptor. We envision that this method can be broadly applied to different imaging platforms and tissue types.

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Maria Hurskainen ◽  
Ivana Mižíková ◽  
David P. Cook ◽  
Noora Andersson ◽  
Chanèle Cyr-Depauw ◽  
...  

AbstractDuring late lung development, alveolar and microvascular development is finalized to enable sufficient gas exchange. Impaired late lung development manifests as bronchopulmonary dysplasia (BPD) in preterm infants. Single-cell RNA sequencing (scRNA-seq) allows for assessment of complex cellular dynamics during biological processes, such as development. Here, we use MULTI-seq to generate scRNA-seq profiles of over 66,000 cells from 36 mice during normal or impaired lung development secondary to hyperoxia with validation of some of the findings in lungs from BPD patients. We observe dynamic populations of cells, including several rare cell types and putative progenitors. Hyperoxia exposure, which mimics the BPD phenotype, alters the composition of all cellular compartments, particularly alveolar epithelium, stromal fibroblasts, capillary endothelium and macrophage populations. Pathway analysis and predicted dynamic cellular crosstalk suggest inflammatory signaling as the main driver of hyperoxia-induced changes. Our data provides a single-cell view of cellular changes associated with late lung development in health and disease.


Gene Therapy ◽  
2021 ◽  
Author(s):  
A. S. Mathew ◽  
C. M. Gorick ◽  
R. J. Price

AbstractGene delivery via focused ultrasound (FUS) mediated blood-brain barrier (BBB) opening is a disruptive therapeutic modality. Unlocking its full potential will require an understanding of how FUS parameters (e.g., peak-negative pressure (PNP)) affect transfected cell populations. Following plasmid (mRuby) delivery across the BBB with 1 MHz FUS, we used single-cell RNA-sequencing to ascertain that distributions of transfected cell types were highly dependent on PNP. Cells of the BBB (i.e., endothelial cells, pericytes, and astrocytes) were enriched at 0.2 MPa PNP, while transfection of cells distal to the BBB (i.e., neurons, oligodendrocytes, and microglia) was augmented at 0.4 MPa PNP. PNP-dependent differential gene expression was observed for multiple cell types. Cell stress genes were upregulated proportional to PNP, independent of cell type. Our results underscore how FUS may be tuned to bias transfection toward specific brain cell types in vivo and predict how those cells will respond to transfection.


2020 ◽  
Author(s):  
N. Kakava-Georgiadou ◽  
J.F. Severens ◽  
A.M. Jørgensen ◽  
K.M. Garner ◽  
M.C.M Luijendijk ◽  
...  

AbstractHypothalamic nuclei which regulate homeostatic functions express leptin receptor (LepR), the primary target of the satiety hormone leptin. Single-cell RNA sequencing (scRNA-seq) has facilitated the discovery of a variety of hypothalamic cell types. However, low abundance of LepR transcripts prevented further characterization of LepR cells. Therefore, we perform scRNA-seq on isolated LepR cells and identify eight neuronal clusters, including three uncharacterized Trh-expressing populations as well as 17 non-neuronal populations including tanycytes, oligodendrocytes and endothelial cells. Food restriction had a major impact on Agrp neurons and changed the expression of obesity-associated genes. Multiple cell clusters were enriched for GWAS signals of obesity. We further explored changes in the gene regulatory landscape of LepR cell types. We thus reveal the molecular signature of distinct populations with diverse neurochemical profiles, which will aid efforts to illuminate the multi-functional nature of leptin’s action in the hypothalamus.


2021 ◽  
Author(s):  
Xanthi Stachtea ◽  
Maurice B. Loughrey ◽  
Manuela Salvucci ◽  
Andreas U. Lindner ◽  
Sanghee Cho ◽  
...  

AbstractColorectal cancer (CRC) has one of the highest cancer incidences and mortality rates. In stage III, postoperative chemotherapy benefits <20% of patients, while more than 50% will develop distant metastases. Biomarkers for identification of patients at increased risk of disease recurrence following adjuvant chemotherapy are currently lacking. In this study, we assessed immune signatures in the tumor and tumor microenvironment (TME) using an in situ multiplexed immunofluorescence imaging and single-cell analysis technology (Cell DIVETM) and evaluated their correlations with patient outcomes. Tissue microarrays (TMAs) with up to three 1 mm diameter cores per patient were prepared from 117 stage III CRC patients treated with adjuvant fluoropyrimidine/oxaliplatin (FOLFOX) chemotherapy. Single sections underwent multiplexed immunofluorescence staining for immune cell markers (CD45, CD3, CD4, CD8, FOXP3, PD1) and tumor/cell segmentation markers (DAPI, pan-cytokeratin, AE1, NaKATPase, and S6). We used annotations and a probabilistic classification algorithm to build statistical models of immune cell types. Images were also qualitatively assessed independently by a Pathologist as ‘high’, ‘moderate’ or ‘low’, for stromal and total immune cell content. Excellent agreement was found between manual assessment and total automated scores (p < 0.0001). Moreover, compared to single markers, a multi-marker classification of regulatory T cells (Tregs: CD3+/CD4+FOXP3+/PD1−) was significantly associated with disease-free survival (DFS) and overall survival (OS) (p = 0.049 and 0.032) of FOLFOX-treated patients. Our results also showed that PD1− Tregs rather than PD1+ Tregs were associated with improved survival. These findings were supported by results from an independent FOLFOX-treated cohort of 191 stage III CRC patients, where higher PD1− Tregs were associated with an increase overall survival (p = 0.015) for CD3+/CD4+/FOXP3+/PD1−. Overall, compared to single markers, multi-marker classification provided more accurate quantitation of immune cell types with stronger correlations with outcomes.


2017 ◽  
Author(s):  
Luke Zappia ◽  
Belinda Phipson ◽  
Alicia Oshlack

AbstractAs single-cell RNA sequencing technologies have rapidly developed, so have analysis methods. Many methods have been tested, developed and validated using simulated datasets. Unfortunately, current simulations are often poorly documented, their similarity to real data is not demonstrated, or reproducible code is not available.Here we present the Splatter Bioconductor package for simple, reproducible and well-documented simulation of single-cell RNA-seq data. Splatter provides an interface to multiple simulation methods including Splat, our own simulation, based on a gamma-Poisson distribution. Splat can simulate single populations of cells, populations with multiple cell types or differentiation paths.


2021 ◽  
Vol 12 ◽  
Author(s):  
Bin Zou ◽  
Tongda Zhang ◽  
Ruilong Zhou ◽  
Xiaosen Jiang ◽  
Huanming Yang ◽  
...  

It is well recognized that batch effect in single-cell RNA sequencing (scRNA-seq) data remains a big challenge when integrating different datasets. Here, we proposed deepMNN, a novel deep learning-based method to correct batch effect in scRNA-seq data. We first searched mutual nearest neighbor (MNN) pairs across different batches in a principal component analysis (PCA) subspace. Subsequently, a batch correction network was constructed by stacking two residual blocks and further applied for the removal of batch effects. The loss function of deepMNN was defined as the sum of a batch loss and a weighted regularization loss. The batch loss was used to compute the distance between cells in MNN pairs in the PCA subspace, while the regularization loss was to make the output of the network similar to the input. The experiment results showed that deepMNN can successfully remove batch effects across datasets with identical cell types, datasets with non-identical cell types, datasets with multiple batches, and large-scale datasets as well. We compared the performance of deepMNN with state-of-the-art batch correction methods, including the widely used methods of Harmony, Scanorama, and Seurat V4 as well as the recently developed deep learning-based methods of MMD-ResNet and scGen. The results demonstrated that deepMNN achieved a better or comparable performance in terms of both qualitative analysis using uniform manifold approximation and projection (UMAP) plots and quantitative metrics such as batch and cell entropies, ARI F1 score, and ASW F1 score under various scenarios. Additionally, deepMNN allowed for integrating scRNA-seq datasets with multiple batches in one step. Furthermore, deepMNN ran much faster than the other methods for large-scale datasets. These characteristics of deepMNN made it have the potential to be a new choice for large-scale single-cell gene expression data analysis.


2021 ◽  
Vol 12 ◽  
Author(s):  
Xu Xiao ◽  
Ying Qiao ◽  
Yudi Jiao ◽  
Na Fu ◽  
Wenxian Yang ◽  
...  

Highly multiplexed imaging technology is a powerful tool to facilitate understanding the composition and interactions of cells in tumor microenvironments at subcellular resolution, which is crucial for both basic research and clinical applications. Imaging mass cytometry (IMC), a multiplex imaging method recently introduced, can measure up to 100 markers simultaneously in one tissue section by using a high-resolution laser with a mass cytometer. However, due to its high resolution and large number of channels, how to process and interpret the image data from IMC remains a key challenge to its further applications. Accurate and reliable single cell segmentation is the first and a critical step to process IMC image data. Unfortunately, existing segmentation pipelines either produce inaccurate cell segmentation results or require manual annotation, which is very time consuming. Here, we developed Dice-XMBD1, a Deep learnIng-based Cell sEgmentation algorithm for tissue multiplexed imaging data. In comparison with other state-of-the-art cell segmentation methods currently used for IMC images, Dice-XMBD generates more accurate single cell masks efficiently on IMC images produced with different nuclear, membrane, and cytoplasm markers. All codes and datasets are available at https://github.com/xmuyulab/Dice-XMBD.


2020 ◽  
Author(s):  
Yi-An Tung ◽  
Wen-Tse Yang ◽  
Tsung-Ting Hsieh ◽  
Yu-Chuan Chang ◽  
June-Tai Wu ◽  
...  

AbstractEnhancers are one class of the regulatory elements that have been shown to act as key components to assist promoters in modulating the gene expression in living cells. At present, the number of enhancers as well as their activities in different cell types are still largely unclear. Previous studies have shown that enhancer activities are associated with various functional data, such as histone modifications, sequence motifs, and chromatin accessibilities. In this study, we utilized DNase data to build a deep learning model for predicting the H3K27ac peaks as the active enhancers in a target cell type. We propose joint training of multiple cell types to boost the model performance in predicting the enhancer activities of an unstudied cell type. The results demonstrated that by incorporating more datasets across different cell types, the complex regulatory patterns could be captured by deep learning models and the prediction accuracy can be largely improved. The analyses conducted in this study demonstrated that the cell type-specific enhancer activity can be predicted by joint learning of multiple cell type data using only DNase data and the primitive sequences as the input features. This reveals the importance of cross-cell type learning, and the constructed model can be applied to investigate potential active enhancers of a novel cell type which does not have the H3K27ac modification data yet.AvailabilityThe accuEnhancer package can be freely accessed at: https://github.com/callsobing/accuEnhancer


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Rui Hou ◽  
Elena Denisenko ◽  
Huan Ting Ong ◽  
Jordan A. Ramilowski ◽  
Alistair R. R. Forrest

Abstract Development of high throughput single-cell sequencing technologies has made it cost-effective to profile thousands of cells from diverse samples containing multiple cell types. To study how these different cell types work together, here we develop NATMI (Network Analysis Toolkit for Multicellular Interactions). NATMI uses connectomeDB2020 (a database of 2293 manually curated ligand-receptor pairs with literature support) to predict and visualise cell-to-cell communication networks from single-cell (or bulk) expression data. Using multiple published single-cell datasets we demonstrate how NATMI can be used to identify (i) the cell-type pairs that are communicating the most (or most specifically) within a network, (ii) the most active (or specific) ligand-receptor pairs active within a network, (iii) putative highly-communicating cellular communities and (iv) differences in intercellular communication when profiling given cell types under different conditions. Furthermore, analysis of the Tabula Muris (organism-wide) atlas confirms our previous prediction that autocrine signalling is a major feature of cell-to-cell communication networks, while also revealing that hundreds of ligands and their cognate receptors are co-expressed in individual cells suggesting a substantial potential for self-signalling.


2021 ◽  
Vol 12 ◽  
Author(s):  
John W. Hickey ◽  
Yuqi Tan ◽  
Garry P. Nolan ◽  
Yury Goltsev

Multiplexed imaging is a recently developed and powerful single-cell biology research tool. However, it presents new sources of technical noise that are distinct from other types of single-cell data, necessitating new practices for single-cell multiplexed imaging processing and analysis, particularly regarding cell-type identification. Here we created single-cell multiplexed imaging datasets by performing CODEX on four sections of the human colon (ascending, transverse, descending, and sigmoid) using a panel of 47 oligonucleotide-barcoded antibodies. After cell segmentation, we implemented five different normalization techniques crossed with four unsupervised clustering algorithms, resulting in 20 unique cell-type annotations for the same dataset. We generated two standard annotations: hand-gated cell types and cell types produced by over-clustering with spatial verification. We then compared these annotations at four levels of cell-type granularity. First, increasing cell-type granularity led to decreased labeling accuracy; therefore, subtle phenotype annotations should be avoided at the clustering step. Second, accuracy in cell-type identification varied more with normalization choice than with clustering algorithm. Third, unsupervised clustering better accounted for segmentation noise during cell-type annotation than hand-gating. Fourth, Z-score normalization was generally effective in mitigating the effects of noise from single-cell multiplexed imaging. Variation in cell-type identification will lead to significant differential spatial results such as cellular neighborhood analysis; consequently, we also make recommendations for accurately assigning cell-type labels to CODEX multiplexed imaging.


Sign in / Sign up

Export Citation Format

Share Document