COMUNET: a tool to explore and visualize intercellular communication

Maria Solovey; Antonio Scialdone

doi:10.1093/bioinformatics/btaa482

COMUNET: a tool to explore and visualize intercellular communication

Bioinformatics ◽

10.1093/bioinformatics/btaa482 ◽

2020 ◽

Vol 36 (15) ◽

pp. 4296-4300

Author(s):

Maria Solovey ◽

Antonio Scialdone

Keyword(s):

Single Cell ◽

Intercellular Communication ◽

Cell Communication ◽

Cell Types ◽

R Package ◽

Communication Patterns ◽

Supplementary Information ◽

Multiplex Networks ◽

Patterns Of Communication ◽

Multicellular Organisms

Abstract Motivation Intercellular communication plays an essential role in multicellular organisms and several algorithms to analyze it from single-cell transcriptional data have been recently published, but the results are often hard to visualize and interpret. Results We developed Cell cOmmunication exploration with MUltiplex NETworks (COMUNET), a tool that streamlines the interpretation of the results from cell–cell communication analyses. COMUNET uses multiplex networks to represent and cluster all potential communication patterns between cell types. The algorithm also enables the search for specific patterns of communication and can perform comparative analysis between two biological conditions. To exemplify its use, here we apply COMUNET to investigate cell communication patterns in single-cell transcriptomic datasets from mouse embryos and from an acute myeloid leukemia patient at diagnosis and after treatment. Availability and implementation Our algorithm is implemented in an R package available from https://github.com/ScialdoneLab/COMUNET, along with all the code to perform the analyses reported here. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

COMUNET: a tool to explore and visualize intercellular communication

10.1101/864686 ◽

2019 ◽

Author(s):

Maria Solovey ◽

Antonio Scialdone

Keyword(s):

Single Cell ◽

Intercellular Communication ◽

Myeloid Leukemia ◽

Cell Communication ◽

Cell Types ◽

Mouse Embryos ◽

Multiplex Networks ◽

Leukemia Patient ◽

Patterns Of Communication ◽

Multicellular Organisms

AbstractIntercellular communication plays an essential role in multicellular organisms and several algorithms to analyse it from single-cell transcriptional data have been recently published, but the results are often hard to visualize and interpret. We developed COMUNET (Cell cOMmunication exploration with MUltiplex NETworks), a tool that streamlines the interpretation of the results from cell-cell communication analyses. COMUNET uses multiplex networks to represent and cluster all potential communication pathways between cell types. The algorithm also enables the search for specific patterns of communication and can perform comparative analysis between two biological conditions.To exemplify its use, here we apply COMUNET to investigate cell communication patterns in single-cell transcriptomic datasets from mouse embryos and from an acute myeloid leukemia patient at diagnosis and after treatment. Our algorithm is available on GitHub, along with all the code to perform the analysis reported.

Download Full-text

Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology

Bioinformatics ◽

10.1093/bioinformatics/btz363 ◽

2019 ◽

Vol 35 (14) ◽

pp. i436-i445 ◽

Cited By ~ 71

Author(s):

Gregor Sturm ◽

Francesca Finotello ◽

Florent Petitprez ◽

Jitao David Zhang ◽

Jan Baumbach ◽

...

Keyword(s):

Single Cell ◽

Computational Methods ◽

Immune Cell ◽

Comprehensive Evaluation ◽

Cell Types ◽

R Package ◽

Supplementary Information ◽

Rna Seq ◽

Cell Type ◽

Real World Datasets

Abstract Motivation The composition and density of immune cells in the tumor microenvironment (TME) profoundly influence tumor progression and success of anti-cancer therapies. Flow cytometry, immunohistochemistry staining or single-cell sequencing are often unavailable such that we rely on computational methods to estimate the immune-cell composition from bulk RNA-sequencing (RNA-seq) data. Various methods have been proposed recently, yet their capabilities and limitations have not been evaluated systematically. A general guideline leading the research community through cell type deconvolution is missing. Results We developed a systematic approach for benchmarking such computational methods and assessed the accuracy of tools at estimating nine different immune- and stromal cells from bulk RNA-seq samples. We used a single-cell RNA-seq dataset of ∼11 000 cells from the TME to simulate bulk samples of known cell type proportions, and validated the results using independent, publicly available gold-standard estimates. This allowed us to analyze and condense the results of more than a hundred thousand predictions to provide an exhaustive evaluation across seven computational methods over nine cell types and ∼1800 samples from five simulated and real-world datasets. We demonstrate that computational deconvolution performs at high accuracy for well-defined cell-type signatures and propose how fuzzy cell-type signatures can be improved. We suggest that future efforts should be dedicated to refining cell population definitions and finding reliable signatures. Availability and implementation A snakemake pipeline to reproduce the benchmark is available at https://github.com/grst/immune_deconvolution_benchmark. An R package allows the community to perform integrated deconvolution using different methods (https://grst.github.io/immunedeconv). Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

DIFFpop: a stochastic computational approach to simulate differentiation hierarchies with single cell barcoding

Bioinformatics ◽

10.1093/bioinformatics/btz074 ◽

2019 ◽

Vol 35 (19) ◽

pp. 3849-3851 ◽

Cited By ~ 1

Author(s):

Jeremy Ferlic ◽

Jiantao Shi ◽

Thomas O McDonald ◽

Franziska Michor

Keyword(s):

Single Cell ◽

Clonal Evolution ◽

Stochastic Simulation Algorithm ◽

Cell Types ◽

R Package ◽

Process Models ◽

Supplementary Information ◽

Driver Mutations ◽

Population Sizes ◽

Fixed Population

Abstract Summary DIFFpop is an R package designed to simulate cellular differentiation hierarchies using either exponentially-expanding or fixed population sizes. The software includes functionalities to simulate clonal evolution due to the emergence of driver mutations under the infinite-allele assumption as well as options for simulation and analysis of single cell barcoding and labeling data. The software uses the Gillespie Stochastic Simulation Algorithm and a modification of expanding or fixed-size stochastic process models expanded to a large number of cell types and scenarios. Availability and implementation DIFFpop is available as an R-package along with vignettes on Github (https://github.com/ferlicjl/diffpop). Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

ExperimentSubset: an R package to manage subsets of Bioconductor Experiment objects

Bioinformatics ◽

10.1093/bioinformatics/btab179 ◽

2021 ◽

Author(s):

Irzam Sarfraz ◽

Muhammad Asif ◽

Joshua D Campbell

Keyword(s):

Single Cell ◽

R Package ◽

Poor Quality ◽

Data Matrix ◽

Supplementary Information ◽

Data Provenance ◽

Rna Seq ◽

Efficient Management ◽

The Matrix ◽

The Relationship

Abstract Motivation R Experiment objects such as the SummarizedExperiment or SingleCellExperiment are data containers for storing one or more matrix-like assays along with associated row and column data. These objects have been used to facilitate the storage and analysis of high-throughput genomic data generated from technologies such as single-cell RNA sequencing. One common computational task in many genomics analysis workflows is to perform subsetting of the data matrix before applying down-stream analytical methods. For example, one may need to subset the columns of the assay matrix to exclude poor-quality samples or subset the rows of the matrix to select the most variable features. Traditionally, a second object is created that contains the desired subset of assay from the original object. However, this approach is inefficient as it requires the creation of an additional object containing a copy of the original assay and leads to challenges with data provenance. Results To overcome these challenges, we developed an R package called ExperimentSubset, which is a data container that implements classes for efficient storage and streamlined retrieval of assays that have been subsetted by rows and/or columns. These classes are able to inherently provide data provenance by maintaining the relationship between the subsetted and parent assays. We demonstrate the utility of this package on a single-cell RNA-seq dataset by storing and retrieving subsets at different stages of the analysis while maintaining a lower memory footprint. Overall, the ExperimentSubset is a flexible container for the efficient management of subsets. Availability and implementation ExperimentSubset package is available at Bioconductor: https://bioconductor.org/packages/ExperimentSubset/ and Github: https://github.com/campbio/ExperimentSubset. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Single-cell analysis reveals cell communication triggered by macrophages associated with the reduction and exhaustion of CD8+ T cells in COVID-19

Cell Communication and Signaling ◽

10.1186/s12964-021-00754-7 ◽

2021 ◽

Vol 19 (1) ◽

Author(s):

Lei He ◽

Quan Zhang ◽

Yue Zhang ◽

Yixian Fan ◽

Fahu Yuan ◽

...

Keyword(s):

T Cells ◽

T Cell ◽

Single Cell ◽

Cell Communication ◽

Cell Types ◽

Trial Registration ◽

T Cell Subsets ◽

Disease Group ◽

Communication Analysis ◽

Receptor Interactions

Abstract Background The coronavirus disease 2019 (COVID-19) outbreak caused by severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) has become an ongoing pandemic. Understanding the respiratory immune microenvironment which is composed of multiple cell types, together with cell communication based on ligand–receptor interactions is important for developing vaccines, probing COVID-19 pathogenesis, and improving pandemic control measures. Methods A total of 102 consecutive hospitalized patients with confirmed COVID-19 were enrolled in this study. Clinical information, routine laboratory tests, and flow cytometry analysis data with different conditions were collected and assessed for predictive value in COVID-19 patients. Next, we analyzed public single-cell RNA-sequencing (scRNA-seq) data from bronchoalveolar lavage fluid, which offers the closest available view of immune cell heterogeneity as encountered in patients with varying severity of COVID-19. A weighting algorithm was used to calculate ligand–receptor interactions, revealing the communication potentially associated with outcomes across cell types. Finally, serum cytokines including IL6, IL1β, IL10, CXCL10, TNFα, GALECTIN-1, and IGF1 derived from patients were measured. Results Of the 102 COVID-19 patients, 42 cases (41.2%) were categorized as severe. Multivariate logistic regression analysis demonstrated that AST, D-dimer, BUN, and WBC were considered as independent risk factors for the severity of COVID-19. T cell numbers including total T cells, CD4+ and CD8+ T cells in the severe disease group were significantly lower than those in the moderate disease group. The risk model containing the above mentioned inflammatory damage parameters, and the counts of T cells, with AUROCs ranged from 0.78 to 0.87. To investigate the molecular mechanism at the cellular level, we analyzed the published scRNA-seq data and found that macrophages displayed specific functional diversity after SARS-Cov-2 infection, and the metabolic pathway activities in the identified macrophage subtypes were influenced by hypoxia status. Importantly, we described ligand–receptor interactions that are related to COVID-19 serverity involving macrophages and T cell subsets by communication analysis. Conclusions Our study showed that macrophages driving ligand–receptor crosstalk contributed to the reduction and exhaustion of CD8+ T cells. The identified crucial cytokine panel, including IL6, IL1β, IL10, CXCL10, IGF1, and GALECTIN-1, may offer the selective targets to improve the efficacy of COVID-19 therapy. Trial registration: This is a retrospective observational study without a trial registration number.

Download Full-text

Identification of cell-type-specific marker genes from co-expression patterns in tissue samples

Bioinformatics ◽

10.1093/bioinformatics/btab257 ◽

2021 ◽

Author(s):

Yixuan Qiu ◽

Jiebiao Wang ◽

Jing Lei ◽

Kathryn Roeder

Keyword(s):

Single Cell ◽

Expression Patterns ◽

R Package ◽

Supplementary Information ◽

Marker Genes ◽

Specific Marker ◽

Cell Type ◽

Correlation Pattern ◽

Tissue Samples ◽

Bulk Data

Abstract Motivation Marker genes, defined as genes that are expressed primarily in a single cell type, can be identified from the single cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, however, are highly correlated in bulk data, because their expression levels depend primarily on the proportion of that cell type in the samples. Therefore, when many tissue samples are analyzed, it is possible to identify these marker genes from the correlation pattern. Results To capitalize on this pattern, we develop a new algorithm to detect marker genes by combining published information about likely marker genes with bulk transcriptome data in the form of a semi-supervised algorithm. The algorithm then exploits the correlation structure of the bulk data to refine the published marker genes by adding or removing genes from the list. Availability and implementation We implement this method as an R package markerpen, hosted on CRAN (https://CRAN.R-project.org/package=markerpen). Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

New avenues for systematically inferring cell-cell communication: through single-cell transcriptomics data

Protein & Cell ◽

10.1007/s13238-020-00727-5 ◽

2020 ◽

Vol 11 (12) ◽

pp. 866-880 ◽

Cited By ~ 5

Author(s):

Xin Shao ◽

Xiaoyan Lu ◽

Jie Liao ◽

Huajun Chen ◽

Xiaohui Fan

Keyword(s):

Single Cell ◽

Communication Networks ◽

Cell Communication ◽

Receptor Interaction ◽

Communication Studies ◽

Transcriptomics Data ◽

Phenotype Heterogeneity ◽

Multicellular Organisms ◽

Communication Study ◽

Cell Cell

AbstractFor multicellular organisms, cell-cell communication is essential to numerous biological processes. Drawing upon the latest development of single-cell RNA-sequencing (scRNA-seq), high-resolution transcriptomic data have deepened our understanding of cellular phenotype heterogeneity and composition of complex tissues, which enables systematic cell-cell communication studies at a single-cell level. We first summarize a common workflow of cell-cell communication study using scRNA-seq data, which often includes data preparation, construction of communication networks, and result validation. Two common strategies taken to uncover cell-cell communications are reviewed, e.g., physically vicinal structure-based and ligand-receptor interaction-based one. To conclude, challenges and current applications of cell-cell communication studies at a single-cell resolution are discussed in details and future perspectives are proposed.

Download Full-text

DEsingle for detecting three types of differential expression in single-cell RNA-seq data

10.1101/173997 ◽

2017 ◽

Cited By ~ 1

Author(s):

Zhun Miao ◽

Ke Deng ◽

Xiaowo Wang ◽

Xuegong Zhang

Keyword(s):

Single Cell ◽

Differential Expression ◽

Negative Binomial ◽

Single Cells ◽

R Package ◽

Supplementary Information ◽

Binomial Model ◽

Supplementary Data ◽

Rna Seq ◽

Real Zeros

AbstractSummaryThe excessive amount of zeros in single-cell RNA-seq data include “real” zeros due to the on-off nature of gene transcription in single cells and “dropout” zeros due to technical reasons. Existing differential expression (DE) analysis methods cannot distinguish these two types of zeros. We developed an R package DEsingle which employed Zero-Inflated Negative Binomial model to estimate the proportion of real and dropout zeros and to define and detect 3 types of DE genes in single-cell RNA-seq data with higher accuracy.Availability and ImplementationThe R package DEsingle is freely available at https://github.com/miaozhun/DEsingle and is under Bioconductor’s consideration [email protected] informationSupplementary data are available at bioRxiv online.

Download Full-text

Discovering a sparse set of pairwise discriminating features in high-dimensional data

Bioinformatics ◽

10.1093/bioinformatics/btaa690 ◽

2020 ◽

Author(s):

Samuel Melton ◽

Sharad Ramanathan

Keyword(s):

Single Cell ◽

Dimensional Space ◽

Cell Types ◽

Dimensional Subspace ◽

Supplementary Information ◽

High Dimensional ◽

Technological Advances ◽

Data Points ◽

Low Dimensional ◽

Sparse Set

Abstract Motivation Recent technological advances produce a wealth of high-dimensional descriptions of biological processes, yet extracting meaningful insight and mechanistic understanding from these data remains challenging. For example, in developmental biology, the dynamics of differentiation can now be mapped quantitatively using single-cell RNA sequencing, yet it is difficult to infer molecular regulators of developmental transitions. Here, we show that discovering informative features in the data is crucial for statistical analysis as well as making experimental predictions. Results We identify features based on their ability to discriminate between clusters of the data points. We define a class of problems in which linear separability of clusters is hidden in a low-dimensional space. We propose an unsupervised method to identify the subset of features that define a low-dimensional subspace in which clustering can be conducted. This is achieved by averaging over discriminators trained on an ensemble of proposed cluster configurations. We then apply our method to single-cell RNA-seq data from mouse gastrulation, and identify 27 key transcription factors (out of 409 total), 18 of which are known to define cell states through their expression levels. In this inferred subspace, we find clear signatures of known cell types that eluded classification prior to discovery of the correct low-dimensional subspace. Availability and implementation https://github.com/smelton/SMD. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

DECENT: differential expression with capture efficiency adjustmeNT for single-cell RNA-seq data

Bioinformatics ◽

10.1093/bioinformatics/btz453 ◽

2019 ◽

Vol 35 (24) ◽

pp. 5155-5162 ◽

Cited By ~ 10

Author(s):

Chengzhong Ye ◽

Terence P Speed ◽

Agus Salim

Keyword(s):

Single Cell ◽

Differential Expression ◽

Type I Error ◽

R Package ◽

Supplementary Information ◽

Type I ◽

Common Phenomenon ◽

Rna Seq ◽

Capture Process ◽

Technological Platforms

Abstract Motivation Dropout is a common phenomenon in single-cell RNA-seq (scRNA-seq) data, and when left unaddressed it affects the validity of the statistical analyses. Despite this, few current methods for differential expression (DE) analysis of scRNA-seq data explicitly model the process that gives rise to the dropout events. We develop DECENT, a method for DE analysis of scRNA-seq data that explicitly and accurately models the molecule capture process in scRNA-seq experiments. Results We show that DECENT demonstrates improved DE performance over existing DE methods that do not explicitly model dropout. This improvement is consistently observed across several public scRNA-seq datasets generated using different technological platforms. The gain in improvement is especially large when the capture process is overdispersed. DECENT maintains type I error well while achieving better sensitivity. Its performance without spike-ins is almost as good as when spike-ins are used to calibrate the capture model. Availability and implementation The method is implemented as a publicly available R package available from https://github.com/cz-ye/DECENT. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text