scholarly journals scRNA-seq mixology: towards better benchmarking of single cell RNA-seq analysis methods

2018 ◽  
Author(s):  
Luyi Tian ◽  
Xueyi Dong ◽  
Saskia Freytag ◽  
Kim-Anh Lê Cao ◽  
Shian Su ◽  
...  

AbstractSingle cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years, bringing with new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq data to address various biological questions. However, the current lack of gold-standard benchmark datasets makes it difficult for researchers to systematically evaluate the performance of the many methods available. Here, we designed and carried out a realistic benchmark experiment that included mixtures of single cells or ‘pseudo cells’ created by sampling admixtures of cells or RNA from up to 5 distinct cancer cell lines. Altogether we generated 14 datasets using droplet and plate-based scRNA-seq protocols, compared multiple data analysis methods in combination for tasks ranging from normalization and imputation, to clustering, trajectory analysis and data integration. Evaluation across 3,913 analyses (methods × benchmark dataset combinations) revealed pipelines suited to different types of data for different tasks. Our dataset and analysis present a comprehensive comparison framework for benchmarking most common scRNA-seq analysis tasks.

2021 ◽  
Vol 7 (8) ◽  
pp. eabe3610
Author(s):  
Conor J. Kearney ◽  
Stephin J. Vervoort ◽  
Kelly M. Ramsbottom ◽  
Izabela Todorovski ◽  
Emily J. Lelliott ◽  
...  

Multimodal single-cell RNA sequencing enables the precise mapping of transcriptional and phenotypic features of cellular differentiation states but does not allow for simultaneous integration of critical posttranslational modification data. Here, we describe SUrface-protein Glycan And RNA-seq (SUGAR-seq), a method that enables detection and analysis of N-linked glycosylation, extracellular epitopes, and the transcriptome at the single-cell level. Integrated SUGAR-seq and glycoproteome analysis identified tumor-infiltrating T cells with unique surface glycan properties that report their epigenetic and functional state.


2019 ◽  
Author(s):  
Ning Wang ◽  
Andrew E. Teschendorff

AbstractInferring the activity of transcription factors in single cells is a key task to improve our understanding of development and complex genetic diseases. This task is, however, challenging due to the relatively large dropout rate and noisy nature of single-cell RNA-Seq data. Here we present a novel statistical inference framework called SCIRA (Single Cell Inference of Regulatory Activity), which leverages the power of large-scale bulk RNA-Seq datasets to infer high-quality tissue-specific regulatory networks, from which regulatory activity estimates in single cells can be subsequently obtained. We show that SCIRA can correctly infer regulatory activity of transcription factors affected by high technical dropouts. In particular, SCIRA can improve sensitivity by as much as 70% compared to differential expression analysis and current state-of-the-art methods. Importantly, SCIRA can reveal novel regulators of cell-fate in tissue-development, even for cell-types that only make up 5% of the tissue, and can identify key novel tumor suppressor genes in cancer at single cell resolution. In summary, SCIRA will be an invaluable tool for single-cell studies aiming to accurately map activity patterns of key transcription factors during development, and how these are altered in disease.


2019 ◽  
Author(s):  
Anna Danese ◽  
Maria L. Richter ◽  
David S. Fischer ◽  
Fabian J. Theis ◽  
Maria Colomé-Tatché

ABSTRACTEpigenetic single-cell measurements reveal a layer of regulatory information not accessible to single-cell transcriptomics, however single-cell-omics analysis tools mainly focus on gene expression data. To address this issue, we present epiScanpy, a computational framework for the analysis of single-cell DNA methylation and single-cell ATAC-seq data. EpiScanpy makes the many existing RNA-seq workflows from scanpy available to large-scale single-cell data from other -omics modalities. We introduce and compare multiple feature space constructions for epigenetic data and show the feasibility of common clustering, dimension reduction and trajectory learning techniques. We benchmark epiScanpy by interrogating different single-cell brain mouse atlases of DNA methylation, ATAC-seq and transcriptomics. We find that differentially methylated and differentially open markers between cell clusters enrich transcriptome-based cell type labels by orthogonal epigenetic information.


Author(s):  
Jinfen Wei ◽  
Zixi Chen ◽  
Meiling Hu ◽  
Ziqing He ◽  
Dawei Jiang ◽  
...  

Hypoxia is a characteristic of tumor microenvironment (TME) and is a major contributor to tumor progression. Yet, subtype identification of tumor-associated non-malignant cells at single-cell resolution and how they influence cancer progression under hypoxia TME remain largely unexplored. Here, we used RNA-seq data of 424,194 single cells from 108 patients to identify the subtypes of cancer cells, stromal cells, and immune cells; to evaluate their hypoxia score; and also to uncover potential interaction signals between these cells in vivo across six cancer types. We identified SPP1+ tumor-associated macrophage (TAM) subpopulation potentially enhanced epithelial–mesenchymal transition (EMT) by interaction with cancer cells through paracrine pattern. We prioritized SPP1 as a TAM-secreted factor to act on cancer cells and found a significant enhanced migration phenotype and invasion ability in A549 lung cancer cells induced by recombinant protein SPP1. Besides, prognostic analysis indicated that a higher expression of SPP1 was found to be related to worse clinical outcome in six cancer types. SPP1 expression was higher in hypoxia-high macrophages based on single-cell data, which was further validated by an in vitro experiment that SPP1 was upregulated in macrophages under hypoxia-cultured compared with normoxic conditions. Additionally, a differential analysis demonstrated that hypoxia potentially influences extracellular matrix remodeling, glycolysis, and interleukin-10 signal activation in various cancer types. Our work illuminates the clearer underlying mechanism in the intricate interaction between different cell subtypes within hypoxia TME and proposes the guidelines for the development of therapeutic targets specifically for patients with high proportion of SPP1+ TAMs in hypoxic lesions.


2017 ◽  
Author(s):  
Zhun Miao ◽  
Ke Deng ◽  
Xiaowo Wang ◽  
Xuegong Zhang

AbstractSummaryThe excessive amount of zeros in single-cell RNA-seq data include “real” zeros due to the on-off nature of gene transcription in single cells and “dropout” zeros due to technical reasons. Existing differential expression (DE) analysis methods cannot distinguish these two types of zeros. We developed an R package DEsingle which employed Zero-Inflated Negative Binomial model to estimate the proportion of real and dropout zeros and to define and detect 3 types of DE genes in single-cell RNA-seq data with higher accuracy.Availability and ImplementationThe R package DEsingle is freely available at https://github.com/miaozhun/DEsingle and is under Bioconductor’s consideration [email protected] informationSupplementary data are available at bioRxiv online.


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Ayshwarya Subramanian ◽  
Eriene-Heidi Sidhom ◽  
Maheswarareddy Emani ◽  
Katherine Vernon ◽  
Nareh Sahakian ◽  
...  

AbstractHuman iPSC-derived kidney organoids have the potential to revolutionize discovery, but assessing their consistency and reproducibility across iPSC lines, and reducing the generation of off-target cells remain an open challenge. Here, we profile four human iPSC lines for a total of 450,118 single cells to show how organoid composition and development are comparable to human fetal and adult kidneys. Although cell classes are largely reproducible across time points, protocols, and replicates, we detect variability in cell proportions between different iPSC lines, largely due to off-target cells. To address this, we analyze organoids transplanted under the mouse kidney capsule and find diminished off-target cells. Our work shows how single cell RNA-seq (scRNA-seq) can score organoids for reproducibility, faithfulness and quality, that kidney organoids derived from different iPSC lines are comparable surrogates for human kidney, and that transplantation enhances their formation by diminishing off-target cells.


2020 ◽  
Author(s):  
Siamak Yousefi ◽  
Hao Chen ◽  
Jesse F. Ingels ◽  
Melinda S. McCarty ◽  
Arthur G. Centeno ◽  
...  

SUMMARYSingle cell RNA sequencing has enabled quantification of single cells and identification of different cell types and subtypes as well as cell functions in different tissues. Single cell RNA sequence analyses assume acquired RNAs correspond to cells, however, RNAs from contamination within the input data are also captured by these assays. The sequencing of background contamination as well as unwanted cells making their way to the final assay Potentially confound the correct biological interpretation of single cell transcriptomic data. Here we demonstrate two approaches to deal with background contamination as well as profiling of unwanted cells in the assays. We use three real-life datasets of whole-cell capture and nucleotide single-cell captures generated by Fluidigm and 10x technologies and show that these methods reduce the effect of contamination, strengthen clustering of cells and improves biological interpretation.


Blood ◽  
2018 ◽  
Vol 132 (Supplement 1) ◽  
pp. 3887-3887
Author(s):  
Moosa Qureshi ◽  
Fernando Calero-Nieto ◽  
Iwo Kucinski ◽  
Sarah Kinston ◽  
George Giotopoulos ◽  
...  

Abstract The C/EBPα transcription factor plays a pivotal role in myeloid differentiation and E2F-mediated cell cycle regulation. Although CEBPA mutations are common in acute myeloid leukaemia (AML), little is known regarding pre-leukemic alterations caused by mutated CEBPA. Here, we investigated early events involved in pre-leukemic transformation driven by CEBPA N321D in the LMPP-like cell line Hoxb8-FL (Redecke et al., Nat Methods 2013), which can be maintained in vitro as a self-renewing LMPP population using Flt3L and estradiol, as well as differentiated both in vitro and in vivo into myeloid and lymphoid cell types. Hoxb8-FL cells were retrovirally transduced with Empty Vector (EV), wild-type CEBPA (CEBPA WT) or its N321D mutant form (CEBPA N321D). CEBPA WT-transduced cells showed increased expression of cd11b and SIRPα and downregulation of c-kit, suggesting that wild-type CEBPA was sufficient to promote differentiation even under LMPP growth conditions. Interestingly, we did not observe the same phenotype in CEBPA N321D-transduced cells. Upon withdrawal of estradiol, both EV and CEBPA WT-transduced cells differentiated rapidly into a conventional dendritic cell (cDC) phenotype by day 7 and died within 12 days. By contrast, CEBPA N321D-transduced cells continued to grow for in excess of 56 days, with an initial cDC phenotype but by day 30 demonstrating a plasmacytoid dendritic cell precursor phenotype. CEBPA N321D-transduced cells were morphologically distinct from EV-transduced cells. To test leukemogenic potential in vivo, we performed transplantation experiments in lethally irradiated mice. Serial monitoring of peripheral blood demonstrated that Hoxb8-FL derived cells had disappeared by 4 weeks, and did not reappear. However, at 6 months CEBPA N321D-transduced cells could still be detected in bone marrow in contrast to EV-transduced cells but without any leukemic phenotype. To identify early events involved in pre-leukemic transformation, the differentiation profiles of EV, CEBPA WT and CEBPA N321D-transduced cells were examined with single cell RNA-seq (scRNA-seq). 576 single cells were taken from 3 biological replicates at days 0 and 5 post-differentiation, and analysed using the Automated Single-Cell Analysis Pipeline (Gardeux et al., Bioinformatics 2017). Visualisation by t-SNE (Fig 1) demonstrated: (i) CEBPA WT-transduced cells formed a distinct cluster at day 0 before withdrawal of estradiol; (ii) CEBPA N321D-transduced cells separated from EV and CEBPA WT-transduced cells after 5 days of differentiation, (iii) two subpopulations could be identified within the CEBPA N321D-transduced cells at day 5, with a cluster of five CEBPA N321D-transduced single cells distributed amongst or very close to the day 0 non-differentiated cells. Differential expression analysis identified 224 genes upregulated and 633 genes downregulated specifically in the CEBPA N321D-transduced cells when compared to EV cells after 5 days of differentiation. This gene expression signature revealed that CEBPA N321D-transduced cells switched on a HSC/MEP/CMP transcriptional program and switched off a myeloid dendritic cell program. Finally, in order to further dissect the effect of the N321D mutation, the binding profile of endogenous and CEBPA N321D was compared by ChIP-seq before and after 5 days of differentiation. Integration with scRNA-seq data identified 160 genes specifically downregulated in CEBPA N321D-transduced cells which were associated with the binding of the mutant protein. This list of genes included genes previously implicated in dendritic cell differentiation (such as NOTCH2, JAK2), as well as a number of genes not previously implicated in the evolution of AML, representing potentially novel therapeutic targets. Disclosures No relevant conflicts of interest to declare.


2021 ◽  
Author(s):  
Lin Di ◽  
Bo Liu ◽  
Yuzhu Lyu ◽  
Shihui Zhao ◽  
Yuhong Pang ◽  
...  

Many single cell RNA-seq applications aim to probe a wide dynamic range of gene expression, but most of them are still challenging to accurately quantify low-aboundance transcripts. Based on our previous finding that Tn5 transposase can directly cut-and-tag DNA/RNA hetero-duplexes, we present SHERRY2, an optimized protocol for sequencing transcriptomes of single cells or single nuclei. SHERRY2 is robust and scalable, and it has higher sensitivity and more uniform coverage in comparison with prevalent scRNA-seq methods. With throughput of a few thousand cells per batch, SHERRY2 can reveal the subtle transcriptomic differences between cells and facilitate important biological discoveries.


Sign in / Sign up

Export Citation Format

Share Document