scholarly journals Power Analysis of Single Cell RNA-Sequencing Experiments

2016 ◽  
Author(s):  
Valentine Svensson ◽  
Kedar Nath Natarajan ◽  
Lam-Ha Ly ◽  
Ricardo J Miragaia ◽  
Charlotte Labalette ◽  
...  

AbstractHigh-throughput single cell RNA sequencing (scRNA-seq) has become an established and powerful method to investigate transcriptomic cell-to-cell variation, and has revealed new cell types, and new insights into developmental process and stochasticity in gene expression. There are now several published scRNA-seq protocols, which all sequence transcriptomes from a minute amount of starting material. Therefore, a key question is how these methods compare in terms of sensitivity of detection of mRNA molecules, and accuracy of quantification of gene expression. Here, we assessed the sensitivity and accuracy of many published data sets based on standardized spike-ins with a uniform raw data processing pipeline. We developed a flexible and fast UMI counting tool (https://github.com/vals/umis) which is compatible with all UMI based protocols. This allowed us to relate these parameters to sequencing depth, and discuss the trade offs between the different methods. To confirm our results, we performed experiments on cells from the same population using three different protocols. We also investigated the effect of RNA degradation on spike-in molecules, and the average efficiency of scRNA-seq on spike-in molecules versus endogenous RNAs.

Author(s):  
Di He ◽  
Di Wang ◽  
Ping Lu ◽  
Nan Yang ◽  
Zhigang Xue ◽  
...  

Abstract Lung adenocarcinoma (LUAD) harboring EGFR mutations prevails in Asian population. However, the inter-patient and intra-tumor heterogeneity has not been addressed at single-cell resolution. Here we performed single-cell RNA sequencing (scRNA-seq) of total 125,674 cells from seven stage-I/II LUAD samples harboring EGFR mutations and five tumor-adjacent lung tissues. We identified diverse cell types within the tumor microenvironment (TME) in which myeloid cells and T cells were the most abundant stromal cell types in tumors and adjacent lung tissues. Within tumors, accompanied by an increase in CD1C+ dendritic cells, the tumor-associated macrophages (TAMs) showed pro-tumoral functions without signature gene expression of defined M1 or M2 polarization. Tumor-infiltrating T cells mainly displayed exhausted and regulatory T-cell features. The adenocarcinoma cells can be categorized into different subtypes based on their gene expression signatures in distinct pathways such as hypoxia, glycolysis, cell metabolism, translation initiation, cell cycle, and antigen presentation. By performing pseudotime trajectory, we found that ELF3 was among the most upregulated genes in more advanced tumor cells. In response to secretion of inflammatory cytokines (e.g., IL1B) from immune infiltrates, ELF3 in tumor cells was upregulated to trigger the activation of PI3K/Akt/NF-κB pathway and elevated expression of proliferation and anti-apoptosis genes such as BCL2L1 and CCND1. Taken together, our study revealed substantial heterogeneity within early-stage LUAD harboring EGFR mutations, implicating complex interactions among tumor cells, stromal cells and immune infiltrates in the TME.


2019 ◽  
Author(s):  
Allison Jevitt ◽  
Deeptiman Chatterjee ◽  
Gengqiang Xie ◽  
Xian-Feng Wang ◽  
Taylor Otwell ◽  
...  

AbstractOogenesis is a complex developmental process that involves spatiotemporally regulated coordination between the germline and supporting, somatic cell populations. This process has been modelled extensively using theDrosophilaovary. While different ovarian cell types have been identified through traditional means, the large-scale expression profiles underlying each cell type remain unknown. Using single-cell RNA sequencing technology, we have built a transcriptomic dataset for the adultDrosophilaovary and connected tissues. This dataset captures the entire transcriptional trajectory of the developing follicle cell population over time. Our findings provide detailed insight into processes such as cell-cycle switching, migration, symmetry breaking, nurse cell engulfment, egg-shell formation, and signaling during corpus luteum formation, marking a newly identified oogenesis-to-ovulation transition. Altogether, these findings provide a broad perspective on oogenesis at a single-cell resolution while revealing new genetic markers and fate-specific transcriptional signatures to facilitate future studies.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Xiuying Li ◽  
Guillaume Noell ◽  
Tracy Tabib ◽  
Alyssa D. Gregory ◽  
Humberto E. Trejo Bittar ◽  
...  

Abstract Background Whole lung tissue transcriptomic profiling studies in chronic obstructive pulmonary disease (COPD) have led to the identification of several genes associated with the severity of airflow limitation and/or the presence of emphysema, however, the cell types driving these gene expression signatures remain unidentified. Methods To determine cell specific transcriptomic changes in severe COPD, we conducted single-cell RNA sequencing (scRNA seq) on n = 29,961 cells from the peripheral lung parenchymal tissue of nonsmoking subjects without underlying lung disease (n = 3) and patients with severe COPD (n = 3). The cell type composition and cell specific gene expression signature was assessed. Gene set enrichment analysis (GSEA) was used to identify the specific cell types contributing to the previously reported transcriptomic signatures. Results T-distributed stochastic neighbor embedding and clustering of scRNA seq data revealed a total of 17 distinct populations. Among them, the populations with more differentially expressed genes in cases vs. controls (log fold change >|0.4| and FDR = 0.05) were: monocytes (n = 1499); macrophages (n = 868) and ciliated epithelial cells (n = 590), respectively. Using GSEA, we found that only ciliated and cytotoxic T cells manifested a trend towards enrichment of the previously reported 127 regional emphysema gene signatures (normalized enrichment score [NES] = 1.28 and = 1.33, FDR = 0.085 and = 0.092 respectively). Among the significantly altered genes present in ciliated epithelial cells of the COPD lungs, QKI and IGFBP5 protein levels were also found to be altered in the COPD lungs. Conclusions scRNA seq is useful for identifying transcriptional changes and possibly individual protein levels that may contribute to the development of emphysema in a cell-type specific manner.


2021 ◽  
Vol 11 ◽  
Author(s):  
Meijia Gu ◽  
Ti He ◽  
Yuncong Yuan ◽  
Suling Duan ◽  
Xin Li ◽  
...  

BackgroundCervical cancer is one of the most common gynecological cancers worldwide. The tumor microenvironment significantly influences the therapeutic response and clinical outcome. However, the complex tumor microenvironment of cervical cancer and the molecular mechanisms underlying chemotherapy resistance are not well studied. This study aimed to comprehensively analyze cells from pretreated and chemoresistant cervical cancer tissues to generate a molecular census of cell populations.MethodsBiopsy tissues collected from patients with cervical squamous cell carcinoma, cervical adenocarcinoma, and chronic cervicitis were subjected to single-cell RNA sequencing using the 10× Genomics platform. Unsupervised clustering analysis of cells was performed to identify the main cell types, and important cell clusters were reclustered into subpopulations. Gene expression profiles and functional enrichment analysis were used to explore gene expression and functional differences between cell subpopulations in cervicitis and cervical cancer samples and between chemoresistant and chemosensitive samples.ResultsA total of 24,371 cells were clustered into nine separate cell types, including immune and non-immune cells. Differentially expressed genes between chemoresistant and chemosensitive patients enriched in the phosphoinositide 3-kinase (PI3K)/AKT pathway were involved in tumor development, progression, and apoptosis, which might lead to chemotherapy resistance.ConclusionsOur study provides a comprehensive overview of the cancer microenvironment landscape and characterizes its gene expression and functional difference in chemotherapy resistance. Consequently, our study deepens the insights into cervical cancer biology through the identification of gene markers for diagnosis, prognosis, and therapy.


2021 ◽  
Author(s):  
Josephine Bageritz ◽  
Niklas Krausse ◽  
Schayan Yousefian ◽  
Svenja Leible ◽  
Erica Valentini ◽  
...  

Single cell RNA sequencing (scRNA-seq) has become an important method to identify cell types, delineate the trajectories of cell differentiation in whole organisms and understand the heterogeneity in cellular responses. Nevertheless, sample collection and processing remain a severe bottleneck for scRNA-seq experiments. Cell isolation protocols often lead to significant changes in the transcriptomes of cells, requiring novel methods to preserve cell states. Here, we developed and benchmarked protocols using glyoxal as a fixative for scRNA-seq application. Using Drop-seq methodology, we detected high numbers of transcripts and genes from glyoxal-fixed Drosophila cells after scRNA-seq. The effective glyoxal fixation of transcriptomes in Drosophila and human cells was further supported by a high correlation of gene expression data between glyoxal-fixed and unfixed samples. Accordingly, we also found highly expressed genes overlapping to a large extent between experimental conditions. These results indicated that our fixation protocol did not induce considerable changes in gene expression and conserved the transcriptome for subsequent single cell isolation procedures. In conclusion, we present glyoxal as a suitable fixative for Drosophila cells and potentially cells of other species that allows high-quality scRNA-seq applications.


2021 ◽  
Author(s):  
Mohammad Lotfollahi ◽  
leander Dony ◽  
Harshita Agarwala ◽  
Fabian J Theis

Learning robust representations can help uncover underlying biological variation in scRNA-seq data. Disentangled representation learning is one approach to obtain such informative as well interpretable representations. Here, we learn disentangled representations of scRNA-seq data using β-variational autoencoder (β-VAE) and apply the model for out-of-distribution (OOD) prediction. We demonstrate accurate gene expression predictions for cell types absent from training in a perturbation and a developmental dataset. We further show that β-VAE outperforms a state-of-the-art disentanglement method for scRNA-seq in OOD prediction while achieving better disentanglement performance.


2017 ◽  
Author(s):  
Wenfa Ng

Single cell studies increasing reveal myriad cellular subtypes beyond those postulated or observed through optical and fluorescence microscopy as well as DNA sequencing studies. While gene sequencing at the single cell level offer a path towards illuminating, in totality, the different subtypes of cells present, the technique nevertheless does not offer answers concerning the functional repertoire of the cell, which is defined by the collection of RNA transcribed from the genome. Known as the transcriptome, transcribed RNA defines the function of the cell as proteins or effector RNA molecules, while the genome is the collection of all information endowed in the cell type, expressed or not. Thus, a particular cell state, lineage, cell fate or cellular differentiation is more fully depicted by transcriptomic analysis compared to delineating the genomic context at the single cell level. While conceptually sound and could be analysed by contemporary single cell RNA sequencing technology and data analysis pipelines, the relative instability of RNA in view of RNase in the environment would make sample preparation particularly challenging, where degradation of cellular RNA by extraneous factors could provide a misinterpretation of specific functions available to a cell type. Hence, RNA as the de facto functional molecule of the cell defining the proteomics landscape as well as effector RNA repertoire, meant that RNA transcriptomics at the single cell level is the way forward if the goal is to understand all available cell types, lineage, cell fate and cellular differentiation. Given that a cell state is defined by the functions encoded by functional molecules such as proteins and RNA, single cell RNA sequencing offers a larger contextual basis for understanding cellular decision making and functions, for example, proteins are increasingly known to work in concert with RNA effector molecules in enabling a function. Hence, providing a view of the diverse cell types and lineages present in a body, single cell RNA sequencing is only hampered by the high sensitivity required to analyse the small amount of RNA available in single cells, as well as the perennial problem of RNA studies: how to prevent or reduce RNA degradation by environmental RNase enzymes. Ability to reduce RNA degradation would provide the cell biologist a unique view of the functional landscape of different cells in the body through the language of RNA.


2017 ◽  
Author(s):  
Wenfa Ng

Single cell studies increasing reveal myriad cellular subtypes beyond those postulated or observed through optical and fluorescence microscopy as well as DNA sequencing studies. While gene sequencing at the single cell level offer a path towards illuminating, in totality, the different subtypes of cells present, the technique nevertheless does not offer answers concerning the functional repertoire of the cell, which is defined by the collection of RNA transcribed from the genome. Known as the transcriptome, transcribed RNA defines the function of the cell as proteins or effector RNA molecules, while the genome is the collection of all information endowed in the cell type, expressed or not. Thus, a particular cell state, lineage, cell fate or cellular differentiation is more fully depicted by transcriptomic analysis compared to delineating the genomic context at the single cell level. While conceptually sound and could be analysed by contemporary single cell RNA sequencing technology and data analysis pipelines, the relative instability of RNA in view of RNase in the environment would make sample preparation particularly challenging, where degradation of cellular RNA by extraneous factors could provide a misinterpretation of specific functions available to a cell type. Hence, RNA as the de facto functional molecule of the cell defining the proteomics landscape as well as effector RNA repertoire, meant that RNA transcriptomics at the single cell level is the way forward if the goal is to understand all available cell types, lineage, cell fate and cellular differentiation. Given that a cell state is defined by the functions encoded by functional molecules such as proteins and RNA, single cell RNA sequencing offers a larger contextual basis for understanding cellular decision making and functions, for example, proteins are increasingly known to work in concert with RNA effector molecules in enabling a function. Hence, providing a view of the diverse cell types and lineages present in a body, single cell RNA sequencing is only hampered by the high sensitivity required to analyse the small amount of RNA available in single cells, as well as the perennial problem of RNA studies: how to prevent or reduce RNA degradation by environmental RNase enzymes. Ability to reduce RNA degradation would provide the cell biologist a unique view of the functional landscape of different cells in the body through the language of RNA.


2019 ◽  
Author(s):  
Kyle J. Travaglini ◽  
Ahmad N. Nabhan ◽  
Lolita Penland ◽  
Rahul Sinha ◽  
Astrid Gillich ◽  
...  

AbstractAlthough single cell RNA sequencing studies have begun providing compendia of cell expression profiles, it has proven more difficult to systematically identify and localize all molecular cell types in individual organs to create a full molecular cell atlas. Here we describe droplet- and plate-based single cell RNA sequencing applied to ∼75,000 human lung and blood cells, combined with a multi-pronged cell annotation approach, which have allowed us to define the gene expression profiles and anatomical locations of 58 cell populations in the human lung, including 41 of 45 previously known cell types or subtypes and 14 new ones. This comprehensive molecular atlas elucidates the biochemical functions of lung cell types and the cell-selective transcription factors and optimal markers for making and monitoring them; defines the cell targets of circulating hormones and predicts local signaling interactions including sources and targets of chemokines in immune cell trafficking and expression changes on lung homing; and identifies the cell types directly affected by lung disease genes and respiratory viruses. Comparison to mouse identified 17 molecular types that appear to have been gained or lost during lung evolution and others whose expression profiles have been substantially altered, revealing extensive plasticity of cell types and cell-type-specific gene expression during organ evolution including expression switches between cell types. This atlas provides the molecular foundation for investigating how lung cell identities, functions, and interactions are achieved in development and tissue engineering and altered in disease and evolution.


2020 ◽  
Author(s):  
Xiuying Li ◽  
Guillaume Noell ◽  
Tracy Tabib ◽  
Alyssa D Gregory ◽  
Humberto E Trejo Bittar ◽  
...  

Abstract Background: Whole lung tissue transcriptomic profiling studies in chronic obstructive pulmonary disease (COPD) have led to the identification of several genes associated with the severity of airflow limitation and/or the presence of emphysema, however, the cell types driving these gene expression signatures remain unidentified.Methods: To determine cell specific transcriptomic changes in severe COPD, we conducted single-cell RNA sequencing (scRNA seq) on n= 29,961 cells from the peripheral lung parenchymal tissue of nonsmoking subjects without underlying lung disease (n=3) and patients with severe COPD (n=3). The cell type composition and cell specific gene expression signature was assessed. Gene set enrichment analysis (GSEA) was used to identify the specific cell types contributing to the previously reported transcriptomic signatures.Results: T-distributed stochastic neighbor embedding and clustering of scRNA seq data revealed a total of 17 distinct populations. Among them, the populations with more differentially expressed genes in cases vs. controls (log fold change >|0.4| and FDR=0.05) were: monocytes (n=1499); macrophages (n=868) and ciliated epithelial cells (n= 590), respectively. Using GSEA, we found that only ciliated and cytotoxic T cells manifested a trend towards enrichment of the previously reported 127 regional emphysema gene signatures (normalized enrichment score [NES] = 1.28 and =1.33, FDR= 0.085 and =0.092 respectively). Among the significantly altered genes present in ciliated epithelial cells of the COPD lungs, QKI and IGFBP5 protein levels were also found to be altered in the COPD lungs. Conclusions: scRNA seq is useful to identify transcriptional changes and possibly individual protein levels that may contribute to the development of emphysema in a cell-type specific manner.


Sign in / Sign up

Export Citation Format

Share Document