scholarly journals A novel algorithm for the collective integration of single cell RNA-seq during embryogenesis

2019 ◽  
Author(s):  
Wuming Gong ◽  
Bhairab N. Singh ◽  
Pruthvi Shah ◽  
Satyabrata Das ◽  
Joshua Theisen ◽  
...  

AbstractSingle cell RNA-seq (scRNA-seq) over specified time periods has been widely used to dissect the cell populations during mammalian embryogenesis. Integrating such scRNA-seq data from different developmental stages and from different laboratories is critical to comprehensively define and understand the molecular dynamics and systematically reconstruct the lineage trajectories. Here, we describe a novel algorithm to integrate heterogenous temporal scRNA-seq datasets and to preserve the global developmental trajectories. We applied this algorithm and approach to integrate 3,387 single cells from seven heterogenous temporal scRNA-seq datasets, and reconstructed the cell atlas of early mouse cardiovascular development from E6.5 to E9.5. Using this integrated atlas, we identified an Etv2 downstream target, Ebf1, as an important transcription factor for mouse endothelial development.

2018 ◽  
Author(s):  
Yingxin Lin ◽  
Shila Ghazanfar ◽  
Kevin Wang ◽  
Johann A. Gagnon-Bartsch ◽  
Kitty K. Lo ◽  
...  

AbstractConcerted examination of multiple collections of single cell RNA-Seq (scRNA-Seq) data promises further biological insights that cannot be uncovered with individual datasets. However, such integrative analyses are challenging and require sophisticated methodologies. To enable effective interrogation of multiple scRNA-Seq datasets, we have developed a novel algorithm, named scMerge, that removes unwanted variation by combining stably expressed genes and utilizing pseudo-replicates across datasets. Analysis of large collections of publicly available datasets demonstrates that scMerge performs well in multiple scenarios and enhances biological discovery, including inferring cell developmental trajectories.


2021 ◽  
Vol 23 (1) ◽  
Author(s):  
Bhupinder Pal ◽  
Yunshun Chen ◽  
Michael J. G. Milevskiy ◽  
François Vaillant ◽  
Lexie Prokopuk ◽  
...  

Abstract Background Heterogeneity within the mouse mammary epithelium and potential lineage relationships have been recently explored by single-cell RNA profiling. To further understand how cellular diversity changes during mammary ontogeny, we profiled single cells from nine different developmental stages spanning late embryogenesis, early postnatal, prepuberty, adult, mid-pregnancy, late-pregnancy, and post-involution, as well as the transcriptomes of micro-dissected terminal end buds (TEBs) and subtending ducts during puberty. Methods The single cell transcriptomes of 132,599 mammary epithelial cells from 9 different developmental stages were determined on the 10x Genomics Chromium platform, and integrative analyses were performed to compare specific time points. Results The mammary rudiment at E18.5 closely aligned with the basal lineage, while prepubertal epithelial cells exhibited lineage segregation but to a less differentiated state than their adult counterparts. Comparison of micro-dissected TEBs versus ducts showed that luminal cells within TEBs harbored intermediate expression profiles. Ductal basal cells exhibited increased chromatin accessibility of luminal genes compared to their TEB counterparts suggesting that lineage-specific chromatin is established within the subtending ducts during puberty. An integrative analysis of five stages spanning the pregnancy cycle revealed distinct stage-specific profiles and the presence of cycling basal, mixed-lineage, and 'late' alveolar intermediates in pregnancy. Moreover, a number of intermediates were uncovered along the basal-luminal progenitor cell axis, suggesting a continuum of alveolar-restricted progenitor states. Conclusions This extended single cell transcriptome atlas of mouse mammary epithelial cells provides the most complete coverage for mammary epithelial cells during morphogenesis to date. Together with chromatin accessibility analysis of TEB structures, it represents a valuable framework for understanding developmental decisions within the mouse mammary gland.


2021 ◽  
Vol 7 (8) ◽  
pp. eabe3610
Author(s):  
Conor J. Kearney ◽  
Stephin J. Vervoort ◽  
Kelly M. Ramsbottom ◽  
Izabela Todorovski ◽  
Emily J. Lelliott ◽  
...  

Multimodal single-cell RNA sequencing enables the precise mapping of transcriptional and phenotypic features of cellular differentiation states but does not allow for simultaneous integration of critical posttranslational modification data. Here, we describe SUrface-protein Glycan And RNA-seq (SUGAR-seq), a method that enables detection and analysis of N-linked glycosylation, extracellular epitopes, and the transcriptome at the single-cell level. Integrated SUGAR-seq and glycoproteome analysis identified tumor-infiltrating T cells with unique surface glycan properties that report their epigenetic and functional state.


2019 ◽  
Author(s):  
Ning Wang ◽  
Andrew E. Teschendorff

AbstractInferring the activity of transcription factors in single cells is a key task to improve our understanding of development and complex genetic diseases. This task is, however, challenging due to the relatively large dropout rate and noisy nature of single-cell RNA-Seq data. Here we present a novel statistical inference framework called SCIRA (Single Cell Inference of Regulatory Activity), which leverages the power of large-scale bulk RNA-Seq datasets to infer high-quality tissue-specific regulatory networks, from which regulatory activity estimates in single cells can be subsequently obtained. We show that SCIRA can correctly infer regulatory activity of transcription factors affected by high technical dropouts. In particular, SCIRA can improve sensitivity by as much as 70% compared to differential expression analysis and current state-of-the-art methods. Importantly, SCIRA can reveal novel regulators of cell-fate in tissue-development, even for cell-types that only make up 5% of the tissue, and can identify key novel tumor suppressor genes in cancer at single cell resolution. In summary, SCIRA will be an invaluable tool for single-cell studies aiming to accurately map activity patterns of key transcription factors during development, and how these are altered in disease.


Author(s):  
Jinfen Wei ◽  
Zixi Chen ◽  
Meiling Hu ◽  
Ziqing He ◽  
Dawei Jiang ◽  
...  

Hypoxia is a characteristic of tumor microenvironment (TME) and is a major contributor to tumor progression. Yet, subtype identification of tumor-associated non-malignant cells at single-cell resolution and how they influence cancer progression under hypoxia TME remain largely unexplored. Here, we used RNA-seq data of 424,194 single cells from 108 patients to identify the subtypes of cancer cells, stromal cells, and immune cells; to evaluate their hypoxia score; and also to uncover potential interaction signals between these cells in vivo across six cancer types. We identified SPP1+ tumor-associated macrophage (TAM) subpopulation potentially enhanced epithelial–mesenchymal transition (EMT) by interaction with cancer cells through paracrine pattern. We prioritized SPP1 as a TAM-secreted factor to act on cancer cells and found a significant enhanced migration phenotype and invasion ability in A549 lung cancer cells induced by recombinant protein SPP1. Besides, prognostic analysis indicated that a higher expression of SPP1 was found to be related to worse clinical outcome in six cancer types. SPP1 expression was higher in hypoxia-high macrophages based on single-cell data, which was further validated by an in vitro experiment that SPP1 was upregulated in macrophages under hypoxia-cultured compared with normoxic conditions. Additionally, a differential analysis demonstrated that hypoxia potentially influences extracellular matrix remodeling, glycolysis, and interleukin-10 signal activation in various cancer types. Our work illuminates the clearer underlying mechanism in the intricate interaction between different cell subtypes within hypoxia TME and proposes the guidelines for the development of therapeutic targets specifically for patients with high proportion of SPP1+ TAMs in hypoxic lesions.


2017 ◽  
Author(s):  
Zhun Miao ◽  
Ke Deng ◽  
Xiaowo Wang ◽  
Xuegong Zhang

AbstractSummaryThe excessive amount of zeros in single-cell RNA-seq data include “real” zeros due to the on-off nature of gene transcription in single cells and “dropout” zeros due to technical reasons. Existing differential expression (DE) analysis methods cannot distinguish these two types of zeros. We developed an R package DEsingle which employed Zero-Inflated Negative Binomial model to estimate the proportion of real and dropout zeros and to define and detect 3 types of DE genes in single-cell RNA-seq data with higher accuracy.Availability and ImplementationThe R package DEsingle is freely available at https://github.com/miaozhun/DEsingle and is under Bioconductor’s consideration [email protected] informationSupplementary data are available at bioRxiv online.


2020 ◽  
Vol 117 (31) ◽  
pp. 18412-18423 ◽  
Author(s):  
Chia-Chen Hsu ◽  
Jiabao Xu ◽  
Bas Brinkhof ◽  
Hui Wang ◽  
Zhanfeng Cui ◽  
...  

Stem cells with the capability to self-renew and differentiate into multiple cell derivatives provide platforms for drug screening and promising treatment options for a wide variety of neural diseases. Nevertheless, clinical applications of stem cells have been hindered partly owing to a lack of standardized techniques to characterize cell molecular profiles noninvasively and comprehensively. Here, we demonstrate that a label-free and noninvasive single-cell Raman microspectroscopy (SCRM) platform was able to identify neural cell lineages derived from clinically relevant human induced pluripotent stem cells (hiPSCs). By analyzing the intrinsic biochemical profiles of single cells at a large scale (8,774 Raman spectra in total), iPSCs and iPSC-derived neural cells can be distinguished by their intrinsic phenotypic Raman spectra. We identified a Raman biomarker from glycogen to distinguish iPSCs from their neural derivatives, and the result was verified by the conventional glycogen detection assays. Further analysis with a machine learning classification model, utilizing t-distributed stochastic neighbor embedding (t-SNE)-enhanced ensemble stacking, clearly categorized hiPSCs in different developmental stages with 97.5% accuracy. The present study demonstrates the capability of the SCRM-based platform to monitor cell development using high content screening with a noninvasive and label-free approach. This platform as well as our identified biomarker could be extensible to other cell types and can potentially have a high impact on neural stem cell therapy.


Sign in / Sign up

Export Citation Format

Share Document