Cardelino: Integrating whole exomes and single-cell transcriptomes to reveal phenotypic impact of somatic variants

Mapping Intimacies ◽

10.1101/413047 ◽

2018 ◽

Cited By ~ 11

Author(s):

Davis J. McCarthy ◽

Raghd Rostom ◽

Yuanhua Huang ◽

Daniel J. Kunz ◽

Petr Danecek ◽

...

Keyword(s):

Cell Cycle ◽

Single Cell ◽

Dermal Fibroblast ◽

Human Fibroblasts ◽

Human Dermal Fibroblast ◽

Computational Method ◽

Neutral Evolution ◽

List Type ◽

Rna Seq ◽

Sequencing Data

AbstractDecoding the clonal substructures of somatic tissues sheds light on cell growth, development and differentiation in health, ageing and disease. DNA-sequencing, either using bulk or using single-cell assays, has enabled the reconstruction of clonal trees from frequency and co-occurrence patterns of somatic variants. However, approaches to systematically characterize phenotypic and functional variations between individual clones are not established. Here we present cardelino (https://github.com/PMBio/cardelino), a computational method for inferring the clone of origin of individual cells that have been assayed using single-cell RNA-seq (scRNA-seq). After validating our model using simulations, we apply cardelino to matched scRNA-seq and exome sequencing data from 32 human dermal fibroblast lines, identifying hundreds of differentially expressed genes between cells from different somatic clones. These genes are frequently enriched for cell cycle and proliferation pathways, indicating a key role for cell division genes in non-neutral somatic evolution.Key findingsA novel approach for integrating DNA-seq and single-cell RNA-seq data to reconstruct clonal substructure for single-cell transcriptomes.Evidence for non-neutral evolution of clonal populations in human fibroblasts.Proliferation and cell cycle pathways are commonly distorted in mutated clonal populations.

Download Full-text

An ultra-sensitive T-cell receptor detection method for TCR-Seq and RNA-Seq data

Bioinformatics ◽

10.1093/bioinformatics/btaa432 ◽

2020 ◽

Vol 36 (15) ◽

pp. 4255-4262

Author(s):

Si-Yi Chen ◽

Chun-Jie Liu ◽

Qiong Zhang ◽

An-Yuan Guo

Keyword(s):

T Cell ◽

Single Cell ◽

High Performance ◽

Cell Receptor ◽

Computational Method ◽

Read Length ◽

Supplementary Information ◽

De Bruijn Graph ◽

Rna Seq ◽

Sequencing Data

Abstract Motivation T-cell receptors (TCRs) function to recognize antigens and play vital roles in T-cell immunology. Surveying TCR repertoires by characterizing complementarity-determining region 3 (CDR3) is a key issue. Due to the high diversity of CDR3 and technological limitation, accurate characterization of CDR3 repertoires remains a great challenge. Results We propose a computational method named CATT for ultra-sensitive and precise TCR CDR3 sequences detection. CATT can be applied on TCR sequencing, RNA-Seq and single-cell TCR(RNA)-Seq data to characterize CDR3 repertoires. CATT integrated de Bruijn graph-based micro-assembly algorithm, data-driven error correction model and Bayesian inference algorithm, to self-adaptively and ultra-sensitively characterize CDR3 repertoires with high performance. Benchmark results of datasets from in silico and experimental data demonstrated that CATT showed superior recall and precision compared with existing tools, especially for data with short read length and small size and single-cell sequencing data. Thus, CATT will be a useful tool for TCR analysis in researches of cancer and immunology. Availability and implementation http://bioinfo.life.hust.edu.cn/CATT or https://github.com/GuoBioinfoLab/CATT. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Nabo – a framework to define leukemia-initiating cells and differentiation in single-cell RNA-sequencing data

10.1101/2020.09.30.321216 ◽

2020 ◽

Author(s):

Parashar Dhapola ◽

Mohamed Eldeeb ◽

Amol Ugale ◽

Rasmus Olofzon ◽

Eva Erlandsson ◽

...

Keyword(s):

Gene Expression ◽

Single Cell ◽

Population Heterogeneity ◽

Computational Method ◽

Cellular Heterogeneity ◽

Specific Cell ◽

Rna Seq ◽

Cell Mapping ◽

Sequencing Data ◽

Cell Stage

ABSTRACTSingle-cell transcriptomics facilitates innovative approaches to define and identify cell types within tissues and cell populations. An emerging interest in the cancer field is to assess the heterogeneity of transformed cells, including the identification of tumor-initiating cells based on similarities to their normal counterparts. However, such cell mapping is often confounded by the large effects on total gene expression programs introduced by strong perturbations such as an oncogenic event. Here, we present Nabo, a novel computational method that allows mapping of cells from one population to the most similar cells in a reference population, independently of confounding changes to gene expression programs initiated by perturbation. We validated this method on multiple datasets from different sources and platforms and show that Nabo achieves higher rates of accuracy than conventional classification methods. Nabo is available as an integrated toolkit for preprocessing, cell mapping, differential gene expression identification, and visualization of single-cell RNA-Seq data. For exploratory studies, Nabo includes methods to help evaluate the reliability of cell mapping results. We applied Nabo on droplet-based single-cell RNA-Seq data of healthy and oncogene-induced (MLL-ENL) hematopoietic progenitor cells (GMLPs) differentiating in vitro. Despite a substantial cellular heterogeneity resulting from differentiation of GMLPs and the large transcriptional effects induced by the fusion oncogene, Nabo could pinpoint the specific cell stage where differentiation arrest occurs, which included an immunophenotypic definition of the tumor-initiating population. Thus, Nabo allows for relevant comparison between target and control cells, without being confounded by differences in population heterogeneity.

Download Full-text

Human dermal fibroblast subpopulations are conserved across single-cell RNA sequencing studies

Journal of Investigative Dermatology ◽

10.1016/j.jid.2020.11.028 ◽

2020 ◽

Author(s):

Alex M. Ascensión ◽

Sandra Fuertes-Álvarez ◽

Olga Ibañez-Solé ◽

Ander Izeta ◽

Marcos J. Araúzo-Bravo

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Dermal Fibroblast ◽

Human Dermal Fibroblast ◽

Single Cell Rna Sequencing ◽

Sequencing Studies

Download Full-text

K-mer counting with low memory consumption enables fast clustering of single-cell sequencing data without read alignment

10.1101/723833 ◽

2019 ◽

Author(s):

Christina Huan Shi ◽

Kevin Y. Yip

Keyword(s):

Single Cell ◽

State Of The Art ◽

Rna Seq ◽

Sequencing Data ◽

Memory Consumption ◽

Analysis Pipeline ◽

Cell Clusters ◽

Single Cell Sequencing ◽

Sequencing Errors ◽

Full Analysis

AbstractK-mer counting has many applications in sequencing data processing and analysis. However, sequencing errors can produce many false k-mers that substantially increase the memory requirement during counting. We propose a fast k-mer counting method, CQF-deNoise, which has a novel component for dynamically identifying and removing false k-mers while preserving counting accuracy. Compared with four state-of-the-art k-mer counting methods, CQF-deNoise consumed 49-76% less memory than the second best method, but still ran competitively fast. The k-mer counts from CQF-deNoise produced cell clusters from single-cell RNA-seq data highly consistent with CellRanger but required only 5% of the running time at the same memory consumption, suggesting that CQF-deNoise can be used for a preview of cell clusters for an early detection of potential data problems, before running a much more time-consuming full analysis pipeline.

Download Full-text

Spatial and Single-Cell Transcriptional Profiling Identifies Functionally Distinct Human Dermal Fibroblast Subpopulations

Journal of Investigative Dermatology ◽

10.1016/j.jid.2018.01.016 ◽

2018 ◽

Vol 138 (4) ◽

pp. 811-825 ◽

Cited By ~ 106

Author(s):

Christina Philippeos ◽

Stephanie B. Telerman ◽

Bénédicte Oulès ◽

Angela O. Pisco ◽

Tanya J. Shaw ◽

...

Keyword(s):

Single Cell ◽

Dermal Fibroblast ◽

Transcriptional Profiling ◽

Human Dermal Fibroblast

Download Full-text

Characterizing and inferring quantitative cell cycle phase in single-cell RNA-seq data analysis

Genome Research ◽

10.1101/gr.247759.118 ◽

2020 ◽

Vol 30 (4) ◽

pp. 611-621 ◽

Cited By ~ 3

Author(s):

Chiaowen Joyce Hsiao ◽

PoYuan Tung ◽

John D. Blischak ◽

Jonathan E. Burnett ◽

Kenneth A. Barr ◽

...

Keyword(s):

Cell Cycle ◽

Data Analysis ◽

Single Cell ◽

Cell Cycle Phase ◽

Cycle Phase ◽

Rna Seq

Download Full-text

A computational method to aid in the design and analysis of single cell RNA-seq experiments

2017 IEEE 7th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) ◽

10.1109/iccabs.2017.8114311 ◽

2017 ◽

Author(s):

Douglas Abrams ◽

Parveen Kumar ◽

Krishna Karuturi ◽

Joshy George

Keyword(s):

Single Cell ◽

Computational Method ◽

Rna Seq

Download Full-text

STACAS: Sub-Type Anchor Correction for Alignment in Seurat to integrate single-cell RNA-seq data

Bioinformatics ◽

10.1093/bioinformatics/btaa755 ◽

2020 ◽

Cited By ~ 1

Author(s):

Massimo Andreatta ◽

Santiago J Carmona

Keyword(s):

Single Cell ◽

Distance Measure ◽

Source Code ◽

Cell Types ◽

R Package ◽

Computational Method ◽

Biological Variability ◽

Rna Seq ◽

Batch Effects ◽

Guide Trees

Abstract Summary STACAS is a computational method for the identification of integration anchors in the Seurat environment, optimized for the integration of single-cell (sc) RNA-seq datasets that share only a subset of cell types. We demonstrate that by (i) correcting batch effects while preserving relevant biological variability across datasets, (ii) filtering aberrant integration anchors with a quantitative distance measure and (iii) constructing optimal guide trees for integration, STACAS can accurately align scRNA-seq datasets composed of only partially overlapping cell populations. Availability and implementation Source code and R package available at https://github.com/carmonalab/STACAS; Docker image available at https://hub.docker.com/repository/docker/mandrea1/stacas_demo.

Download Full-text

Identifying and removing the cell-cycle effect from single-cell RNA-Sequencing data

Scientific Reports ◽

10.1038/srep33892 ◽

2016 ◽

Vol 6 (1) ◽

Cited By ~ 45

Author(s):

Martin Barron ◽

Jun Li

Keyword(s):

Cell Cycle ◽

Single Cell ◽

Rna Sequencing ◽

Sequencing Data ◽

Single Cell Rna Sequencing

Download Full-text

SSCC: a novel computational framework for rapid and accurate clustering large single cell RNA-seq data

10.1101/344242 ◽

2018 ◽

Cited By ~ 2

Author(s):

Xianwen Ren ◽

Liangtao Zheng ◽

Zemin Zhang

Keyword(s):

Single Cell ◽

Rna Sequencing ◽

Large Scale ◽

Random Projection ◽

Rna Seq ◽

Sequencing Data ◽

Computational Framework ◽

Human Blood Cells ◽

Single Cell Rna Sequencing ◽

Data Volume

ABSTRACTClustering is a prevalent analytical means to analyze single cell RNA sequencing data but the rapidly expanding data volume can make this process computational challenging. New methods for both accurate and efficient clustering are of pressing needs. Here we proposed a new clustering framework based on random projection and feature construction for large scale single-cell RNA sequencing data, which greatly improves clustering accuracy, robustness and computational efficacy for various state-of-the-art algorithms benchmarked on multiple real datasets. On a dataset with 68,578 human blood cells, our method reached 20% improvements for clustering accuracy and 50-fold acceleration but only consumed 66% memory usage compared to the widely-used software package SC3. Compared to k-means, the accuracy improvement can reach 3-fold depending on the concrete dataset. An R implementation of the framework is available from https://github.com/Japrin/sscClust.

Download Full-text