RNA-Bloom provides lightweight reference-free transcriptome assembly for single cells

2019 ◽  
Author(s):  
Ka Ming Nip ◽  
Readman Chiu ◽  
Chen Yang ◽  
Justin Chu ◽  
Hamid Mohamadi ◽  
...  

We present RNA-Bloom, a de novo RNA-seq assembly algorithm that leverages the rich information content in single-cell transcriptome sequencing (scRNA-seq) data to reconstruct cell-specific isoforms. We benchmark RNA-Bloom’s performance against leading bulk RNA-seq assembly approaches, and illustrate its utility in detecting cell-specific gene fusion events using sequencing data from HiSeq-4000 and BGISEQ-500 platforms. We expect RNA-Bloom to boost the utility of scRNA-seq data, expanding what is informatically accessible now.

2017 ◽  
Author(s):  
Navpreet Ranu ◽  
Alexandra-Chloé Villani ◽  
Nir Hacohen ◽  
Paul C. Blainey

There is rising interest in applying single-cell transcriptome analysis and other single-cell sequencing methods to resolve differences between cells. Pooled processing of thousands of single cells is now routinely practiced by introducing cell-specific DNA barcodes early in cell processing protocols [1-5]. However, researchers must sequence a large number of cells to sample rare subpopulations [6-8], even when fluorescence-activated cell sorting (FACS) is used to pre-enrich rare cell populations. Here, a new molecular enrichment method is used in conjunction with FACS enrichment to enable efficient sampling of rare dendritic cell (DC) populations, including the recently identified AXL+SIGLEC6+ (AS DCs) subset [7], within a 10X Genomics single-cell RNA-Seq library. DC populations collectively represent 1-2% of total peripheral blood mononuclear cells (PBMC), with AS DC representing only 1-3% of human blood DCs and 0.01-0.06% of total PBMCs.
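The abundance figures quoted in the abstract above can be checked with a short calculation. The sketch below multiplies the two stated ranges to recover the 0.01-0.06% figure. The target of 100 AS DCs is a hypothetical number chosen only for illustration, not a value from the study, and the function name is likewise an assumption.

```python
# Composite abundance of AS DCs among PBMCs, from the two ranges in the abstract.
dc_in_pbmc = (0.01, 0.02)  # DCs are 1-2% of total PBMCs
as_in_dc = (0.01, 0.03)    # AS DCs are 1-3% of blood DCs


def composite_range(outer, inner):
    """Multiply the low ends and the high ends of two fraction ranges."""
    return (outer[0] * inner[0], outer[1] * inner[1])


low, high = composite_range(dc_in_pbmc, as_in_dc)
print(f"AS DCs among PBMCs: {low:.2%} to {high:.2%}")

# Hypothetical target (an assumption, not from the study): how many unenriched
# PBMCs would have to be sampled to expect about 100 AS DCs?
target_cells = 100
fewest = target_cells / high  # best case: AS DCs at the top of the range
most = target_cells / low     # worst case: AS DCs at the bottom of the range
print(f"Cells to sample: about {fewest:,.0f} to {most:,.0f}")
```

Under these assumptions, the expected number of cells to sample without enrichment runs from roughly 170 thousand to one million, which is the scale of the problem that the combined FACS and molecular enrichment is meant to reduce.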


2018 ◽  
Author(s):  
Sarthak Sharma ◽  
Wei Wang ◽  
Alberto Stolfi

Abstract: The tadpole-type larva of Ciona has emerged as an intriguing model system for the study of neurodevelopment. The Ciona intestinalis connectome has been recently mapped, revealing the smallest central nervous system (CNS) known in any chordate, with only 177 neurons. This minimal CNS is highly reminiscent of larger CNS of vertebrates, sharing many conserved developmental processes, anatomical compartments, neuron subtypes, and even specific neural circuits. Thus, the Ciona tadpole offers a unique opportunity to understand the development and wiring of a chordate CNS at single-cell resolution. Here we report the use of single-cell RNAseq to profile the transcriptomes of single cells isolated by fluorescence-activated cell sorting (FACS) from the whole brain of Ciona robusta (formerly intestinalis Type A) larvae. We have also compared these profiles to bulk RNAseq data from specific subsets of brain cells isolated by FACS using cell type-specific reporter plasmid expression. Taken together, these datasets have begun to reveal the compartment- and cell-specific gene expression patterns that define the organization of the Ciona larval brain.


2015 ◽  
Vol 2015 ◽  
pp. 1-11 ◽  
Author(s):  
Vladimir A. Zhukov ◽  
Alexander I. Zhernakov ◽  
Olga A. Kulaeva ◽  
Nikita I. Ershov ◽  
Alexey Y. Borisov ◽  
...  

The large size and complexity of the garden pea (Pisum sativum L.) genome hamper its sequencing and the discovery of pea gene resources. Although transcriptome sequencing provides extensive information about expressed genes, some tissue-specific transcripts can only be identified from particular organs under appropriate conditions. In this study, we performed RNA sequencing of polyadenylated transcripts from young pea nodules and root tips on an Illumina GAIIx system, followed by de novo transcriptome assembly using the Trinity program. We obtained more than 58,000 and 37,000 contigs from “Nodules” and “Root Tips” assemblies, respectively. The quality of the assemblies was assessed by comparison with pea expressed sequence tags and transcriptome sequencing project data available from NCBI website. The “Nodules” assembly was compared with the “Root Tips” assembly and with pea transcriptome sequencing data from projects indicating tissue specificity. As a result, approximately 13,000 nodule-specific contigs were found and annotated by alignment to known plant protein-coding sequences and by Gene Ontology searching. Of these, 581 sequences were found to possess full CDSs and could thus be considered as novel nodule-specific transcripts of pea. The information about pea nodule-specific gene sequences can be applied for gene-based markers creation, polymorphism studies, and real-time PCR.


eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Ying Zhu ◽  
Mirko Scheibinger ◽  
Daniel Christian Ellwanger ◽  
Jocelyn F Krey ◽  
Dongseok Choi ◽  
...  

Hearing and balance rely on small sensory hair cells that reside in the inner ear. To explore dynamic changes in the abundant proteins present in differentiating hair cells, we used nanoliter-scale shotgun mass spectrometry of single cells, each ~1 picoliter, from utricles of embryonic day 15 chickens. We identified unique constellations of proteins or protein groups from presumptive hair cells and from progenitor cells. The single-cell proteomes enabled the de novo reconstruction of a developmental trajectory using protein expression levels, revealing proteins that greatly increased in expression during differentiation of hair cells (e.g., OCM, CRABP1, GPX2, AK1, GSTO1) and those that decreased during differentiation (e.g., TMSB4X, AGR3). Complementary single-cell transcriptome profiling showed corresponding changes in mRNA during maturation of hair cells. Single-cell proteomics data thus can be mined to reveal features of cellular development that may be missed with transcriptomics.


2017 ◽  
Author(s):  
William Stephenson ◽  
Laura T. Donlin ◽  
Andrew Butler ◽  
Cristina Rozo ◽  
Ali Rashidfarrokhi ◽  
...  

Abstract: Droplet-based single cell RNA-seq has emerged as a powerful technique for massively parallel cellular profiling. While these approaches offer the exciting promise to deconvolute cellular heterogeneity in diseased tissues, the lack of cost-effective, reliable, and user-friendly instrumentation has hindered widespread adoption of droplet microfluidic techniques. To address this, we have developed a microfluidic control instrument that can be easily assembled from 3D printed parts and commercially available components costing approximately $540. We adapted this instrument for massively parallel scRNA-seq and deployed it in a clinical environment to perform single cell transcriptome profiling of disaggregated synovial tissue from a rheumatoid arthritis patient. We sequenced 8,716 single cells from a synovectomy, revealing 16 transcriptomically distinct clusters. These encompass a comprehensive and unbiased characterization of the autoimmune infiltrate, including inflammatory T and NK subsets that contribute to disease biology. Additionally, we identified fibroblast subpopulations that are demarcated via THY1 (CD90) and CD55 expression. Further experiments confirm that these represent synovial fibroblasts residing within the synovial intimal lining and subintimal lining, respectively, each under the influence of differing microenvironments. We envision that this instrument will have broad utility in basic and clinical settings, enabling low-cost and routine application of microfluidic techniques, and in particular single-cell transcriptome profiling.


2021 ◽  
Vol 23 (1) ◽  
Author(s):  
Bhupinder Pal ◽  
Yunshun Chen ◽  
Michael J. G. Milevskiy ◽  
François Vaillant ◽  
Lexie Prokopuk ◽  
...  

Abstract Background: Heterogeneity within the mouse mammary epithelium and potential lineage relationships have been recently explored by single-cell RNA profiling. To further understand how cellular diversity changes during mammary ontogeny, we profiled single cells from nine different developmental stages spanning late embryogenesis, early postnatal, prepuberty, adult, mid-pregnancy, late-pregnancy, and post-involution, as well as the transcriptomes of micro-dissected terminal end buds (TEBs) and subtending ducts during puberty. Methods: The single cell transcriptomes of 132,599 mammary epithelial cells from 9 different developmental stages were determined on the 10x Genomics Chromium platform, and integrative analyses were performed to compare specific time points. Results: The mammary rudiment at E18.5 closely aligned with the basal lineage, while prepubertal epithelial cells exhibited lineage segregation but to a less differentiated state than their adult counterparts. Comparison of micro-dissected TEBs versus ducts showed that luminal cells within TEBs harbored intermediate expression profiles. Ductal basal cells exhibited increased chromatin accessibility of luminal genes compared to their TEB counterparts suggesting that lineage-specific chromatin is established within the subtending ducts during puberty. An integrative analysis of five stages spanning the pregnancy cycle revealed distinct stage-specific profiles and the presence of cycling basal, mixed-lineage, and 'late' alveolar intermediates in pregnancy. Moreover, a number of intermediates were uncovered along the basal-luminal progenitor cell axis, suggesting a continuum of alveolar-restricted progenitor states. Conclusions: This extended single cell transcriptome atlas of mouse mammary epithelial cells provides the most complete coverage for mammary epithelial cells during morphogenesis to date. Together with chromatin accessibility analysis of TEB structures, it represents a valuable framework for understanding developmental decisions within the mouse mammary gland.


PLoS ONE ◽  
2015 ◽  
Vol 10 (5) ◽  
pp. e0125722 ◽  
Author(s):  
Yuli Li ◽  
Xiliang Wang ◽  
Tingting Chen ◽  
Fuwen Yao ◽  
Cuiping Li ◽  
...  

PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3702 ◽  
Author(s):  
Santiago Montero-Mendieta ◽  
Manfred Grabherr ◽  
Henrik Lantz ◽  
Ignacio De la Riva ◽  
Jennifer A. Leonard ◽  
...  

Whole genome sequencing (WGS) is a very valuable resource to understand the evolutionary history of poorly known species. However, in organisms with large genomes, as most amphibians, WGS is still excessively challenging and transcriptome sequencing (RNA-seq) represents a cost-effective tool to explore genome-wide variability. Non-model organisms do not usually have a reference genome and the transcriptome must be assembled de-novo. We used RNA-seq to obtain the transcriptomic profile for Oreobates cruralis, a poorly known South American direct-developing frog. In total, 550,871 transcripts were assembled, corresponding to 422,999 putative genes. Of those, we identified 23,500, 37,349, 38,120 and 45,885 genes present in the Pfam, EggNOG, KEGG and GO databases, respectively. Interestingly, our results suggested that genes related to immune system and defense mechanisms are abundant in the transcriptome of O. cruralis. We also present a pipeline to assist with pre-processing, assembling, evaluating and functionally annotating a de-novo transcriptome from RNA-seq data of non-model organisms. Our pipeline guides the inexperienced user in an intuitive way through all the necessary steps to build de-novo transcriptome assemblies using readily available software and is freely available at: https://github.com/biomendi/TRANSCRIPTOME-ASSEMBLY-PIPELINE/wiki.


2020 ◽  
Author(s):  
Maxim Ivanov ◽  
Albin Sandelin ◽  
Sebastian Marquardt

Abstract Background: The quality of gene annotation determines the interpretation of results obtained in transcriptomic studies. The growing number of genome sequence information calls for experimental and computational pipelines for de novo transcriptome annotation. Ideally, gene and transcript models should be called from a limited set of key experimental data. Results: We developed TranscriptomeReconstructoR, an R package which implements a pipeline for automated transcriptome annotation. It relies on integrating features from independent and complementary datasets: i) full-length RNA-seq for detection of splicing patterns and ii) high-throughput 5' and 3' tag sequencing data for accurate definition of gene borders. The pipeline can also take a nascent RNA-seq dataset to supplement the called gene model with transient transcripts. We reconstructed de novo the transcriptional landscape of wild type Arabidopsis thaliana seedlings as a proof-of-principle. A comparison to the existing transcriptome annotations revealed that our gene model is more accurate and comprehensive than the two most commonly used community gene models, TAIR10 and Araport11. In particular, we identify thousands of transient transcripts missing from the existing annotations. Our new annotation promises to improve the quality of A. thaliana genome research. Conclusions: Our proof-of-concept data suggest a cost-efficient strategy for rapid and accurate annotation of complex eukaryotic transcriptomes. We combine the choice of library preparation methods and sequencing platforms with the dedicated computational pipeline implemented in the TranscriptomeReconstructoR package. The pipeline only requires prior knowledge on the reference genomic DNA sequence, but not the transcriptome. The package seamlessly integrates with Bioconductor packages for downstream analysis.

