scholarly journals Single-cell ATAC-seq Signal Extraction and Enhancement with SCATE

2019 ◽  
Author(s):  
Zhicheng Ji ◽  
Weiqiang Zhou ◽  
Hongkai Ji

AbstractSingle-cell sequencing assay for transposase-accessible chromatin (scATAC-seq) is the state-of-the-art technology for analyzing genome-wide regulatory landscape in single cells. Single-cell ATAC-seq data are sparse and noisy. Analyzing such data is challenging. Existing computational methods cannot accurately reconstruct activities of individual cis-regulatory elements (CREs) in individual cells or rare cell subpopulations. We present a new statistical framework, SCATE, that adaptively integrates information from co-activated CREs, similar cells, and publicly available regulome data to substantially increase the accuracy for estimating activities of individual CREs. We show that using SCATE, one can better reconstruct the regulatory landscape of a heterogeneous sample.

2021 ◽  
Author(s):  
Xiaoyu Tu ◽  
Alexandre P Marand ◽  
Robert J. Schmitz ◽  
Silin Zhong

Understanding how cis-regulatory elements facilitate gene expression is a key question in biology. Recent advances in single-cell genomics have led to the discovery of cell-specific chromatin landscapes that underlie transcription programs. However, the high equipment and reagent costs of commercial systems limit their applications for many laboratories. In this study, we profiled the Arabidopsis root single-cell epigenome using a combinatorial index and dual PCR barcode strategy without the need of any specialized equipment. We generated chromatin accessibility profiles for 13,576 Arabidopsis thaliana root nuclei with an average of 12,784 unique Tn5 integrations per cell and 85% of the Tn5 insertions localizing to discrete accessible chromatin regions. Comparison with data generated from a commercial microfluidic platform revealed that our method is capable of unbiased identification of cell type-specific chromatin accessibility with improved throughput, quality, and efficiency. We anticipate that by removing cost, instrument, and other technical obstacles, this combinatorial indexing method will be a valuable tool for routine investigation of single-cell epigenomes and usher new insight into plant growth, development and their interactions with the environment.


2020 ◽  
Vol 6 (51) ◽  
pp. eaba9031
Author(s):  
Laiyi Fu ◽  
Lihua Zhang ◽  
Emmanuel Dollinger ◽  
Qinke Peng ◽  
Qing Nie ◽  
...  

Characterizing genome-wide binding profiles of transcription factors (TFs) is essential for understanding biological processes. Although techniques have been developed to assess binding profiles within a population of cells, determining them at a single-cell level remains elusive. Here, we report scFAN (single-cell factor analysis network), a deep learning model that predicts genome-wide TF binding profiles in individual cells. scFAN is pretrained on genome-wide bulk assay for transposase-accessible chromatin sequencing (ATAC-seq), DNA sequence, and chromatin immunoprecipitation sequencing (ChIP-seq) data and uses single-cell ATAC-seq to predict TF binding in individual cells. We demonstrate the efficacy of scFAN by both studying sequence motifs enriched within predicted binding peaks and using predicted TFs for discovering cell types. We develop a new metric “TF activity score” to characterize each cell and show that activity scores can reliably capture cell identities. scFAN allows us to discover and study cellular identities and heterogeneity based on chromatin accessibility profiles.


2016 ◽  
Vol 2016 ◽  
pp. 1-8 ◽  
Author(s):  
Baixin Ye ◽  
Qingping Gao ◽  
Zhi Zeng ◽  
Creed M. Stary ◽  
Zhihong Jian ◽  
...  

Cellular heterogeneity is a fundamental characteristic of many cancers. A lack of cellular homogeneity contributes to difficulty in designing targeted oncological therapies. Therefore, the development of novel methods to determine and characterize oncologic cellular heterogeneity is a critical next step in the development of novel cancer therapies. Single-cell sequencing (SCS) technology has been recently employed for analyzing the genetic polymorphisms of individual cells at the genome-wide level. SCS requires (1) precise isolation of the single cell of interest; (2) isolation and amplification of genetic material; and (3) descriptive analysis of genomic, transcriptomic, and epigenomic data. In addition to targeted analysis of single cells isolated from tumor biopsies, SCS technology may be applied to circulating tumor cells, which may aid in predicting tumor progression and metastasis. In this paper, we provide an overview of SCS technology and review the current literature on the potential application of SCS to clinical oncology and research.


Genes ◽  
2021 ◽  
Vol 12 (8) ◽  
pp. 1175
Author(s):  
Amarni L. Thomas ◽  
Judith Marsman ◽  
Jisha Antony ◽  
William Schierding ◽  
Justin M. O’Sullivan ◽  
...  

The RUNX1/AML1 gene encodes a developmental transcription factor that is an important regulator of haematopoiesis in vertebrates. Genetic disruptions to the RUNX1 gene are frequently associated with acute myeloid leukaemia. Gene regulatory elements (REs), such as enhancers located in non-coding DNA, are likely to be important for Runx1 transcription. Non-coding elements that modulate Runx1 expression have been investigated over several decades, but how and when these REs function remains poorly understood. Here we used bioinformatic methods and functional data to characterise the regulatory landscape of vertebrate Runx1. We identified REs that are conserved between human and mouse, many of which produce enhancer RNAs in diverse tissues. Genome-wide association studies detected single nucleotide polymorphisms in REs, some of which correlate with gene expression quantitative trait loci in tissues in which the RE is active. Our analyses also suggest that REs can be variant in haematological malignancies. In summary, our analysis identifies features of the RUNX1 regulatory landscape that are likely to be important for the regulation of this gene in normal and malignant haematopoiesis.


Micromachines ◽  
2018 ◽  
Vol 9 (8) ◽  
pp. 367 ◽  
Author(s):  
Yuguang Liu ◽  
Dirk Schulze-Makuch ◽  
Jean-Pierre de Vera ◽  
Charles Cockell ◽  
Thomas Leya ◽  
...  

Single-cell sequencing is a powerful technology that provides the capability of analyzing a single cell within a population. This technology is mostly coupled with microfluidic systems for controlled cell manipulation and precise fluid handling to shed light on the genomes of a wide range of cells. So far, single-cell sequencing has been focused mostly on human cells due to the ease of lysing the cells for genome amplification. The major challenges that bacterial species pose to genome amplification from single cells include the rigid bacterial cell walls and the need for an effective lysis protocol compatible with microfluidic platforms. In this work, we present a lysis protocol that can be used to extract genomic DNA from both gram-positive and gram-negative species without interfering with the amplification chemistry. Corynebacterium glutamicum was chosen as a typical gram-positive model and Nostoc sp. as a gram-negative model due to major challenges reported in previous studies. Our protocol is based on thermal and chemical lysis. We consider 80% of single-cell replicates that lead to >5 ng DNA after amplification as successful attempts. The protocol was directly applied to Gloeocapsa sp. and the single cells of the eukaryotic Sphaerocystis sp. and achieved a 100% success rate.


2021 ◽  
Vol 9 (Suppl 1) ◽  
pp. A12.1-A12
Author(s):  
Y Arjmand Abbassi ◽  
N Fang ◽  
W Zhu ◽  
Y Zhou ◽  
Y Chen ◽  
...  

Recent advances of high-throughput single cell sequencing technologies have greatly improved our understanding of the complex biological systems. Heterogeneous samples such as tumor tissues commonly harbor cancer cell-specific genetic variants and gene expression profiles, both of which have been shown to be related to the mechanisms of disease development, progression, and responses to treatment. Furthermore, stromal and immune cells within tumor microenvironment interact with cancer cells to play important roles in tumor responses to systematic therapy such as immunotherapy or cell therapy. However, most current high-throughput single cell sequencing methods detect only gene expression levels or epigenetics events such as chromatin conformation. The information on important genetic variants including mutation or fusion is not captured. To better understand the mechanisms of tumor responses to systematic therapy, it is essential to decipher the connection between genotype and gene expression patterns of both tumor cells and cells in the tumor microenvironment. We developed FocuSCOPE, a high-throughput multi-omics sequencing solution that can detect both genetic variants and transcriptome from same single cells. FocuSCOPE has been used to successfully perform single cell analysis of both gene expression profiles and point mutations, fusion genes, or intracellular viral sequences from thousands of cells simultaneously, delivering comprehensive insights of tumor and immune cells in tumor microenvironment at single cell resolution.Disclosure InformationY. Arjmand Abbassi: None. N. Fang: None. W. Zhu: None. Y. Zhou: None. Y. Chen: None. U. Deutsch: None.


Author(s):  
Pinar Demetci ◽  
Rebecca Santorella ◽  
Björn Sandstede ◽  
William Stafford Noble ◽  
Ritambhara Singh

AbstractData integration of single-cell measurements is critical for understanding cell development and disease, but the lack of correspondence between different types of measurements makes such efforts challenging. Several unsupervised algorithms can align heterogeneous single-cell measurements in a shared space, enabling the creation of mappings between single cells in different data domains. However, these algorithms require hyperparameter tuning for high-quality alignments, which is difficult in an unsupervised setting without correspondence information for validation. We present Single-Cell alignment using Optimal Transport (SCOT), an unsupervised learning algorithm that uses Gromov Wasserstein-based optimal transport to align single-cell multi-omics datasets. We compare the alignment performance of SCOT with state-of-the-art algorithms on four simulated and two real-world datasets. SCOT performs on par with state-of-the-art methods but is faster and requires tuning fewer hyperparameters. Furthermore, we provide an algorithm for SCOT to use Gromov Wasserstein distance to guide the parameter selection. Thus, unlike previous methods, SCOT aligns well without using any orthogonal correspondence information to pick the hyperparameters. Our source code and scripts for replicating the results are available at https://github.com/rsinghlab/SCOT.


2021 ◽  
Author(s):  
Hyobin Jeong ◽  
Karen Grimes ◽  
Peter-Martin Bruch ◽  
Tobias Rausch ◽  
Patrick Hasenfeld ◽  
...  

Somatic structural variants (SVs) are widespread in cancer genomes, however, their impact on tumorigenesis and intra-tumour heterogeneity is incompletely understood, since methods to functionally characterize the broad spectrum of SVs arising in cancerous single-cells are lacking. We present a computational method, scNOVA, that couples SV discovery with nucleosome occupancy analysis by haplotype-resolved single-cell sequencing, to systematically uncover SV effects on cis-regulatory elements and gene activity. Application to leukemias and cell lines uncovered SV outcomes at several loci, including dysregulated cancer-related pathways and mono-allelic oncogene expression near SV breakpoints. At the intra-patient level, we identified different yet overlapping subclonal SVs that converge on aberrant Wnt signaling. We also deconvoluted the effects of catastrophic chromosomal rearrangements resulting in oncogenic transcription factor dysregulation. scNOVA directly links SVs to their functional consequences, opening the door for single-cell multiomics of SVs in heterogeneous cell populations.


2019 ◽  
Author(s):  
Christina Huan Shi ◽  
Kevin Y. Yip

AbstractK-mer counting has many applications in sequencing data processing and analysis. However, sequencing errors can produce many false k-mers that substantially increase the memory requirement during counting. We propose a fast k-mer counting method, CQF-deNoise, which has a novel component for dynamically identifying and removing false k-mers while preserving counting accuracy. Compared with four state-of-the-art k-mer counting methods, CQF-deNoise consumed 49-76% less memory than the second best method, but still ran competitively fast. The k-mer counts from CQF-deNoise produced cell clusters from single-cell RNA-seq data highly consistent with CellRanger but required only 5% of the running time at the same memory consumption, suggesting that CQF-deNoise can be used for a preview of cell clusters for an early detection of potential data problems, before running a much more time-consuming full analysis pipeline.


2021 ◽  
Author(s):  
Dennis A Sun ◽  
Nipam H Patel

AbstractEmerging research organisms enable the study of biology that cannot be addressed using classical “model” organisms. The development of novel data resources can accelerate research in such animals. Here, we present new functional genomic resources for the amphipod crustacean Parhyale hawaiensis, facilitating the exploration of gene regulatory evolution using this emerging research organism. We use Omni-ATAC-Seq, an improved form of the Assay for Transposase-Accessible Chromatin coupled with next-generation sequencing (ATAC-Seq), to identify accessible chromatin genome-wide across a broad time course of Parhyale embryonic development. This time course encompasses many major morphological events, including segmentation, body regionalization, gut morphogenesis, and limb development. In addition, we use short- and long-read RNA-Seq to generate an improved Parhyale genome annotation, enabling deeper classification of identified regulatory elements. We leverage a variety of bioinformatic tools to discover differential accessibility, predict nucleosome positioning, infer transcription factor binding, cluster peaks based on accessibility dynamics, classify biological functions, and correlate gene expression with accessibility. Using a Minos transposase reporter system, we demonstrate the potential to identify novel regulatory elements using this approach, including distal regulatory elements. This work provides a platform for the identification of novel developmental regulatory elements in Parhyale, and offers a framework for performing such experiments in other emerging research organisms.Primary Findings-Omni-ATAC-Seq identifies cis-regulatory elements genome-wide during crustacean embryogenesis-Combined short- and long-read RNA-Seq improves the Parhyale genome annotation-ImpulseDE2 analysis identifies dynamically regulated candidate regulatory elements-NucleoATAC and HINT-ATAC enable inference of nucleosome occupancy and transcription factor binding-Fuzzy clustering reveals peaks with distinct accessibility and chromatin dynamics-Integration of accessibility and gene expression reveals possible enhancers and repressors-Omni-ATAC can identify known and novel regulatory elements


Sign in / Sign up

Export Citation Format

Share Document