Learning from mistakes: Accurate prediction of cell type-specific transcription factor binding

Mapping Intimacies ◽

10.1101/230011 ◽

2017 ◽

Cited By ~ 3

Author(s):

Jens Keilwagen ◽

Stefan Posch ◽

Jan Grau

Keyword(s):

Transcription Factor ◽

Cell Types ◽

Transcription Factor Binding ◽

Ensemble Prediction ◽

Training Procedure ◽

Cell Type ◽

Binding Motifs ◽

Factor Binding ◽

Cell Type Specific

Computational prediction of cell type-specific, in-vivo transcription factor binding sites is still one of the central challenges in regulatory genomics, and a variety of approaches has been proposed for this purpose.Here, we present our approach that earned a shared first rank in the “ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge” in 2017. This approach employs features derived from chromatin accessibility, binding motifs, gene expression, genomic sequence and annotation to train classifiers using a supervised, discriminative learning principle. Two further key aspects of this approach are learning classifier parameters in an iterative training procedure that successively adds additional negative examples to the training set, and creating an ensemble prediction by averaging over classifiers obtained for different training cell types.In post-challenge analyses, we benchmark the influence of different feature sets and find that chromatin accessiblity and binding motifs are sufficient to yield state-of-the-art performance for in-vivo binding site predictions. We also show that the iterative training procedure and the ensemble prediction are pivotal for the final prediction performance.To make predictions of this approach readily accessible, we predict 682 peak lists for a total of 31 transcription factors in 22 primary cell types and tissues, which are available for download at https://www.synapse.org/#!Synapse:syn11526239, and we demonstrate that these may help to yield biological conclusions. Finally, we provide a user-friendly version of our approach as open source software at http://jstacs.de/index.php/[email protected]

Download Full-text

Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns

Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms ◽

10.1016/j.bbagrm.2019.194443 ◽

2020 ◽

Vol 1863 (6) ◽

pp. 194443 ◽

Cited By ~ 2

Author(s):

Divyanshi Srivastava ◽

Shaun Mahony

Keyword(s):

Transcription Factor ◽

Specific Binding ◽

Transcription Factor Binding ◽

Cell Type ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

Accurate prediction of cell type-specific transcription factor binding

Genome Biology ◽

10.1186/s13059-018-1614-y ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 28

Author(s):

Jens Keilwagen ◽

Stefan Posch ◽

Jan Grau

Keyword(s):

Transcription Factor ◽

Transcription Factor Binding ◽

Accurate Prediction ◽

Cell Type ◽

Specific Transcription Factor ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

Sequence and chromatin determinants of cell-type-specific transcription factor binding

Genome Research ◽

10.1101/gr.127712.111 ◽

2012 ◽

Vol 22 (9) ◽

pp. 1723-1734 ◽

Cited By ~ 153

Author(s):

A. Arvey ◽

P. Agius ◽

W. S. Noble ◽

C. Leslie

Keyword(s):

Transcription Factor ◽

Transcription Factor Binding ◽

Cell Type ◽

Specific Transcription Factor ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

FactorNet: a deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data

10.1101/151274 ◽

2017 ◽

Cited By ~ 17

Author(s):

Daniel Quang ◽

Xiaohui Xie

Keyword(s):

Neural Network ◽

Transcription Factor ◽

Deep Learning ◽

Cell Types ◽

Transcription Factor Binding ◽

Cell Type ◽

Neural Network Models ◽

Factor Binding ◽

Binding Data ◽

Nucleotide Resolution

AbstractDue to the large numbers of transcription factors (TFs) and cell types, querying binding profiles of all TF/cell type pairs is not experimentally feasible, owing to constraints in time and resources. To address this issue, we developed a convolutional-recurrent neural network model, called FactorNet, to computationally impute the missing binding data. FactorNet trains on binding data from reference cell types to make accurate predictions on testing cell types by leveraging a variety of features, including genomic sequences, genome annotations, gene expression, and single-nucleotide resolution sequential signals, such as DNase I cleavage. To the best of our knowledge, this is the first deep learning method to study the rules governing TF binding at such a fine resolution. With FactorNet, a researcher can perform a single sequencing assay, such as DNase-seq, on a cell type and computationally impute dozens of TF binding profiles. This is an integral step for reconstructing the complex networks underlying gene regulation. While neural networks can be computationally expensive to train, we introduce several novel strategies to significantly reduce the overhead. By visualizing the neural network models, we can interpret how the model predicts binding which in turn reveals additional insights into regulatory grammar. We also investigate the variables that affect cross-cell type predictive performance to explain why the model performs better on some TF/cell types than others, and offer insights to improve upon this field. Our method ranked among the top four teams in the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge.

Download Full-text

Prediction of Cell Type Specific Transcription Factor Binding Site Occupancy

Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics - BCB '16 ◽

10.1145/2975167.2985652 ◽

2016 ◽

Author(s):

Faizy Ahsan ◽

Doina Precup ◽

Mathieu Blanchette

Keyword(s):

Transcription Factor ◽

Binding Site ◽

Transcription Factor Binding Site ◽

Site Occupancy ◽

Transcription Factor Binding ◽

Cell Type ◽

Factor Binding Site ◽

Specific Transcription Factor ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

Fast decoding cell type–specific transcription factor binding landscape at single-nucleotide resolution

Genome Research ◽

10.1101/gr.269613.120 ◽

2021 ◽

Author(s):

Hongyang Li ◽

Yuanfang Guan

Keyword(s):

Transcription Factor ◽

Transcription Factor Binding ◽

Cell Type ◽

Single Nucleotide ◽

Specific Transcription Factor ◽

Factor Binding ◽

Fast Decoding ◽

Cell Type Specific ◽

Nucleotide Resolution ◽

Single Nucleotide Resolution

Download Full-text

FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data

Methods ◽

10.1016/j.ymeth.2019.03.020 ◽

2019 ◽

Vol 166 ◽

pp. 40-47 ◽

Cited By ~ 32

Author(s):

Daniel Quang ◽

Xiaohui Xie

Keyword(s):

Transcription Factor ◽

Deep Learning ◽

Transcription Factor Binding ◽

Sequential Data ◽

Cell Type ◽

Specific Transcription Factor ◽

Factor Binding ◽

Learning Framework ◽

Cell Type Specific ◽

Nucleotide Resolution

Download Full-text

Faculty Opinions recommendation of Distinct properties of cell-type-specific and shared transcription factor binding sites.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718122928.793485028 ◽

2013 ◽

Author(s):

Andrew D Sharrocks

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

Cell Type ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

Distinct Properties of Cell-Type-Specific and Shared Transcription Factor Binding Sites

Molecular Cell ◽

10.1016/j.molcel.2013.08.037 ◽

2013 ◽

Vol 52 (1) ◽

pp. 25-36 ◽

Cited By ~ 154

Author(s):

Jason Gertz ◽

Daniel Savic ◽

Katherine E. Varley ◽

E. Christopher Partridge ◽

Alexias Safi ◽

...

Keyword(s):

Transcription Factor ◽

Binding Sites ◽

Transcription Factor Binding Sites ◽

Transcription Factor Binding ◽

Cell Type ◽

Factor Binding ◽

Cell Type Specific

Download Full-text

Identification of functional clusters of transcription factor binding motifs in genome sequences: the MSCAN algorithm

Bioinformatics ◽

10.1093/bioinformatics/btg1021 ◽

2003 ◽

Vol 19 (Suppl 1) ◽

pp. i169-i176 ◽

Cited By ~ 40

Author(s):

O. Johansson ◽

W. Alkema ◽

W. W. Wasserman ◽

J. Lagergren

Keyword(s):

Transcription Factor ◽

Transcription Factor Binding ◽

Genome Sequences ◽

Binding Motifs ◽

Factor Binding ◽

Transcription Factor Binding Motifs

Download Full-text