scholarly journals APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins

2016 ◽  
Author(s):  
Malvika Sharan ◽  
Konrad U. Förstner ◽  
Ana Eulalio ◽  
Jörg Vogel

ABSTRACTRNA-binding proteins (RBPs) have been established as core components of several post-transcriptional gene regulation mechanisms. Experimental techniques such as cross-linking and co-immunoprecipitation have enabled the identification of RBPs, RNA-binding domains (RBDs), and their regulatory roles in the eukaryotic species such as human and yeast in large-scale. In contrast, our knowledge of the number and potential diversity of RBPs in bacteria is poorer due to the technical challenges associated with the existing global screening approaches.We introduce APRICOT, a computational pipeline for the sequence-based identification and characterization of proteins using RBDs known from experimental studies. The pipeline identifies functional motifs in protein sequences using Position Specific Scoring Matrices and Hidden Markov Models of the functional domains and statistically scores them based on a series of sequence-based features. Subsequently, APRICOT identifies putative RBPs and characterizes them by several biological properties. Here we demonstrate the application and adaptability of the pipeline on large-scale protein sets, including the bacterial proteome of Escherichia coli. APRICOT showed better performance on various datasets compared to other existing tools for the sequence-based prediction of RBPs by achieving an average sensitivity and specificity of 0.90 and 0.91 respectively. The command-line tool and its documentation are available at https://pypi.python.org/pypi/bio-apricot

Methods ◽  
1998 ◽  
Vol 15 (3) ◽  
pp. 225-232 ◽  
Author(s):  
Bertram L. Jacobs ◽  
Jeffrey O. Langland ◽  
Teresa Brandt

2018 ◽  
Vol 9 (1) ◽  
Author(s):  
Joel I. Perez-Perri ◽  
Birgit Rogell ◽  
Thomas Schwarzl ◽  
Frank Stein ◽  
Yang Zhou ◽  
...  

AbstractFollowing the realization that eukaryotic RNA-binding proteomes are substantially larger than anticipated, we must now understand their detailed composition and dynamics. Methods such as RNA interactome capture (RIC) have begun to address this need. However, limitations of RIC have been reported. Here we describe enhanced RNA interactome capture (eRIC), a method based on the use of an LNA-modified capture probe, which yields numerous advantages including greater specificity and increased signal-to-noise ratios compared to existing methods. In Jurkat cells, eRIC reduces the rRNA and DNA contamination by >10-fold compared to RIC and increases the detection of RNA-binding proteins. Due to its low background, eRIC also empowers comparative analyses of changes of RNA-bound proteomes missed by RIC. For example, in cells treated with dimethyloxalylglycine, which inhibits RNA demethylases, eRIC identifies m6A-responsive RNA-binding proteins that escape RIC. eRIC will facilitate the unbiased characterization of RBP dynamics in response to biological and pharmacological cues.


2016 ◽  
Author(s):  
Shuya Li ◽  
Fanghong Dong ◽  
Yuexin Wu ◽  
Sai Zhang ◽  
Chen Zhang ◽  
...  

AbstractCharacterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false positive and false negative issues. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in predicting RBP targets and recover false negatives that are common in current CLIP-seq data. In addition, we have demonstrated several new potential applications of DeBooster in understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the influence of different binding behaviors of the ADAR proteins on RNA editing, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs.


2012 ◽  
Vol 146 (3) ◽  
pp. 297-307 ◽  
Author(s):  
Min Kyung Kim ◽  
Hyun Ju Jung ◽  
Dong Hyun Kim ◽  
Hunseung Kang

2020 ◽  
Vol 21 (20) ◽  
pp. 7803
Author(s):  
Julie Miro ◽  
Anne-Laure Bougé ◽  
Eva Murauer ◽  
Emmanuelle Beyne ◽  
Dylan Da Cunha ◽  
...  

The Duchenne muscular dystrophy (DMD) gene has a complex expression pattern regulated by multiple tissue-specific promoters and by alternative splicing (AS) of the resulting transcripts. Here, we used an RNAi-based approach coupled with DMD-targeted RNA-seq to identify RNA-binding proteins (RBPs) that regulate splicing of its skeletal muscle isoform (Dp427m) in a human muscular cell line. A total of 16 RBPs comprising the major regulators of muscle-specific splicing events were tested. We show that distinct combinations of RBPs maintain the correct inclusion in the Dp427m of exons that undergo spatio-temporal AS in other dystrophin isoforms. In particular, our findings revealed the complex networks of RBPs contributing to the splicing of the two short DMD exons 71 and 78, the inclusion of exon 78 in the adult Dp427m isoform being crucial for muscle function. Among the RBPs tested, QKI and DDX5/DDX17 proteins are important determinants of DMD exon inclusion. This is the first large-scale study to determine which RBP proteins act on the physiological splicing of the DMD gene. Our data shed light on molecular mechanisms contributing to the expression of the different dystrophin isoforms, which could be influenced by a change in the function or expression level of the identified RBPs.


Sign in / Sign up

Export Citation Format

Share Document