scholarly journals An Ensemble Classifier to Predict Protein–Protein Interactions by Combining PSSM-based Evolutionary Information with Local Binary Pattern Model

2019 ◽  
Vol 20 (14) ◽  
pp. 3511 ◽  
Author(s):  
Yang Li ◽  
Li-Ping Li ◽  
Lei Wang ◽  
Chang-Qing Yu ◽  
Zheng Wang ◽  
...  

Protein plays a critical role in the regulation of biological cell functions. Among them, whether proteins interact with each other has become a fundamental problem, because proteins usually perform their functions by interacting with other proteins. Although a large amount of protein–protein interactions (PPIs) data has been produced by high-throughput biotechnology, the disadvantage of biological experimental technique is time-consuming and costly. Thus, computational methods for predicting protein interactions have become a research hot spot. In this research, we propose an efficient computational method that combines Rotation Forest (RF) classifier with Local Binary Pattern (LBP) feature extraction method to predict PPIs from the perspective of Position-Specific Scoring Matrix (PSSM). The proposed method has achieved superior performance in predicting Yeast, Human, and H. pylori datasets with average accuracies of 92.12%, 96.21%, and 86.59%, respectively. In addition, we also evaluated the performance of the proposed method on the four independent datasets of C. elegans, H. pylori, H. sapiens, and M. musculus datasets. These obtained experimental results fully prove that our model has good feasibility and robustness in predicting PPIs.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Yang Li ◽  
Zheng Wang ◽  
Li-Ping Li ◽  
Zhu-Hong You ◽  
Wen-Zhun Huang ◽  
...  

AbstractVarious biochemical functions of organisms are performed by protein–protein interactions (PPIs). Therefore, recognition of protein–protein interactions is very important for understanding most life activities, such as DNA replication and transcription, protein synthesis and secretion, signal transduction and metabolism. Although high-throughput technology makes it possible to generate large-scale PPIs data, it requires expensive cost of both time and labor, and leave a risk of high false positive rate. In order to formulate a more ingenious solution, biology community is looking for computational methods to quickly and efficiently discover massive protein interaction data. In this paper, we propose a computational method for predicting PPIs based on a fresh idea of combining orthogonal locality preserving projections (OLPP) and rotation forest (RoF) models, using protein sequence information. Specifically, the protein sequence is first converted into position-specific scoring matrices (PSSMs) containing protein evolutionary information by using the Position-Specific Iterated Basic Local Alignment Search Tool (PSI-BLAST). Then we characterize a protein as a fixed length feature vector by applying OLPP to PSSMs. Finally, we train an RoF classifier for the purpose of identifying non-interacting and interacting protein pairs. The proposed method yielded a significantly better results than existing methods, with 90.07% and 96.09% prediction accuracy on Yeast and Human datasets. Our experiment show the proposed method can serve as a useful tool to accelerate the process of solving key problems in proteomics.


2018 ◽  
Author(s):  
Oleksandr Narykov ◽  
Nathan Johnson ◽  
Dmitry Korkin

AbstractThe critical role of alternative splicing (AS) in cell functioning has recently become apparent, whether in studying tissue-or cell-specific regulation, or understanding molecular mechanisms governing a complex disorder. Studying the rewiring, or edgetic, effects of alternatively spliced isoforms on protein interactome can provide system-wide insights into these questions. Unfortunately, high-throughput experiments for such studies are expensive and time-consuming, hence the need to develop an in-silico approach. Here, we formulated the problem of characterization the edgetic effects of AS on protein-protein interactions (PPIs) as a binary classification problem and introduced a first computational approach to solve it. We first developed a supervised feature-based classifier that benefited from the traditional features describing a PPI, the problem-specific features that characterized the difference between the reference and alternative isoforms, and a novel domain interaction potential that allowed pinpointing the domains employed during a specific PPI. We then expanded this approach by including a large set of unlabeled interactomics data and developing a semi-supervised learning method. Our method called AS-IN (Alternatively Splicing INteraction prediction) Tool was compared with the state-of-the-art PPI prediction tools and showed a superior performance, achieving 0.92 in precision and recall. We demonstrated the utility of AS-IN Tool by applying it to the transcriptomic data obtained from the brain and liver tissues of a healthy mouse and western diet fed mouse that developed type two diabetes. We showed that the edgetic effects of differentially expressed transcripts associated with the disease condition are system-wide and unlikely to be detected by looking only at the gene-specific expression levels.


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Zheng Wang ◽  
Yang Li ◽  
Zhu-Hong You ◽  
Li-Ping Li ◽  
Xin-Ke Zhan ◽  
...  

Identifying protein-protein interactions (PPIs) plays a vital role in a number of biological activities such as signal transduction, transcriptional regulation, and apoptosis. Although advances in high-throughput technologies have generated large amounts of PPI data for different species, they only cover a small part of the entire PPI network. Furthermore, traditional experimental methods are generally expensive, time-consuming, tedious, and prone to high false-positive rates. Therefore, to overcome this problem, it is necessary to develop a novel computational method for predicting PPIs. In this article, we propose an efficient computational method to detect protein-protein interactions using only protein sequence information, which integrates the MatPCA feature extraction algorithm and the weighted sparse representation classifier. As a result, when predicting PPIs on yeast, human, and H. pylori datasets, the proposed method achieves superior prediction performance with an average accuracy of 94.55%, 97.48%, and 83.64%, respectively. These experimental results further illustrate that the proposed method is reliable and robust in predicting PPIs, which can be regarded as a useful complement to the experimental method.


2019 ◽  
Vol 19 (4) ◽  
pp. 232-241 ◽  
Author(s):  
Xuegong Chen ◽  
Wanwan Shi ◽  
Lei Deng

Background: Accumulating experimental studies have indicated that disease comorbidity causes additional pain to patients and leads to the failure of standard treatments compared to patients who have a single disease. Therefore, accurate prediction of potential comorbidity is essential to design more efficient treatment strategies. However, only a few disease comorbidities have been discovered in the clinic. Objective: In this work, we propose PCHS, an effective computational method for predicting disease comorbidity. Materials and Methods: We utilized the HeteSim measure to calculate the relatedness score for different disease pairs in the global heterogeneous network, which integrates six networks based on biological information, including disease-disease associations, drug-drug interactions, protein-protein interactions and associations among them. We built the prediction model using the Support Vector Machine (SVM) based on the HeteSim scores. Results and Conclusion: The results showed that PCHS performed significantly better than previous state-of-the-art approaches and achieved an AUC score of 0.90 in 10-fold cross-validation. Furthermore, some of our predictions have been verified in literatures, indicating the effectiveness of our method.


2019 ◽  
Vol 19 (6) ◽  
pp. 430-448 ◽  
Author(s):  
Khalid Bashir Dar ◽  
Aashiq Hussain Bhat ◽  
Shajrul Amin ◽  
Syed Anjum ◽  
Bilal Ahmad Reshi ◽  
...  

Protein-Protein Interactions (PPIs) drive major signalling cascades and play critical role in cell proliferation, apoptosis, angiogenesis and trafficking. Deregulated PPIs are implicated in multiple malignancies and represent the critical targets for treating cancer. Herein, we discuss the key protein-protein interacting domains implicated in cancer notably PDZ, SH2, SH3, LIM, PTB, SAM and PH. These domains are present in numerous enzymes/kinases, growth factors, transcription factors, adaptor proteins, receptors and scaffolding proteins and thus represent essential sites for targeting cancer. This review explores the candidature of various proteins involved in cellular trafficking (small GTPases, molecular motors, matrix-degrading enzymes, integrin), transcription (p53, cMyc), signalling (membrane receptor proteins), angiogenesis (VEGFs) and apoptosis (BCL-2family), which could possibly serve as targets for developing effective anti-cancer regimen. Interactions between Ras/Raf; X-linked inhibitor of apoptosis protein (XIAP)/second mitochondria-derived activator of caspases (Smac/DIABLO); Frizzled (FRZ)/Dishevelled (DVL) protein; beta-catenin/T Cell Factor (TCF) have also been studied as prospective anticancer targets. Efficacy of diverse molecules/ drugs targeting such PPIs although evaluated in various animal models/cell lines, there is an essential need for human-based clinical trials. Therapeutic strategies like the use of biologicals, high throughput screening (HTS) and fragment-based technology could play an imperative role in designing cancer therapeutics. Moreover, bioinformatic/computational strategies based on genome sequence, protein sequence/structure and domain data could serve as competent tools for predicting PPIs. Exploring hot spots in proteomic networks represents another approach for developing targetspecific therapeutics. Overall, this review lays emphasis on a productive amalgamation of proteomics, genomics, biochemistry, and molecular dynamics for successful treatment of cancer.


2012 ◽  
Vol 23 (19) ◽  
pp. 3911-3922 ◽  
Author(s):  
Yongqiang Wang ◽  
Xinlei Zhang ◽  
Hong Zhang ◽  
Yi Lu ◽  
Haolong Huang ◽  
...  

The highly abundant α-helical coiled-coil motif not only mediates crucial protein–protein interactions in the cell but is also an attractive scaffold in synthetic biology and material science and a potential target for disease intervention. Therefore a systematic understanding of the coiled-coil interactions (CCIs) at the organismal level would help unravel the full spectrum of the biological function of this interaction motif and facilitate its application in therapeutics. We report the first identified genome-wide CCI network in Saccharomyces cerevisiae, which consists of 3495 pair-wise interactions among 598 predicted coiled-coil regions. Computational analysis revealed that the CCI network is specifically and functionally organized and extensively involved in the organization of cell machinery. We further show that CCIs play a critical role in the assembly of the kinetochore, and disruption of the CCI network leads to defects in kinetochore assembly and cell division. The CCI network identified in this study is a valuable resource for systematic characterization of coiled coils in the shaping and regulation of a host of cellular machineries and provides a basis for the utilization of coiled coils as domain-based probes for network perturbation and pharmacological applications.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Veenstra TD ◽  

Identifying all the molecular components within a living cell is the first step into understanding how it functions. To further understand how a cell functions requires identifying the interactions that occur between these components. This fact is especially relevant for proteins. No protein within a human cell functions on its own without interacting with another biomolecule - usually another protein. While Protein-Protein Interactions (PPI) have historically been determined by examining a single protein per study, novel technologies developed over the past couple of decades are enabling high-throughput methods that aim to describe entire protein networks within cells. In this review, some of the technologies that have led to these developments are described along with applications of these techniques. Ultimately the goal of these technologies is to map out the entire circuitry of PPI within human cells to be able to predict the global consequences of perturbations to the cell system. This predictive capability will have major impacts on the future of both disease diagnosis and treatment.


Author(s):  
Sailu Sarvagalla ◽  
Mohane Selvaraj Coumar

Most of the developed kinase inhibitor drugs are ATP competitive and suffer from drawbacks such as off-target kinase activity, development of resistance due to mutation in the ATP binding pocket and unfavorable intellectual property situations. Besides the ATP binding pocket, protein kinases have binding sites that are involved in Protein-Protein Interactions (PPIs); these PPIs directly or indirectly regulate the protein kinase activity. Of recent, small molecule inhibitors of PPIs are emerging as an alternative to ATP competitive agents. Rational design of inhibitors for kinase PPIs could be carried out using molecular modeling techniques. In silico tools available for the prediction of hot spot residues and cavities at the PPI sites and the means to utilize this information for the identification of inhibitors are discussed. Moreover, in silico studies to target the Aurora B-INCENP PPI sites are discussed in context. Overall, this chapter provides detailed in silico strategies that are available to the researchers for carrying out structure-based drug design of PPI inhibitors.


2019 ◽  
Vol 20 (S16) ◽  
Author(s):  
Da Zhang ◽  
Mansur Kabuka

Abstract Background Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge. Results In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods. Conclusion To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks.


Sign in / Sign up

Export Citation Format

Share Document