Large-Scale Protein-Protein Interactions Detection by Integrating Big Biosensing Data with Computational Model

INFERRING PROTEIN-PROTEIN INTERACTIONS FROM MESSENGER RNA EXPRESSION PROFILES WITH SVM

Journal of Biological System ◽

10.1142/s0218339005001525 ◽

2005 ◽

Vol 13 (03) ◽

pp. 287-298 ◽

Cited By ~ 1

Author(s):

JUN CAI ◽

YING HUANG ◽

LIANG JI ◽

YANDA LI

Keyword(s):

High Throughput ◽

Protein Interactions ◽

Messenger Rna ◽

Expression Profiles ◽

Support Vector ◽

Svm Classifier ◽

Good Prediction ◽

Protein Protein Interactions ◽

Protein Protein Interaction ◽

High Throughput Experiments

In post-genomic biology, researchers in the field of proteome focus their attention on the networks of protein interactions that control the lives of cells and organisms. Protein-protein interactions play a useful role in dynamic cellular machinery. In this paper, we developed a method to infer protein-protein interactions based on the theory of support vector machine (SVM). For a given pair of proteins, a new strategy of calculating cross-correlation function of mRNA expression profiles was used to encode SVM vectors. We compared the performance with other methods of inferring protein-protein interaction. Results suggested that, through five-fold cross validation, our SVM model achieved a good prediction. It enables us to show that expression profiles in transcription level can be used to distinguish physical or functional interactions of proteins as well as sequence contents. Lastly, we applied our SVM classifier to evaluate data quality of interaction data sets from four high-throughput experiments. The results show that high-throughput experiments sacrifice some accuracy in determination of interactions because of limitation of experiment technologies.

Download Full-text

High throughput flow cytometry based yeast two-hybrid array approach for large-scale analysis of protein-protein interactions

Cytometry Part A ◽

10.1002/cyto.a.21144 ◽

2011 ◽

Vol 81A (1) ◽

pp. 90-98 ◽

Cited By ~ 7

Author(s):

Jun Chen ◽

Mark B. Carter ◽

Bruce S. Edwards ◽

Hong Cai ◽

Larry A. Sklar

Keyword(s):

Flow Cytometry ◽

High Throughput ◽

Protein Interactions ◽

Large Scale ◽

Scale Analysis ◽

Protein Protein Interactions ◽

Yeast Two Hybrid ◽

Large Scale Analysis ◽

Two Hybrid

Download Full-text

Yeast Two-Hybrid Systems and Protein Interaction Mapping Projects for Yeast and Worm

Yeast ◽

10.1002/1097-0061(20000630)17:2<88::aid-yea20>3.0.co;2-y ◽

2000 ◽

Vol 1 (2) ◽

pp. 88-94 ◽

Cited By ~ 82

Author(s):

Albertha J. M. Walhout ◽

Simon J. Boulton ◽

Marc Vidal

Keyword(s):

Hybrid Systems ◽

High Throughput ◽

Protein Interaction ◽

Protein Interactions ◽

Large Scale ◽

Protein Protein Interactions ◽

Yeast Two Hybrid ◽

Systematic Analysis ◽

Technical Features ◽

Two Hybrid

The availability of complete genome sequences necessitates the development of standardized functional assays to analyse the tens of thousands of predicted gene products in high-throughput experimental settings. Such approaches are collectively referred to as ‘functional genomics’. One approach to investigate the properties of a proteome of interest is by systematic analysis of protein–protein interactions. So far, the yeast two-hybrid system is the most commonly used method for large-scale, high-throughput identification of potential protein–protein interactions. Here, we discuss several technical features of variants of the two-hybrid systems in light of data recently obtained from different protein interaction mapping projects for the budding yeastSaccharomyces cerevisiaeand the nematodeCaenorhabditis elegans.

Download Full-text

Yeast Two-Hybrid Systems and Protein Interaction Mapping Projects for Yeast and Worm

Yeast ◽

10.1155/2000/156745 ◽

2000 ◽

Vol 1 (2) ◽

pp. 88-94 ◽

Cited By ~ 1

Author(s):

Albertha J. M. Walhout ◽

Simon J. Boulton ◽

Marc Vidal

Keyword(s):

Hybrid Systems ◽

High Throughput ◽

Protein Interaction ◽

Protein Interactions ◽

Large Scale ◽

Protein Protein Interactions ◽

Yeast Two Hybrid ◽

Systematic Analysis ◽

Technical Features ◽

Two Hybrid

The availability of complete genome sequences necessitates the development of standardized functional assays to analyse the tens of thousands of predicted gene products in high-throughput experimental settings. Such approaches are collectively referred to as ‘functional genomics’. One approach to investigate the properties of a proteome of interest is by systematic analysis of protein–protein interactions. So far, the yeast two-hybrid system is the most commonly used method for large-scale, high-throughput identification of potential protein–protein interactions. Here, we discuss several technical features of variants of the two-hybrid systems in light of data recently obtained from different protein interaction mapping projects for the budding yeast Saccharomyces cerevisiae and the nematode Caenorhabditis elegans.

Download Full-text

Maximum margin classifier working in a set of strings

Proceedings of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rspa.2015.0551 ◽

2016 ◽

Vol 472 (2187) ◽

pp. 20150551 ◽

Cited By ~ 1

Author(s):

Hitoshi Koyano ◽

Morihiro Hayashida ◽

Tatsuya Akutsu

Keyword(s):

Probability Theory ◽

Protein Interactions ◽

Consensus Sequence ◽

Classification Problem ◽

Amino Acid Sequences ◽

Support Vector ◽

Generalization Error ◽

Protein Protein Interactions ◽

String Kernels ◽

Learning Machine

Numbers and numerical vectors account for a large portion of data. However, recently, the amount of string data generated has increased dramatically. Consequently, classifying string data is a common problem in many fields. The most widely used approach to this problem is to convert strings into numerical vectors using string kernels and subsequently apply a support vector machine that works in a numerical vector space. However, this non-one-to-one conversion involves a loss of information and makes it impossible to evaluate, using probability theory, the generalization error of a learning machine, considering that the given data to train and test the machine are strings generated according to probability laws. In this study, we approach this classification problem by constructing a classifier that works in a set of strings. To evaluate the generalization error of such a classifier theoretically, probability theory for strings is required. Therefore, we first extend a limit theorem for a consensus sequence of strings demonstrated by one of the authors and co-workers in a previous study. Using the obtained result, we then demonstrate that our learning machine classifies strings in an asymptotically optimal manner. Furthermore, we demonstrate the usefulness of our machine in practical data analysis by applying it to predicting protein–protein interactions using amino acid sequences and classifying RNAs by the secondary structure using nucleotide sequences.

Download Full-text

Target-Templated de novo Design of Macrocyclic D-/L-Peptides: Inhibitors of the PD-1/PD-L1 Interaction

10.26434/chemrxiv.11663337.v3 ◽

2020 ◽

Author(s):

Salvador Guardiola ◽

Monica Varese ◽

Xavier Roig ◽

Jesús Garcia ◽

Ernest Giralt

Keyword(s):

Protein Interactions ◽

Cyclic Peptides ◽

General Framework ◽

Large Scale ◽

De Novo ◽

Inhibitory Effect ◽

Original Text ◽

Protein Protein Interactions ◽

Retraction Notice ◽

Pharmaceutical Properties

NOTE: This preprint has been retracted by consensus from all authors. See the retraction notice in place above; the original text can be found under "Version 1", accessible from the version selector above. ------------------------------------------------------------------------ Peptides, together with antibodies, are among the most potent biochemical tools to modulate challenging protein-protein interactions. However, current structure-based methods are largely limited to natural peptides and are not suitable for designing target-specific binders with improved pharmaceutical properties, such as macrocyclic peptides. Here we report a general framework that leverages the computational power of Rosetta for large-scale backbone sampling and energy scoring, followed by side-chain composition, to design heterochiral cyclic peptides that bind to a protein surface of interest. To showcase the applicability of our approach, we identified two peptides (PD-i3 and PD-i6) that target PD-1, a key immune checkpoint, and work as protein ligand decoys. A comprehensive biophysical evaluation confirmed their binding mechanism to PD-1 and their inhibitory effect on the PD-1/PD-L1 interaction. Finally, elucidation of their solution structures by NMR served as validation of our de novo design approach. We anticipate that our results will provide a general framework for designing target-specific drug-like peptides.

Download Full-text

Target-Templated de novo Design of Macrocyclic D-/L-Peptides: Inhibitors of the PD-1/PD-L1 Interaction

10.26434/chemrxiv.11663337 ◽

2020 ◽

Author(s):

Salvador Guardiola ◽

Monica Varese ◽

Xavier Roig ◽

Jesús Garcia ◽

Ernest Giralt

Keyword(s):

Protein Interactions ◽

Cyclic Peptides ◽

General Framework ◽

Large Scale ◽

De Novo ◽

Inhibitory Effect ◽

Original Text ◽

Protein Protein Interactions ◽

Retraction Notice ◽

Pharmaceutical Properties

NOTE: This preprint has been retracted by consensus from all authors. See the retraction notice in place above; the original text can be found under "Version 1", accessible from the version selector above. ------------------------------------------------------------------------ Peptides, together with antibodies, are among the most potent biochemical tools to modulate challenging protein-protein interactions. However, current structure-based methods are largely limited to natural peptides and are not suitable for designing target-specific binders with improved pharmaceutical properties, such as macrocyclic peptides. Here we report a general framework that leverages the computational power of Rosetta for large-scale backbone sampling and energy scoring, followed by side-chain composition, to design heterochiral cyclic peptides that bind to a protein surface of interest. To showcase the applicability of our approach, we identified two peptides (PD-i3 and PD-i6) that target PD-1, a key immune checkpoint, and work as protein ligand decoys. A comprehensive biophysical evaluation confirmed their binding mechanism to PD-1 and their inhibitory effect on the PD-1/PD-L1 interaction. Finally, elucidation of their solution structures by NMR served as validation of our de novo design approach. We anticipate that our results will provide a general framework for designing target-specific drug-like peptides.

Download Full-text

Faculty Opinions recommendation of Comparative assessment of large-scale data sets of protein-protein interactions.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1006598.82257 ◽

2002 ◽

Author(s):

Rob Russell

Keyword(s):

Protein Interactions ◽

Large Scale ◽

Comparative Assessment ◽

Data Sets ◽

Protein Protein Interactions ◽

Large Scale Data ◽

Scale Data ◽

Large Scale Data Sets

Download Full-text

Prediction of Disease Comorbidity Using HeteSim Scores based on Multiple Heterogeneous Networks

Current Gene Therapy ◽

10.2174/1566523219666190917155959 ◽

2019 ◽

Vol 19 (4) ◽

pp. 232-241 ◽

Cited By ~ 5

Author(s):

Xuegong Chen ◽

Wanwan Shi ◽

Lei Deng

Keyword(s):

Protein Interactions ◽

Experimental Studies ◽

Treatment Strategies ◽

Computational Method ◽

Biological Information ◽

Support Vector ◽

Protein Protein Interactions ◽

Efficient Treatment ◽

Disease Associations ◽

Previous State

Background: Accumulating experimental studies have indicated that disease comorbidity causes additional pain to patients and leads to the failure of standard treatments compared to patients who have a single disease. Therefore, accurate prediction of potential comorbidity is essential to design more efficient treatment strategies. However, only a few disease comorbidities have been discovered in the clinic. Objective: In this work, we propose PCHS, an effective computational method for predicting disease comorbidity. Materials and Methods: We utilized the HeteSim measure to calculate the relatedness score for different disease pairs in the global heterogeneous network, which integrates six networks based on biological information, including disease-disease associations, drug-drug interactions, protein-protein interactions and associations among them. We built the prediction model using the Support Vector Machine (SVM) based on the HeteSim scores. Results and Conclusion: The results showed that PCHS performed significantly better than previous state-of-the-art approaches and achieved an AUC score of 0.90 in 10-fold cross-validation. Furthermore, some of our predictions have been verified in literatures, indicating the effectiveness of our method.

Download Full-text