Udock, the interactive docking entertainment system

Faraday Discussions ◽

10.1039/c3fd00147d ◽

2014 ◽

Vol 169 ◽

pp. 425-441 ◽

Cited By ~ 10

Author(s):

Guillaume Levieux ◽

Guillaume Tiger ◽

Stéphanie Mader ◽

Jean-François Zagury ◽

Stéphane Natkin ◽

...

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Protein Structures ◽

Protein Docking ◽

Conformational Space ◽

Protein Protein Interactions ◽

Binding Interface ◽

Protein Interfaces ◽

Almost All ◽

Experimental Partner

Protein–protein interactions play a crucial role in biological processes. Protein docking calculations' goal is to predict, given two proteins of known structures, the associate conformation of the corresponding complex. Here, we present a new interactive protein docking system, Udock, that makes use of users' cognitive capabilities added up. In Udock, the users tackle simplified representations of protein structures and explore protein–protein interfaces’ conformational space using a gamified interactive docking system with on the fly scoring. We assumed that if given appropriate tools, a naïve user's cognitive capabilities could provide relevant data for (1) the prediction of correct interfaces in binary protein complexes and (2) the identification of the experimental partner in interaction among a set of decoys. To explore this approach experimentally, we conducted a preliminary two week long playtest where the registered users could perform a cross-docking on a dataset comprising 4 binary protein complexes. The users explored almost all the surface of the proteins that were available in the dataset but favored certain regions that seemed more attractive as potential docking spots. These favored regions were located inside or nearby the experimental binding interface for 5 out of the 8 proteins in the dataset. For most of them, the best scores were obtained with the experimental partner. The alpha version of Udock is freely accessible at http://udock.fr.

Download Full-text

Hotspot coevolution at protein-protein interfaces is a key identifier of native protein complexes

10.1101/698233 ◽

2019 ◽

Cited By ~ 1

Author(s):

Sambit K. Mishra ◽

Sarah J. Cooper ◽

Jerry M. Parks ◽

Julie C. Mitchell

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Fundamental Problem ◽

Scoring Function ◽

Binding Mode ◽

Protein Docking ◽

Evolutionary Information ◽

Protein Protein Interactions ◽

Novel Approach ◽

Protein Interfaces

AbstractProtein-protein interactions play a key role in mediating numerous biological functions, with more than half the proteins in living organisms existing as either homo- or hetero-oligomeric assemblies. Protein subunits that form oligomers minimize the free energy of the complex, but exhaustive computational search-based docking methods have not comprehensively addressed the protein docking challenge of distinguishing a natively bound complex from non-native forms. In this study, we propose a scoring function, KFC-E, that accounts for both conservation and coevolution of putative binding hotspot residues at protein-protein interfaces. For a benchmark set of 53 bound complexes, KFC-E identifies a near-native binding mode as the top-scoring pose in 38% and in the top 5 in 55% of the complexes. For a set of 17 unbound complexes, KFC-E identifies a near-native pose in the top 10 ranked poses in more than 50% of the cases. By contrast, a scoring function that incorporates information on coevolution at predicted non-hotspots performs poorly by comparison. Our study highlights the importance of coevolution at hotspot residues in forming natively bound complexes and suggests a novel approach for coevolutionary scoring in protein docking.Author SummaryA fundamental problem in biology is to distinguish between the native and non-native bound forms of protein-protein complexes. Experimental methods are often used to detect the native bound forms of proteins but, are demanding in terms of time and resources. Computational approaches have proven to be a useful alternative; they sample the different binding configurations for a pair of interacting proteins and then use an heuristic or physical model to score them. In this study we propose a new scoring approach, KFC-E, which focuses on the evolutionary contributions from a subset of key interface residues (hotspots) to identify native bound complexes. KFC-E capitalizes on the wealth of information in protein sequence databases by incorporating residue-level conservation and coevolution of putative binding hotspots. As hotspot residues mediate the binding energetics of protein-protein interactions, we hypothesize that the knowledge of putative hotspots coupled with their evolutionary information should be helpful in the identification of native bound protein-protein complexes.

Download Full-text

Mass spectrometry-based cross-linking study shows that the Psb28 protein binds to cytochrome b559 in Photosystem II

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1620360114 ◽

2017 ◽

Vol 114 (9) ◽

pp. 2224-2229 ◽

Cited By ~ 18

Author(s):

Daniel A. Weisz ◽

Haijun Liu ◽

Hao Zhang ◽

Sundarapandian Thangapandian ◽

Emad Tajkhorshid ◽

...

Keyword(s):

Mass Spectrometry ◽

Photosystem Ii ◽

Protein Interactions ◽

Protein Complexes ◽

Protein Docking ◽

Cross Linking ◽

Protein Protein Interactions ◽

Cytochrome B559 ◽

Reaction Center Complex ◽

Assembly Intermediate

Photosystem II (PSII), a large pigment protein complex, undergoes rapid turnover under natural conditions. During assembly of PSII, oxidative damage to vulnerable assembly intermediate complexes must be prevented. Psb28, the only cytoplasmic extrinsic protein in PSII, protects the RC47 assembly intermediate of PSII and assists its efficient conversion into functional PSII. Its role is particularly important under stress conditions when PSII damage occurs frequently. Psb28 is not found, however, in any PSII crystal structure, and its structural location has remained unknown. In this study, we used chemical cross-linking combined with mass spectrometry to capture the transient interaction of Psb28 with PSII. We detected three cross-links between Psb28 and the α- and β-subunits of cytochrome b559, an essential component of the PSII reaction-center complex. These distance restraints enable us to position Psb28 on the cytosolic surface of PSII directly above cytochrome b559, in close proximity to the QB site. Protein–protein docking results also support Psb28 binding in this region. Determination of the Psb28 binding site and other biochemical evidence allow us to propose a mechanism by which Psb28 exerts its protective effect on the RC47 intermediate. This study also shows that isotope-encoded cross-linking with the “mass tags” selection criteria allows confident identification of more cross-linked peptides in PSII than has been previously reported. This approach thus holds promise to identify other transient protein–protein interactions in membrane protein complexes.

Download Full-text

Protein-protein docking using learned three-dimensional representations

10.1101/738690 ◽

2019 ◽

Cited By ~ 1

Author(s):

Georgy Derevyanko ◽

Guillaume Lamoureux

Keyword(s):

Protein Interactions ◽

Network Architecture ◽

Protein Complexes ◽

Three Dimensional ◽

Spatial Arrangement ◽

Protein Docking ◽

Protein Protein Interactions ◽

Translational Invariance ◽

Shape Complementarity ◽

Spatial Features

AbstractProtein-protein interactions are determined by a number of hard-to-capture features related to shape complementarity, electrostatics, and hydrophobicity. These features may be intrinsic to the protein or induced by the presence of a partner. A conventional approach to protein-protein docking consists in engineering a small number of spatial features for each protein, and in minimizing the sum of their correlations with respect to the spatial arrangement of the two proteins. To generalize this approach, we introduce a deep neural network architecture that transforms the raw atomic densities of each protein into complex three-dimensional representations. Each point in the volume containing the protein is described by 48 learned features, which are correlated and combined with the features of a second protein to produce a score dependent on the relative position and orientation of the two proteins. The architecture is based on multiple layers of SE(3)-equivariant convolutional neural networks, which provide built-in rotational and translational invariance of the score with respect to the structure of the complex. The model is trained end-to-end on a set of decoy conformations generated from 851 nonredundant protein-protein complexes and is tested on data from the Protein-Protein Docking Benchmark Version 4.0.

Download Full-text

Conservation of coevolving protein interfaces bridges prokaryote–eukaryote homologies in the twilight zone

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1611861114 ◽

2016 ◽

Vol 113 (52) ◽

pp. 15018-15023 ◽

Cited By ~ 24

Author(s):

Juan Rodriguez-Rivas ◽

Simone Marsili ◽

David Juan ◽

Alfonso Valencia

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Accurate Information ◽

Twilight Zone ◽

Sequence Information ◽

Protein Protein Interactions ◽

Sequence Alignments ◽

Multiple Sequence ◽

Protein Interfaces ◽

Recent Developments

Protein–protein interactions are fundamental for the proper functioning of the cell. As a result, protein interaction surfaces are subject to strong evolutionary constraints. Recent developments have shown that residue coevolution provides accurate predictions of heterodimeric protein interfaces from sequence information. So far these approaches have been limited to the analysis of families of prokaryotic complexes for which large multiple sequence alignments of homologous sequences can be compiled. We explore the hypothesis that coevolution points to structurally conserved contacts at protein–protein interfaces, which can be reliably projected to homologous complexes with distantly related sequences. We introduce a domain-centered protocol to study the interplay between residue coevolution and structural conservation of protein–protein interfaces. We show that sequence-based coevolutionary analysis systematically identifies residue contacts at prokaryotic interfaces that are structurally conserved at the interface of their eukaryotic counterparts. In turn, this allows the prediction of conserved contacts at eukaryotic protein–protein interfaces with high confidence using solely mutational patterns extracted from prokaryotic genomes. Even in the context of high divergence in sequence (the twilight zone), where standard homology modeling of protein complexes is unreliable, our approach provides sequence-based accurate information about specific details of protein interactions at the residue level. Selected examples of the application of prokaryotic coevolutionary analysis to the prediction of eukaryotic interfaces further illustrate the potential of this approach.

Download Full-text

IN SILICO SCREENING OF PROTEIN–PROTEIN INTERACTIONS WITH ALL-TO-ALL RIGID DOCKING AND CLUSTERING: AN APPLICATION TO PATHWAY ANALYSIS

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720009004461 ◽

2009 ◽

Vol 07 (06) ◽

pp. 991-1012 ◽

Cited By ~ 16

Author(s):

YURI MATSUZAKI ◽

YUSUKE MATSUZAKI ◽

TOSHIYUKI SATO ◽

YUTAKA AKIYAMA

Keyword(s):

Pathway Analysis ◽

Structure Data ◽

Protein Interactions ◽

Tertiary Structure ◽

Protein Structures ◽

Protein Docking ◽

Protein Protein Interactions ◽

Benchmark Data ◽

Computational Screening ◽

F Measure

We propose a computational screening system of protein–protein interactions using tertiary structure data. Our system combines all-to-all protein docking and clustering to find interacting protein pairs. We tuned our prediction system by applying various parameters and clustering algorithms and succeeded in outperforming previous methods. This method was also applied to a biological pathway estimation problem to show its use in network level analysis. The structural data were collected from the Protein Data Bank, PDB. Then all-to-all docking among target protein structures was conducted using a conventional protein–protein docking software package, ZDOCK. The highest-ranked 2000 decoys were clustered based on structural similarity among the predicted docking forms. The features of generated clusters were analyzed to estimate the biological relevance of protein–protein interactions. Our system achieves a best F-measure value of 0.43 when applied to a subset of general protein–protein docking benchmark data. The same system was applied to protein data in a bacterial chemotaxis pathway, utilizing essentially the same parameter set as the benchmark data. We obtained 0.45 for the F-measure value. The proposed approach to computational PPI detection is a promising methodology for mediating between structural studies and systems biology by utilizing cumulative protein structure data for pathway analysis.

Download Full-text

Human mitochondrial protein complexes revealed by large-scale coevolution analysis and deep learning-based structure modeling

10.1101/2021.09.14.460228 ◽

2021 ◽

Author(s):

Jimin Pei ◽

Jing Zhang ◽

Qian Cong

Keyword(s):

Deep Learning ◽

Protein Interactions ◽

Large Scale ◽

Protein Complexes ◽

Mitochondrial Protein ◽

Protein Structures ◽

Complex Structures ◽

Protein Protein Interactions ◽

Learning Methods ◽

Contact Probability

AbstractRecent development of deep-learning methods has led to a breakthrough in the prediction accuracy of 3-dimensional protein structures. Extending these methods to protein pairs is expected to allow large-scale detection of protein-protein interactions and modeling protein complexes at the proteome level. We applied RoseTTAFold and AlphaFold2, two of the latest deep-learning methods for structure predictions, to analyze coevolution of human proteins residing in mitochondria, an organelle of vital importance in many cellular processes including energy production, metabolism, cell death, and antiviral response. Variations in mitochondrial proteins have been linked to a plethora of human diseases and genetic conditions. RoseTTAFold, with high computational speed, was used to predict the coevolution of about 95% of mitochondrial protein pairs. Top-ranked pairs were further subject to the modeling of the complex structures by AlphaFold2, which also produced contact probability with high precision and in many cases consistent with RoseTTAFold. Most of the top ranked pairs with high contact probability were supported by known protein-protein interactions and/or similarities to experimental structural complexes. For high-scoring pairs without experimental complex structures, our coevolution analyses and structural models shed light on the details of their interfaces, including CHCHD4-AIFM1, MTERF3-TRUB2, FMC1-ATPAF2, ECSIT-NDUFAF1 and COQ7-COQ9, among others. We also identified novel PPIs (PYURF-NDUFAF5, LYRM1-MTRF1L and COA8-COX10) for several proteins without experimentally characterized interaction partners, leading to predictions of their molecular functions and the biological processes they are involved in.

Download Full-text

Contacts-based prediction of binding affinity in protein–protein complexes

eLife ◽

10.7554/elife.07454 ◽

2015 ◽

Vol 4 ◽

Cited By ~ 119

Author(s):

Anna Vangone ◽

Alexandre MJJ Bonvin

Keyword(s):

Binding Affinity ◽

Conformational Changes ◽

Protein Interactions ◽

Prediction Accuracy ◽

Protein Complexes ◽

Structural Features ◽

Specific Protein ◽

Strong Impact ◽

Protein Protein Interactions ◽

Almost All

Almost all critical functions in cells rely on specific protein–protein interactions. Understanding these is therefore crucial in the investigation of biological systems. Despite all past efforts, we still lack a thorough understanding of the energetics of association of proteins. Here, we introduce a new and simple approach to predict binding affinity based on functional and structural features of the biological system, namely the network of interfacial contacts. We assess its performance against a protein–protein binding affinity benchmark and show that both experimental methods used for affinity measurements and conformational changes have a strong impact on prediction accuracy. Using a subset of complexes with reliable experimental binding affinities and combining our contacts and contact-types-based model with recent observations on the role of the non-interacting surface in protein–protein interactions, we reach a high prediction accuracy for such a diverse dataset outperforming all other tested methods.

Download Full-text

InterPep2: global peptide–protein docking using interaction surface templates

Bioinformatics ◽

10.1093/bioinformatics/btaa005 ◽

2020 ◽

Vol 36 (8) ◽

pp. 2458-2465 ◽

Cited By ~ 2

Author(s):

Isak Johansson-Åkhe ◽

Claudio Mirabello ◽

Björn Wallner

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Structural Features ◽

Protein Docking ◽

Supplementary Information ◽

Peptide Ligand ◽

Protein Protein Interactions ◽

Intrinsically Disordered ◽

Intrinsically Disordered Regions ◽

Improved Performance

Abstract Motivation Interactions between proteins and peptides or peptide-like intrinsically disordered regions are involved in many important biological processes, such as gene expression and cell life-cycle regulation. Experimentally determining the structure of such interactions is time-consuming and difficult because of the inherent flexibility of the peptide ligand. Although several prediction-methods exist, most are limited in performance or availability. Results InterPep2 is a freely available method for predicting the structure of peptide–protein interactions. Improved performance is obtained by using templates from both peptide–protein and regular protein–protein interactions, and by a random forest trained to predict the DockQ-score for a given template using sequence and structural features. When tested on 252 bound peptide–protein complexes from structures deposited after the complexes used in the construction of the training and templates sets of InterPep2, InterPep2-Refined correctly positioned 67 peptides within 4.0 Å LRMSD among top10, similar to another state-of-the-art template-based method which positioned 54 peptides correctly. However, InterPep2 displays a superior ability to evaluate the quality of its own predictions. On a previously established set of 27 non-redundant unbound-to-bound peptide–protein complexes, InterPep2 performs on-par with leading methods. The extended InterPep2-Refined protocol managed to correctly model 15 of these complexes within 4.0 Å LRMSD among top10, without using templates from homologs. In addition, combining the template-based predictions from InterPep2 with ab initio predictions from PIPER-FlexPepDock resulted in 22% more near-native predictions compared to the best single method (22 versus 18). Availability and implementation The program is available from: http://wallnerlab.org/InterPep2. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Mapping the Energy Landscape of PROTAC-Mediated Protein-protein Interactions

10.1101/2021.08.31.458424 ◽

2021 ◽

Author(s):

Jose A Villegas ◽

Tasneem M Vaid ◽

Michael E Johnson ◽

Terry W Moore

Keyword(s):

Protein Interactions ◽

Degrees Of Freedom ◽

Protein Complexes ◽

Energy Landscape ◽

Search Space ◽

Protein Docking ◽

Conformational Space ◽

Conformational Sampling ◽

Test Case ◽

Coordinate Descent Algorithm

One of the principal difficulties in computational modeling of macromolecules is the vast conformational space that arises out of large numbers of atomic degrees of freedom. This problem is a familiar issue in the area of protein-protein docking, where models of protein complexes are generated from the monomeric subunits. Although restriction of molecular flexibility is a commonly used approximation that decreases the dimensionality of the problem, the seemingly endless number of possible ways two binding partners can interact generally necessitates the use of further approximations to explore the search space. Recently, growing interest in using computational tools to build predictive models of PROTAC-mediated complexes has led to the application of state-of-the-art protein-protein docking techniques to tackle this problem. Additionally, the atomic degrees of freedom introduced by flexibility of linkers used in the construction of PROTACs further expands the configurational search space, a problem that can be tackled with conformational sampling tools. However, repurposing existing tools to carry out protein-protein docking and linker conformer generation independently results in extensive sampling of structures incompatible with PROTAC-mediated complex formation. Here we show that it is possible to restrict the search to the space of protein-protein conformations that can be bridged by a PROTAC molecule with a given linker composition by using a cyclic coordinate descent algorithm to position PROTACs into complex-bound configurations. We use this methodology to construct a picture of the energy landscape of PROTAC-mediated interactions in a model test case, and show that the global minimum lies in the space of native-like conformations.

Download Full-text

PatchMAN docking: Modeling peptide-protein interactions in the context of the receptor surface

10.1101/2021.09.02.458699 ◽

2021 ◽

Author(s):

Alisa Khramushin ◽

Tomer Tsaban ◽

Julia Varga ◽

Orly Avraham ◽

Ora Schueler-Furman

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Protein Structures ◽

Binding Pocket ◽

Protein Docking ◽

Peptide Sequence ◽

Sequence Information ◽

Peptide Docking ◽

Docking Approach ◽

Conformer Ensemble

AbstractPeptide docking can be perceived as a subproblem of protein-protein docking. However, due to the short length and flexible nature of peptides, many do not adopt one defined conformation prior to binding. Therefore, to tackle a peptide docking problem, not only the relative orientation between the two partners, but also the bound conformation of the peptide needs to be modeled. Traditional peptide-centered approaches use information about the peptide sequence to generate a representative conformer ensemble, which can then be rigid body docked to the receptor. Alternatively, one may look at this problem from the viewpoint of the receptor, namely that the protein surface defines the peptide bound conformation.We present PatchMAN (Patch-Motif AligNments), a novel peptide docking approach which uses structural motifs to map the receptor surface with backbone scaffolds extracted from protein structures. On a non-redundant set of protein-peptide complexes, starting from free receptor structures, PatchMAN successfully models and identifies near-native peptide-protein complexes in 62% / 81% within 2.5Å / 5Å RMSD, with corresponding sampling in 81% / 100% of the cases, outperforming other approaches. PatchMAN leverages the observation that structural units of peptides with their binding pocket can be found not only within interfaces, but also within monomers. We show that the conformation of the bound peptide is sampled based on the structural context of the receptor only, without taking into account any sequence information. Beyond peptide docking, this approach opens exciting new avenues to study principles of peptide-protein association, and to the design of new peptide binders.

Download Full-text