Entropy-Based Graph Clustering of PPI Networks for Predicting Overlapping Functional Modules of Proteins

Hoyeon Jeong; Yoonbee Kim; Yi-Sue Jung; Dae Ryong Kang; Young-Rae Cho

doi:10.3390/e23101271

Entropy-Based Graph Clustering of PPI Networks for Predicting Overlapping Functional Modules of Proteins

Entropy ◽

10.3390/e23101271 ◽

2021 ◽

Vol 23 (10) ◽

pp. 1271

Author(s):

Hoyeon Jeong ◽

Yoonbee Kim ◽

Yi-Sue Jung ◽

Dae Ryong Kang ◽

Young-Rae Cho

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Clustering Algorithms ◽

Graph Clustering ◽

Functional Modules ◽

Protein Protein Interactions ◽

Overlapping Clusters ◽

Novel Proteins ◽

Ppi Networks ◽

Function Modules

Functional modules can be predicted using genome-wide protein–protein interactions (PPIs) from a systematic perspective. Various graph clustering algorithms have been applied to PPI networks for this task. In particular, the detection of overlapping clusters is necessary because a protein is involved in multiple functions under different conditions. graph entropy (GE) is a novel metric to assess the quality of clusters in a large, complex network. In this study, the unweighted and weighted GE algorithm is evaluated to prove the validity of predicting function modules. To measure clustering accuracy, the clustering results are compared to protein complexes and Gene Ontology (GO) annotations as references. We demonstrate that the GE algorithm is more accurate in overlapping clusters than the other competitive methods. Moreover, we confirm the biological feasibility of the proteins that occur most frequently in the set of identified clusters. Finally, novel proteins for the additional annotation of GO terms are revealed.

Download Full-text

USING INDIRECT PROTEIN–PROTEIN INTERACTIONS FOR PROTEIN COMPLEX PREDICTION

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720008003497 ◽

2008 ◽

Vol 06 (03) ◽

pp. 435-466 ◽

Cited By ~ 97

Author(s):

HON NIAN CHUA ◽

KANG NING ◽

WING-KIN SUNG ◽

HON WAI LEONG ◽

LIMSOON WONG

Keyword(s):

Protein Interactions ◽

Protein Complex ◽

Protein Complexes ◽

Clustering Algorithms ◽

Indirect Interactions ◽

Protein Protein Interactions ◽

Protein Complex Prediction ◽

Ppi Networks ◽

Level 2 ◽

Novel Protein

Protein complexes are fundamental for understanding principles of cellular organizations. As the sizes of protein–protein interaction (PPI) networks are increasing, accurate and fast protein complex prediction from these PPI networks can serve as a guide for biological experiments to discover novel protein complexes. However, it is not easy to predict protein complexes from PPI networks, especially in situations where the PPI network is noisy and still incomplete. Here, we study the use of indirect interactions between level-2 neighbors (level-2 interactions) for protein complex prediction. We know from previous work that proteins which do not interact but share interaction partners (level-2 neighbors) often share biological functions. We have proposed a method in which all direct and indirect interactions are first weighted using topological weight (FS-Weight), which estimates the strength of functional association. Interactions with low weight are removed from the network, while level-2 interactions with high weight are introduced into the interaction network. Existing clustering algorithms can then be applied to this modified network. We have also proposed a novel algorithm that searches for cliques in the modified network, and merge cliques to form clusters using a "partial clique merging" method. Experiments show that (1) the use of indirect interactions and topological weight to augment protein–protein interactions can be used to improve the precision of clusters predicted by various existing clustering algorithms; and (2) our complex-finding algorithm performs very well on interaction networks modified in this way. Since no other information except the original PPI network is used, our approach would be very useful for protein complex prediction, especially for prediction of novel protein complexes.

Download Full-text

From the static interactome to dynamic protein complexes: Three challenges

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720015710018 ◽

2015 ◽

Vol 13 (02) ◽

pp. 1571001 ◽

Cited By ~ 14

Author(s):

Chern Han Yong ◽

Limsoon Wong

Keyword(s):

Protein Interactions ◽

Protein Complexes ◽

Clustering Algorithms ◽

Ppi Network ◽

Protein Protein Interaction ◽

Static Interaction ◽

Ppi Networks ◽

Interaction Screening ◽

Discovery Algorithms ◽

Insight Into

Protein interactions and complexes behave in a dynamic fashion, but this dynamism is not captured by interaction screening technologies, and not preserved in protein–protein interaction (PPI) networks. The analysis of static interaction data to derive dynamic protein complexes leads to several challenges, of which we identify three. First, many proteins participate in multiple complexes, leading to overlapping complexes embedded within highly-connected regions of the PPI network. This makes it difficult to accurately delimit the boundaries of such complexes. Second, many condition- and location-specific PPIs are not detected, leading to sparsely-connected complexes that cannot be picked out by clustering algorithms. Third, the majority of complexes are small complexes (made up of two or three proteins), which are extra sensitive to the effects of extraneous edges and missing co-complex edges. We show that many existing complex-discovery algorithms have trouble predicting such complexes, and show that our insight into the disparity between the static interactome and dynamic protein complexes can be used to improve the performance of complex discovery.

Download Full-text

Identifying Hierarchical and Overlapping Protein Complexes Based on Essential Protein-Protein Interactions and “Seed-Expanding” Method

BioMed Research International ◽

10.1155/2014/838714 ◽

2014 ◽

Vol 2014 ◽

pp. 1-12

Author(s):

Jun Ren ◽

Wei Zhou ◽

Jianxin Wang

Keyword(s):

Protein Interactions ◽

Time Complexity ◽

Protein Complexes ◽

Hierarchical Organization ◽

Experimental Results ◽

Functional Enrichment ◽

Detection Methods ◽

Ppi Network ◽

Protein Protein Interactions ◽

Ppi Networks

Many evidences have demonstrated that protein complexes are overlapping and hierarchically organized in PPI networks. Meanwhile, the large size of PPI network wants complex detection methods have low time complexity. Up to now, few methods can identify overlapping and hierarchical protein complexes in a PPI network quickly. In this paper, a novel method, called MCSE, is proposed based onλ-module and “seed-expanding.” First, it chooses seeds as essential PPIs or edges with high edge clustering values. Then, it identifies protein complexes by expanding each seed to aλ-module. MCSE is suitable for large PPI networks because of its low time complexity. MCSE can identify overlapping protein complexes naturally because a protein can be visited by different seeds. MCSE uses the parameterλ_th to control the range of seed expanding and can detect a hierarchical organization of protein complexes by tuning the value ofλ_th. Experimental results ofS. cerevisiaeshow that this hierarchical organization is similar to that of known complexes in MIPS database. The experimental results also show that MCSE outperforms other previous competing algorithms, such as CPM, CMC, Core-Attachment, Dpclus, HC-PIN, MCL, and NFC, in terms of the functional enrichment and matching with known protein complexes.

Download Full-text

CLUSTERING ALGORITHMS FOR DETECTING FUNCTIONAL MODULES IN PROTEIN INTERACTION NETWORKS

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720009004023 ◽

2009 ◽

Vol 07 (01) ◽

pp. 217-242 ◽

Cited By ~ 24

Author(s):

LIN GAO ◽

PENG-GANG SUN ◽

JIA SONG

Keyword(s):

Protein Interaction ◽

Protein Complexes ◽

Clustering Algorithms ◽

Interaction Network ◽

Future Research ◽

Functional Modules ◽

Sources Of Information ◽

Protein Protein Interaction ◽

Metabolic Functions ◽

Ppi Networks

Protein–Protein Interaction (PPI) networks are believed to be important sources of information related to biological processes and complex metabolic functions of the cell. When studying the workings of a biological cell, it is useful to be able to detect known and predict still undiscovered protein complexes within the cell's PPI networks. Such predictions may be used as an inexpensive tool to direct biological experiments. The increasing amount of available PPI data necessitate a fast, accurate approach to biological complex identification. Because of its importance in the studies of protein interaction network, there are different models and algorithms in identifying functional modules in PPI networks. In this paper, we review some representative algorithms, focusing on the algorithms underlying the approaches and how the algorithms relate to each other. In particular, a comparison is given based on the property of the algorithms. Since the PPI network is noisy and still incomplete, some methods which consider other additional properties for preprocessing and purifying of PPI data are presented. We also give a discussion about the functional annotation and validation of protein complexes. Finally, new progress and future research directions are discussed from the computational viewpoint.

Download Full-text

Discovery–Versus Hypothesis–Driven Detection of Protein–Protein Interactions and Complexes

International Journal of Molecular Sciences ◽

10.3390/ijms22094450 ◽

2021 ◽

Vol 22 (9) ◽

pp. 4450

Author(s):

Isabell Bludau

Keyword(s):

Mass Spectrometry ◽

Data Analysis ◽

Protein Interactions ◽

Prior Information ◽

Protein Complexes ◽

Functional Modules ◽

Protein Protein Interactions ◽

Future Studies ◽

Protein Protein Interaction ◽

Selection Of

Protein complexes are the main functional modules in the cell that coordinate and perform the vast majority of molecular functions. The main approaches to identify and quantify the interactome to date are based on mass spectrometry (MS). Here I summarize the benefits and limitations of different MS-based interactome screens, with a focus on untargeted interactome acquisition, such as co-fractionation MS. Specific emphasis is given to the discussion of discovery- versus hypothesis-driven data analysis concepts and their applicability to large, proteome-wide interactome screens. Hypothesis-driven analysis approaches, i.e., complex- or network-centric, are highlighted as promising strategies for comparative studies. While these approaches require prior information from public databases, also reviewed herein, the available wealth of interactomic data continuously increases, thereby providing more exhaustive information for future studies. Finally, guidance on the selection of interactome acquisition and analysis methods is provided to aid the reader in the design of protein-protein interaction studies.

Download Full-text

Mapping of Protein-Protein Interactions: Web-Based Resources for Revealing Interactomes

Current Medicinal Chemistry ◽

10.2174/0929867325666180214113704 ◽

2019 ◽

Vol 26 (21) ◽

pp. 3890-3910 ◽

Cited By ~ 6

Author(s):

Branislava Gemovic ◽

Neven Sumonja ◽

Radoslav Davidovic ◽

Vladimir Perovic ◽

Nevena Veljkovic

Keyword(s):

Drug Discovery ◽

Protein Interactions ◽

Protein Complexes ◽

Experimental Studies ◽

Protein Protein Interactions ◽

Web Based ◽

Prediction Tools ◽

Physiological Processes ◽

Modern Drug ◽

Ppi Prediction

Background: The significant number of protein-protein interactions (PPIs) discovered by harnessing concomitant advances in the fields of sequencing, crystallography, spectrometry and two-hybrid screening suggests astonishing prospects for remodelling drug discovery. The PPI space which includes up to 650 000 entities is a remarkable reservoir of potential therapeutic targets for every human disease. In order to allow modern drug discovery programs to leverage this, we should be able to discern complete PPI maps associated with a specific disorder and corresponding normal physiology. Objective: Here, we will review community available computational programs for predicting PPIs and web-based resources for storing experimentally annotated interactions. Methods: We compared the capacities of prediction tools: iLoops, Struck2Net, HOMCOS, COTH, PrePPI, InterPreTS and PRISM to predict recently discovered protein interactions. Results: We described sequence-based and structure-based PPI prediction tools and addressed their peculiarities. Additionally, since the usefulness of prediction algorithms critically depends on the quality and quantity of the experimental data they are built on; we extensively discussed community resources for protein interactions. We focused on the active and recently updated primary and secondary PPI databases, repositories specialized to the subject or species, as well as databases that include both experimental and predicted PPIs. Conclusion: PPI complexes are the basis of important physiological processes and therefore, possible targets for cell-penetrating ligands. Reliable computational PPI predictions can speed up new target discoveries through prioritization of therapeutically relevant protein–protein complexes for experimental studies.

Download Full-text

Protein Interaction Domains and Post-Translational Modifications: Structural Features and Drug Discovery Applications

Current Medicinal Chemistry ◽

10.2174/0929867326666190620101637 ◽

2020 ◽

Vol 27 (37) ◽

pp. 6306-6355 ◽

Cited By ~ 2

Author(s):

Marian Vincenzi ◽

Flavia Anna Mercurio ◽

Marilisa Leone

Keyword(s):

Drug Discovery ◽

Protein Interaction ◽

Protein Interactions ◽

Structural Information ◽

Protein Complexes ◽

Structural Features ◽

Protein Protein Interactions ◽

Modular Architecture ◽

Post Translational Modifications ◽

Interaction Domains

Background:: Many pathways regarding healthy cells and/or linked to diseases onset and progression depend on large assemblies including multi-protein complexes. Protein-protein interactions may occur through a vast array of modules known as protein interaction domains (PIDs). Objective:: This review concerns with PIDs recognizing post-translationally modified peptide sequences and intends to provide the scientific community with state of art knowledge on their 3D structures, binding topologies and potential applications in the drug discovery field. Method:: Several databases, such as the Pfam (Protein family), the SMART (Simple Modular Architecture Research Tool) and the PDB (Protein Data Bank), were searched to look for different domain families and gain structural information on protein complexes in which particular PIDs are involved. Recent literature on PIDs and related drug discovery campaigns was retrieved through Pubmed and analyzed. Results and Conclusion:: PIDs are rather versatile as concerning their binding preferences. Many of them recognize specifically only determined amino acid stretches with post-translational modifications, a few others are able to interact with several post-translationally modified sequences or with unmodified ones. Many PIDs can be linked to different diseases including cancer. The tremendous amount of available structural data led to the structure-based design of several molecules targeting protein-protein interactions mediated by PIDs, including peptides, peptidomimetics and small compounds. More studies are needed to fully role out, among different families, PIDs that can be considered reliable therapeutic targets, however, attacking PIDs rather than catalytic domains of a particular protein may represent a route to obtain selective inhibitors.

Download Full-text

Short loop functional commonality identified in leukaemia proteome highlights crucial protein sub-networks

NAR Genomics and Bioinformatics ◽

10.1093/nargab/lqab010 ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Sun Sook Chung ◽

Joseph C F Ng ◽

Anna Laddach ◽

N Shaun B Thomas ◽

Franca Fraternali

Keyword(s):

Protein Interactions ◽

Large Scale ◽

Interaction Network ◽

Protein Protein Interactions ◽

Protein Protein Interaction ◽

Ppi Networks ◽

Short Loop ◽

New Strategy ◽

Loop Network ◽

Protein Protein Interaction Network

Abstract Direct drug targeting of mutated proteins in cancer is not always possible and efficacy can be nullified by compensating protein–protein interactions (PPIs). Here, we establish an in silico pipeline to identify specific PPI sub-networks containing mutated proteins as potential targets, which we apply to mutation data of four different leukaemias. Our method is based on extracting cyclic interactions of a small number of proteins topologically and functionally linked in the Protein–Protein Interaction Network (PPIN), which we call short loop network motifs (SLM). We uncover a new property of PPINs named ‘short loop commonality’ to measure indirect PPIs occurring via common SLM interactions. This detects ‘modules’ of PPI networks enriched with annotated biological functions of proteins containing mutation hotspots, exemplified by FLT3 and other receptor tyrosine kinase proteins. We further identify functional dependency or mutual exclusivity of short loop commonality pairs in large-scale cellular CRISPR–Cas9 knockout screening data. Our pipeline provides a new strategy for identifying new therapeutic targets for drug discovery.

Download Full-text

Frequent assembly of chimeric complexes in the protein interaction network of an interspecies yeast hybrid

Molecular Biology and Evolution ◽

10.1093/molbev/msaa298 ◽

2020 ◽

Author(s):

Rohan Dandage ◽

Caroline M Berger ◽

Isabelle Gagnon-Arsenault ◽

Kyung-Mee Moon ◽

Richard Greg Stacey ◽

...

Keyword(s):

Protein Interactions ◽

Molecular Level ◽

Yeast Species ◽

Protein Complexes ◽

Mitochondrial Protein ◽

Chimeric Protein ◽

Interaction Network ◽

Protein Protein Interactions ◽

Extreme Phenotypes ◽

Yeast Hybrid

Abstract Hybrids between species often show extreme phenotypes, including some that take place at the molecular level. In this study, we investigated the phenotypes of an interspecies diploid hybrid in terms of protein-protein interactions inferred from protein correlation profiling. We used two yeast species, Saccharomyces cerevisiae and Saccharomyces uvarum, which are interfertile, but yet have proteins diverged enough to be differentiated using mass spectrometry. Most of the protein-protein interactions are similar between hybrid and parents, and are consistent with the assembly of chimeric complexes, which we validated using an orthogonal approach for the prefoldin complex. We also identified instances of altered protein-protein interactions in the hybrid, for instance in complexes related to proteostasis and in mitochondrial protein complexes. Overall, this study uncovers the likely frequent occurrence of chimeric protein complexes with few exceptions, which may result from incompatibilities or imbalances between the parental proteins.

Download Full-text

Protein–protein interactions in bacteria: a promising and challenging avenue towards the discovery of new antibiotics

Beilstein Journal of Organic Chemistry ◽

10.3762/bjoc.14.267 ◽

2018 ◽

Vol 14 ◽

pp. 2881-2896 ◽

Cited By ~ 7

Author(s):

Laura Carro

Keyword(s):

Protein Interactions ◽

Bacterial Infections ◽

Multidrug Resistant ◽

Resistant Bacteria ◽

Protein Protein Interactions ◽

Lead Discovery ◽

Antibiotic Development ◽

Cellular Processes ◽

Ppi Networks ◽

New Antibiotics

Antibiotics are potent pharmacological weapons against bacterial infections; however, the growing antibiotic resistance of microorganisms is compromising the efficacy of the currently available pharmacotherapies. Even though antimicrobial resistance is not a new problem, antibiotic development has failed to match the growth of resistant pathogens and hence, it is highly critical to discover new anti-infective drugs with novel mechanisms of action which will help reducing the burden of multidrug-resistant microorganisms. Protein–protein interactions (PPIs) are involved in a myriad of vital cellular processes and have become an attractive target to treat diseases. Therefore, targeting PPI networks in bacteria may offer a new and unconventional point of intervention to develop novel anti-infective drugs which can combat the ever-increasing rate of multidrug-resistant bacteria. This review describes the progress achieved towards the discovery of molecules that disrupt PPI systems in bacteria for which inhibitors have been identified and whose targets could represent an alternative lead discovery strategy to obtain new anti-infective molecules.

Download Full-text