protein similarity networks Latest Research Papers

Comparative Analysis of Unsupervised Protein Similarity Prediction Based on Graph Embedding

Frontiers in Genetics ◽

10.3389/fgene.2021.744334 ◽

2021 ◽

Vol 12 ◽

Author(s):

Yuanyuan Zhang ◽

Ziqi Wang ◽

Shudong Wang ◽

Junliang Shang

Keyword(s):

Link Prediction ◽

Structural Information ◽

Graph Embedding ◽

Go Annotation ◽

Protein Protein Interaction ◽

Protein Functions ◽

Cell Components ◽

Protein Similarity Networks ◽

Go Terms ◽

Embedding Methods

The study of protein–protein interaction and the determination of protein functions are important parts of proteomics. Computational methods are used to study the similarity between proteins based on Gene Ontology (GO) to explore their functions and possible interactions. GO is a series of standardized terms that describe gene products from molecular functions, biological processes, and cell components. Previous studies on assessing the similarity of GO terms were primarily based on Information Content (IC) between GO terms to measure the similarity of proteins. However, these methods tend to ignore the structural information between GO terms. Therefore, considering the structural information of GO terms, we systematically analyze the performance of the GO graph and GO Annotation (GOA) graph in calculating the similarity of proteins using different graph embedding methods. When applied to the actual Human and Yeast datasets, the feature vectors of GO terms and proteins are learned based on different graph embedding methods. To measure the similarity of the proteins annotated by different GO numbers, we used Dynamic Time Warping (DTW) and cosine to calculate protein similarity in GO graph and GOA graph, respectively. Link prediction experiments were then performed to evaluate the reliability of protein similarity networks constructed by different methods. It is shown that graph embedding methods have obvious advantages over the traditional IC-based methods. We found that random walk graph embedding methods, in particular, showed excellent performance in calculating the similarity of proteins. By comparing link prediction experiment results from GO(DTW) and GOA(cosine) methods, it is shown that GO(DTW) features provide highly effective information for analyzing the similarity among proteins.

Download Full-text

MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks

Bioinformatics ◽

10.1093/bioinformatics/btx755 ◽

2017 ◽

Vol 34 (8) ◽

pp. 1270-1277

Author(s):

Brittney N Keel ◽

Bo Deng ◽

Etsuko N Moriyama

Keyword(s):

Multi Objective ◽

Clustering Approach ◽

Similarity Networks ◽

Protein Similarity Networks

Download Full-text

Fusing multiple protein-protein similarity networks to effectively predict lncRNA-protein interactions

BMC Bioinformatics ◽

10.1186/s12859-017-1819-1 ◽

2017 ◽

Vol 18 (S12) ◽

Cited By ~ 16

Author(s):

Xiaoxiong Zheng ◽

Yang Wang ◽

Kai Tian ◽

Jiaogen Zhou ◽

Jihong Guan ◽

...

Keyword(s):

Protein Interactions ◽

Multiple Protein ◽

Similarity Networks ◽

Protein Similarity Networks

Download Full-text

Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions

Journal of Proteome Research ◽

10.1021/acs.jproteome.5b01031 ◽

2016 ◽

Vol 15 (7) ◽

pp. 2123-2131 ◽

Cited By ~ 10

Author(s):

Te-Lun Mai ◽

Geng-Ming Hu ◽

Chi-Ming Chen

Keyword(s):

Similarity Networks ◽

Protein Similarity Networks

Download Full-text

Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1517551113 ◽

2016 ◽

Vol 113 (13) ◽

pp. 3579-3584 ◽

Cited By ~ 38

Author(s):

Raphaël Méheust ◽

Ehud Zelzion ◽

Debashish Bhattacharya ◽

Philippe Lopez ◽

Eric Bapteste

Keyword(s):

Functional Data ◽

Genetic Information ◽

Calvin Cycle ◽

Carotenoid Biosynthesis ◽

Photosynthetic Eukaryotes ◽

Similarity Networks ◽

Protein Similarity Networks ◽

Oxygen Evolving ◽

S Genes ◽

Wide Swath

The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the “recycling” of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance.

Download Full-text

Protein Similarity Networks Reveal Relationships among Sequence, Structure, and Function within the Cupin Superfamily

PLoS ONE ◽

10.1371/journal.pone.0074477 ◽

2013 ◽

Vol 8 (9) ◽

pp. e74477 ◽

Cited By ~ 42

Author(s):

Richard Uberto ◽

Ellen W. Moomaw

Keyword(s):

Structure And Function ◽

Sequence Structure ◽

Cupin Superfamily ◽

Similarity Networks ◽

Protein Similarity Networks ◽

And Function

Download Full-text

Pythoscape: a framework for generation of large protein similarity networks

Bioinformatics ◽

10.1093/bioinformatics/bts532 ◽

2012 ◽

Vol 28 (21) ◽

pp. 2845-2846 ◽

Cited By ~ 39

Author(s):

Alan E. Barber ◽

Patricia C. Babbitt

Keyword(s):

Large Protein ◽

Similarity Networks ◽

Protein Similarity Networks

Download Full-text

Protein similarity networks and Genetic Algorithm driven feature selection for fold recognition

2008 8th IEEE International Conference on BioInformatics and BioEngineering ◽

10.1109/bibe.2008.4696704 ◽

2008 ◽

Cited By ~ 1

Author(s):

Ioannis K. Valavanis ◽

George M. Spyrou ◽

Konstantina S. Nikita

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Fold Recognition ◽

Similarity Networks ◽

Selection For ◽

Protein Similarity Networks

Download Full-text

protein similarity networks
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Comparative Analysis of Unsupervised Protein Similarity Prediction Based on Graph Embedding

MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks

Fusing multiple protein-protein similarity networks to effectively predict lncRNA-protein interactions

Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions

Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

Protein Similarity Networks Reveal Relationships among Sequence, Structure, and Function within the Cupin Superfamily

Pythoscape: a framework for generation of large protein similarity networks

Protein similarity networks and Genetic Algorithm driven feature selection for fold recognition

Export Citation Format

protein similarity networksRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Comparative Analysis of Unsupervised Protein Similarity Prediction Based on Graph Embedding

MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks

Fusing multiple protein-protein similarity networks to effectively predict lncRNA-protein interactions

Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions

Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

Protein Similarity Networks Reveal Relationships among Sequence, Structure, and Function within the Cupin Superfamily

Pythoscape: a framework for generation of large protein similarity networks

Protein similarity networks and Genetic Algorithm driven feature selection for fold recognition

protein similarity networks
Recently Published Documents