ProtFold-DFG: protein fold recognition by combining Directed Fusion Graph and PageRank algorithm

Author(s):  
Jiangyi Shao ◽  
Bin Liu

Abstract As one of the most important tasks in protein structure prediction, protein fold recognition has attracted more and more attention. In this regard, some computational predictors have been proposed with the development of machine learning and artificial intelligence techniques. However, these existing computational methods are still suffering from some disadvantages. In this regard, we propose a new network-based predictor called ProtFold-DFG for protein fold recognition. We propose the Directed Fusion Graph (DFG) to fuse the ranking lists generated by different methods, which employs the transitive closure to incorporate more relationships among proteins and uses the KL divergence to calculate the relationship between two proteins so as to improve its generalization ability. Finally, the PageRank algorithm is performed on the DFG to accurately recognize the protein folds by considering the global interactions among proteins in the DFG. Tested on a widely used and rigorous benchmark data set, LINDAHL dataset, experimental results show that the ProtFold-DFG outperforms the other 35 competing methods, indicating that ProtFold-DFG will be a useful method for protein fold recognition. The source code and data of ProtFold-DFG can be downloaded from http://bliulab.net/ProtFold-DFG/download

2019 ◽  
Vol 21 (6) ◽  
pp. 2185-2193 ◽  
Author(s):  
Bin Liu ◽  
Yulin Zhu ◽  
Ke Yan

Abstract As an important task in protein structure and function studies, protein fold recognition has attracted more and more attention. The existing computational predictors in this field treat this task as a multi-classification problem, ignoring the relationship among proteins in the dataset. However, previous studies showed that their relationship is critical for protein homology analysis. In this study, the protein fold recognition is treated as an information retrieval task. The Learning to Rank model (LTR) was employed to retrieve the query protein against the template proteins to find the template proteins in the same fold with the query protein in a supervised manner. The triadic closure principle (TCP) was performed on the ranking list generated by the LTR to improve its accuracy by considering the relationship among the query protein and the template proteins in the ranking list. Finally, a predictor called Fold-LTR-TCP was proposed. The rigorous test on the LE benchmark dataset showed that the Fold-LTR-TCP predictor achieved an accuracy of 73.2%, outperforming all the other competing methods.


2014 ◽  
Vol 11 (95) ◽  
pp. 20131147 ◽  
Author(s):  
Agnel Praveen Joseph ◽  
Alexandre G. de Brevern

Protein folding has been a major area of research for many years. Nonetheless, the mechanisms leading to the formation of an active biological fold are still not fully apprehended. The huge amount of available sequence and structural information provides hints to identify the putative fold for a given sequence. Indeed, protein structures prefer a limited number of local backbone conformations, some being characterized by preferences for certain amino acids. These preferences largely depend on the local structural environment. The prediction of local backbone conformations has become an important factor to correctly identifying the global protein fold. Here, we review the developments in the field of local structure prediction and especially their implication in protein fold recognition.


2016 ◽  
Vol 6 (1) ◽  
Author(s):  
Yassine Ghouzam ◽  
Guillaume Postic ◽  
Pierre-Edouard Guerin ◽  
Alexandre G. de Brevern ◽  
Jean-Christophe Gelly

2014 ◽  
Vol 30 (13) ◽  
pp. 1850-1857 ◽  
Author(s):  
Pooya Zakeri ◽  
Ben Jeuris ◽  
Raf Vandebril ◽  
Yves Moreau

Sign in / Sign up

Export Citation Format

Share Document