CONFOLD2: Improved contact-driven ab initio protein structure modeling

Mapping Intimacies ◽

10.1101/228460 ◽

2017 ◽

Cited By ~ 1

Author(s):

Badri Adhikari ◽

Jianlin Cheng

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Protein Sequence ◽

Structure Prediction ◽

Structural Models ◽

Prediction Methods ◽

Structure Modeling ◽

Residue Contact ◽

Protein Structure Modeling

AbstractBackgroundContact-guided protein structure prediction methods are becoming more and more successful because of the latest advances in residue-residue contact prediction. To support the contact-driven structure prediction, effective tools that can quickly build tertiary structural models of good quality from predicted contacts need to be developed.ResultsWe develop an improved contact-driven protein modeling method, CONFOLD2, and study how it may be effectively used for ab initio protein structure prediction with predicted contacts as input. It builds models using various subsets of input contacts to explore the fold space under the guidance of a soft square energy function, and then clusters the models to obtain top five models. CONFOLD2 is benchmarked on various datasets including CASP11 and 12 datasets with publicly available predicted contacts and yields better performance than the popular CONFOLD method.ConclusionCONFOLD2 allows to quickly generate top five structural models for a protein sequence, when its secondary structures and contacts predictions at hand. CONFOLD2 is publicly available at https://github.com/multicom-toolbox/CONFOLD2/.

Download Full-text

Hybridized distance- and contact-based hierarchical structure modeling for folding soluble and membrane proteins

10.1101/2020.07.05.188466 ◽

2020 ◽

Author(s):

Rahmatullah Roche ◽

Sutanu Bhattacharya ◽

Debswapna Bhattacharya

Keyword(s):

Protein Folding ◽

Protein Structure ◽

Membrane Proteins ◽

Ab Initio ◽

Protein Structure Prediction ◽

Hierarchical Structure ◽

Structure Determination ◽

Structure Prediction ◽

Structure Modeling ◽

Contact Maps

AbstractCrystallography and NMR system (CNS) is currently the de facto standard for fragment-free ab initio protein folding from inter-residue distance or contact maps. Despite its widespread use in protein structure prediction, CNS is a decade-old macromolecular structure determination system that was originally developed for solving macromolecular geometry from experimental restraints as opposed to predictive modeling driven by interaction map data. As such, the adaptation of the CNS experimental structure determination protocol for ab initio protein folding is intrinsically anomalous that may undermine the folding accuracy of computational protein structure prediction. In this paper, we propose a new CNS-free hierarchical structure modeling method called DConStruct for folding both soluble and membrane proteins driven by distance and contact information. Rigorous experimental validation shows that DConStruct attains much better reconstruction accuracy than CNS when tested with the same input contact map at varying contact thresholds. The hierarchical modeling with iterative self-correction employed in DConStruct scales at a much higher degree of folding accuracy than CNS with the increase in contact thresholds, ultimately approaching near-optimal reconstruction accuracy at higher-thresholded contact maps. The folding accuracy of DConStruct can be further improved by exploiting distance-based hybrid interaction maps at tri-level thresholding, as demonstrated by the better performance of our method in folding difficult free modeling targets from the 12th and 13th rounds of the Critical Assessment of techniques for protein Structure Prediction (CASP) experiments compared to several popular CNS- and fragment-based approaches, some of which even using much finer-grained distance maps than ours. Additional large-scale benchmarking shows that DConStruct can significantly improve the folding accuracy of membrane proteins compared to a CNS-based approach. These results collectively demonstrate the feasibility of greatly improving the accuracy of ab initio protein folding by optimally exploiting the information encoded in inter-residue interaction maps beyond what is possible by CNS.Author summaryPredicting the folded and functional 3-dimensional structure of a protein molecule from its amino acid sequence is of central importance to structural biology. Recently, promising advances have been made in ab initio protein folding due to the reasonably accurate estimation of inter-residue interaction maps at increasingly higher resolutions that range from binary contacts to finer-grained distances. Despite the progress in predicting the interaction maps, approaches for turning the residue-residue interactions projected in these maps into their precise spatial positioning heavily rely on a decade-old experimental structure determination protocol that is not suitable for predictive modeling. This paper presents a new hierarchical structure modeling method, DConStruct, which can better exploit the information encoded in the interaction maps at multiple granularities, from binary contact maps to distance-based hybrid maps at tri-level thresholding, for improved ab initio folding. Multiple large-scale benchmarking experiments show that our proposed method can substantially improve the folding accuracy for both soluble and membrane proteins compared to state-of-the-art approaches. DConStruct is licensed under the GNU General Public License v3 and freely available at https://github.com/Bhattacharya-Lab/DConStruct.

Download Full-text

Performance comparison of ab initio protein structure prediction methods

Ain Shams Engineering Journal ◽

10.1016/j.asej.2019.03.004 ◽

2019 ◽

Vol 10 (4) ◽

pp. 713-719 ◽

Cited By ~ 1

Author(s):

Mohamad Yousef ◽

Tamer Abdelkader ◽

Khaled El-Bahnasy

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Performance Comparison ◽

Prediction Methods

Download Full-text

Hybridized distance- and contact-based hierarchical structure modeling for folding soluble and membrane proteins

PLoS Computational Biology ◽

10.1371/journal.pcbi.1008753 ◽

2021 ◽

Vol 17 (2) ◽

pp. e1008753

Author(s):

Rahmatullah Roche ◽

Sutanu Bhattacharya ◽

Debswapna Bhattacharya

Keyword(s):

Protein Folding ◽

Protein Structure ◽

Membrane Proteins ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Determination ◽

Structure Prediction ◽

Structure Modeling ◽

Reconstruction Accuracy ◽

Contact Maps

Crystallography and NMR system (CNS) is currently a widely used method for fragment-free ab initio protein folding from inter-residue distance or contact maps. Despite its widespread use in protein structure prediction, CNS is a decade-old macromolecular structure determination system that was originally developed for solving macromolecular geometry from experimental restraints as opposed to predictive modeling driven by interaction map data. As such, the adaptation of the CNS experimental structure determination protocol for ab initio protein folding is intrinsically anomalous that may undermine the folding accuracy of computational protein structure prediction. In this paper, we propose a new CNS-free hierarchical structure modeling method called DConStruct for folding both soluble and membrane proteins driven by distance and contact information. Rigorous experimental validation shows that DConStruct attains much better reconstruction accuracy than CNS when tested with the same input contact map at varying contact thresholds. The hierarchical modeling with iterative self-correction employed in DConStruct scales at a much higher degree of folding accuracy than CNS with the increase in contact thresholds, ultimately approaching near-optimal reconstruction accuracy at higher-thresholded contact maps. The folding accuracy of DConStruct can be further improved by exploiting distance-based hybrid interaction maps at tri-level thresholding, as demonstrated by the better performance of our method in folding free modeling targets from the 12th and 13th rounds of the Critical Assessment of techniques for protein Structure Prediction (CASP) experiments compared to popular CNS- and fragment-based approaches and energy-minimization protocols, some of which even using much finer-grained distance maps than ours. Additional large-scale benchmarking shows that DConStruct can significantly improve the folding accuracy of membrane proteins compared to a CNS-based approach. These results collectively demonstrate the feasibility of greatly improving the accuracy of ab initio protein folding by optimally exploiting the information encoded in inter-residue interaction maps beyond what is possible by CNS.

Download Full-text

Ab Initio Protein Structure Prediction: Methods and challenges

Biological Knowledge Discovery Handbook ◽

10.1002/9781118617151.ch32 ◽

2013 ◽

pp. 703-724 ◽

Cited By ~ 4

Author(s):

Jad Abbass ◽

Jean-Christophe Nebel ◽

Nashat Mansour

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Prediction Methods

Download Full-text

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment

Scientific Reports ◽

10.1038/srep01895 ◽

2013 ◽

Vol 3 (1) ◽

Cited By ~ 31

Author(s):

Dong Xu ◽

Yang Zhang

Keyword(s):

Escherichia Coli ◽

Protein Structure ◽

Ab Initio ◽

Structure Prediction ◽

Structure Modeling ◽

Protein Structure Modeling ◽

Genome Wide ◽

Ab Initio Structure

Download Full-text

Faculty Opinions recommendation of Contact order and ab initio protein structure prediction.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1008523.107412 ◽

2002 ◽

Author(s):

Kevin Plaxco

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Contact Order

Download Full-text

Faculty Opinions recommendation of Fully automated ab initio protein structure prediction using I-SITES, HMMSTR and ROSETTA.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1008895.124064 ◽

2002 ◽

Author(s):

Burkhard Rost

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction

Download Full-text

Ab Initio Protein Structure Prediction Using Pathway Models

Comparative and Functional Genomics ◽

10.1002/cfg.305 ◽

2003 ◽

Vol 4 (4) ◽

pp. 397-401 ◽

Cited By ~ 1

Author(s):

Xin Yuan ◽

Yu Shao ◽

Christopher Bystroff

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Pathway Models

Download Full-text

FALCON2: a web server for high-quality prediction of protein tertiary structures

BMC Bioinformatics ◽

10.1186/s12859-021-04353-8 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Lupeng Kong ◽

Fusong Ju ◽

Haicang Zhang ◽

Shiwei Sun ◽

Dongbo Bu

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Web Server ◽

High Quality ◽

Tertiary Structures ◽

Target Proteins ◽

High Quality Protein ◽

Protein Functions

Abstract Background Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN) and an ab initio prediction approach (called ProFOLD). Briefly speaking, ProALIGN aligns a target protein with templates through exploiting the patterns of context-specific alignment motifs and then builds the final structure with reference to the homologous templates. In contrast, ProFOLD uses an end-to-end neural network to estimate inter-residue distances of target proteins and builds structures that satisfy these distance constraints. These two approaches emphasize different characteristics of target proteins: ProALIGN exploits structure information of homologous templates of target proteins while ProFOLD exploits the co-evolutionary information carried by homologous protein sequences. Recent progress has shown that the combination of template-based modeling and ab initio approaches is promising. Results In the study, we present FALCON2, a web server that integrates ProALIGN and ProFOLD to provide high-quality protein structure prediction service. For a target protein, FALCON2 executes ProALIGN and ProFOLD simultaneously to predict possible structures and selects the most likely one as the final prediction result. We evaluated FALCON2 on widely-used benchmarks, including 104 CASP13 (the 13th Critical Assessment of protein Structure Prediction) targets and 91 CASP14 targets. In-depth examination suggests that when high-quality templates are available, ProALIGN is superior to ProFOLD and in other cases, ProFOLD shows better performance. By integrating these two approaches with different emphasis, FALCON2 server outperforms the two individual approaches and also achieves state-of-the-art performance compared with existing approaches. Conclusions By integrating template-based modeling and ab initio approaches, FALCON2 provides an easy-to-use and high-quality protein structure prediction service for the community and we expect it to enable insights into a deep understanding of protein functions.

Download Full-text

AIMOES: Archive information assisted multi-objective evolutionary strategy for ab initio protein structure prediction

Knowledge-Based Systems ◽

10.1016/j.knosys.2018.01.028 ◽

2018 ◽

Vol 146 ◽

pp. 58-72 ◽

Cited By ~ 17

Author(s):

Shuangbao Song ◽

Shangce Gao ◽

Xingqian Chen ◽

Dongbao Jia ◽

Xiaoxiao Qian ◽

...

Keyword(s):

Protein Structure ◽

Ab Initio ◽

Protein Structure Prediction ◽

Structure Prediction ◽

Evolutionary Strategy ◽

Multi Objective

Download Full-text