scholarly journals Protofold II: Enhanced Model and Implementation for Kinetostatic Protein Folding1

Author(s):  
Pouya Tavousi ◽  
Morad Behandish ◽  
Horea T. Ilieş ◽  
Kazem Kazerounian

A reliable prediction of three-dimensional (3D) protein structures from sequence data remains a big challenge due to both theoretical and computational difficulties. We have previously shown that our kinetostatic compliance method (KCM) implemented into the Protofold package can overcome some of the key difficulties faced by other de novo structure prediction methods, such as the very small time steps required by the molecular dynamics (MD) approaches or the very large number of samples needed by the Monte Carlo (MC) sampling techniques. In this paper, we improve the free energy formulation used in Protofold by including the typically underrated entropic effects, imparted due to differences in hydrophobicity of the chemical groups, which dominate the folding of most water-soluble proteins. In addition to the model enhancement, we revisit the numerical implementation by redesigning the algorithms and introducing efficient data structures that reduce the expected complexity from quadratic to linear. Moreover, we develop and optimize parallel implementations of the algorithms on both central and graphics processing units (CPU/GPU) achieving speed-ups up to two orders of magnitude on the GPU. Our simulations are consistent with the general behavior observed in the folding process in aqueous solvent, confirming the effectiveness of model improvements. We report on the folding process at multiple levels, namely, the formation of secondary structural elements and tertiary interactions between secondary elements or across larger domains. We also observe significant enhancements in running times that make the folding simulation tractable for large molecules.

Author(s):  
Pouya Tavousi ◽  
Morad Behandish ◽  
Kazem Kazerounian ◽  
Horea T. Ilieş

Protein structure prediction remains one of the significant challenges in computational biology. We have previously shown that our kinetostatic compliance method can overcome some of the key difficulties faced by other de novo structural prediction methods, such as the very small time steps required by the molecular dynamics approaches, or the very large number of samples required by the sampling based techniques. In this paper we extend the previous free energy formulation by adding the solvent effects, which contribute predominantly to the folding phenomena. We show that the addition of the solvation effects, which complement the existing Coulombic and van der Waals interactions, lead to a physically effective energy function. Furthermore, we achieve significant computational speed-up by employing efficient algorithms and data structures that effectively reduce the time complexity from O(n2) to O(n), n being the number of atoms. Our simulations are consistent with the general behavior observed in protein folding, and show that the hydrophobic atoms tend to pack inside the core of the molecule in an aqueous solvent, while a vacuum environment produces no such effect.


Author(s):  
CHANDRAYANI N. ROKDE ◽  
DR.MANALI KSHIRSAGAR

Protein structure prediction (PSP) from amino acid sequence is one of the high focus problems in bioinformatics today. This is due to the fact that the biological function of the protein is determined by its three dimensional structure. The understanding of protein structures is vital to determine the function of a protein and its interaction with DNA, RNA and enzyme. Thus, protein structure is a fundamental area of computational biology. Its importance is intensed by large amounts of sequence data coming from PDB (Protein Data Bank) and the fact that experimentally methods such as X-ray crystallography or Nuclear Magnetic Resonance (NMR)which are used to determining protein structures remains very expensive and time consuming. In this paper, different types of protein structures and methods for its prediction are described.


eLife ◽  
2015 ◽  
Vol 4 ◽  
Author(s):  
Sergey Ovchinnikov ◽  
Lisa Kinch ◽  
Hahnbeom Park ◽  
Yuxing Liao ◽  
Jimin Pei ◽  
...  

The prediction of the structures of proteins without detectable sequence similarity to any protein of known structure remains an outstanding scientific challenge. Here we report significant progress in this area. We first describe de novo blind structure predictions of unprecendented accuracy we made for two proteins in large families in the recent CASP11 blind test of protein structure prediction methods by incorporating residue–residue co-evolution information in the Rosetta structure prediction program. We then describe the use of this method to generate structure models for 58 of the 121 large protein families in prokaryotes for which three-dimensional structures are not available. These models, which are posted online for public access, provide structural information for the over 400,000 proteins belonging to the 58 families and suggest hypotheses about mechanism for the subset for which the function is known, and hypotheses about function for the remainder.


2020 ◽  
Author(s):  
Jiangyan Feng ◽  
Diwakar Shukla

AbstractProteins are dynamic molecules which perform diverse molecular functions by adopting different three-dimensional structures. Recent progress in residue-residue contacts prediction opens up new avenues for the de novo protein structure prediction from sequence information. However, it is still difficult to predict more than one conformation from residue-residue contacts alone. This is due to the inability to deconvolve the complex signals of residue-residue contacts, i.e. spatial contacts relevant for protein folding, conformational diversity, and ligand binding. Here, we introduce a machine learning based method, called FingerprintContacts, for extending the capabilities of residue-residue contacts. This algorithm leverages the features of residue-residue contacts, that is, (1) a single conformation outperforms the others in the structural prediction using all the top ranking residue-residue contacts as structural constraints, and (2) conformation specific contacts rank lower and constitute a small fraction of residue-residue contacts. We demonstrate the capabilities of FingerprintContacts on eight ligand binding proteins with varying conformational motions. Furthermore, FingerprintContacts identifies small clusters of residue-residue contacts which are preferentially located in the dynamically fluctuating regions. With the rapid growth in protein sequence information, we expect FingerprintContacts to be a powerful first step in structural understanding of protein functional mechanisms.


Author(s):  
Arun G. Ingale

To predict the structure of protein from a primary amino acid sequence is computationally difficult. An investigation of the methods and algorithms used to predict protein structure and a thorough knowledge of the function and structure of proteins are critical for the advancement of biology and the life sciences as well as the development of better drugs, higher-yield crops, and even synthetic bio-fuels. To that end, this chapter sheds light on the methods used for protein structure prediction. This chapter covers the applications of modeled protein structures and unravels the relationship between pure sequence information and three-dimensional structure, which continues to be one of the greatest challenges in molecular biology. With this resource, it presents an all-encompassing examination of the problems, methods, tools, servers, databases, and applications of protein structure prediction, giving unique insight into the future applications of the modeled protein structures. In this chapter, current protein structure prediction methods are reviewed for a milieu on structure prediction, the prediction of structural fundamentals, tertiary structure prediction, and functional imminent. The basic ideas and advances of these directions are discussed in detail.


2019 ◽  
Vol 20 (10) ◽  
pp. 2442 ◽  
Author(s):  
Teppei Ikeya ◽  
Peter Güntert ◽  
Yutaka Ito

To date, in-cell NMR has elucidated various aspects of protein behaviour by associating structures in physiological conditions. Meanwhile, current studies of this method mostly have deduced protein states in cells exclusively based on ‘indirect’ structural information from peak patterns and chemical shift changes but not ‘direct’ data explicitly including interatomic distances and angles. To fully understand the functions and physical properties of proteins inside cells, it is indispensable to obtain explicit structural data or determine three-dimensional (3D) structures of proteins in cells. Whilst the short lifetime of cells in a sample tube, low sample concentrations, and massive background signals make it difficult to observe NMR signals from proteins inside cells, several methodological advances help to overcome the problems. Paramagnetic effects have an outstanding potential for in-cell structural analysis. The combination of a limited amount of experimental in-cell data with software for ab initio protein structure prediction opens an avenue to visualise 3D protein structures inside cells. Conventional nuclear Overhauser effect spectroscopy (NOESY)-based structure determination is advantageous to elucidate the conformations of side-chain atoms of proteins as well as global structures. In this article, we review current progress for the structure analysis of proteins in living systems and discuss the feasibility of its future works.


2004 ◽  
Vol 02 (03) ◽  
pp. 471-495 ◽  
Author(s):  
LUIGI PALOPOLI ◽  
GIORGIO TERRACINA

Predicting the three-dimensional structure of proteins is a difficult task. In the last few years several approaches have been proposed for performing this task taking into account different protein chemical and physical properties. As a result, a growing number of protein structure prediction tools is becoming available, some of them specialized to work on either some aspects of the predictions or on some categories of proteins; however, they are still not sufficiently accurate and reliable for predicting all kinds of proteins. In this context, it is useful to jointly apply different prediction tools and combine their results in order to improve the quality of the predictions. However, several problems have to be solved in order to make this a viable possibility. In this paper a framework and a tool is proposed which allows: (i) definition of a common reference applicative domain for different prediction tools; (ii) characterization of prediction tools through evaluating some quality parameters; (iii) characterization of the performances of a team of predictors jointly applied over a prediction problem; (iv) the singling out of the best team for a prediction problem; and (v) the integration of predictor results in the team in order to obtain a unique prediction. A system implementing the various steps of the proposed framework (CooPPS) has been developed and several experiments for testing the effectiveness of the proposed approach have been carried out.


2021 ◽  
Vol 12 (3) ◽  
pp. 3259-3304

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) that transmitted from animal to human became a life-threatening pandemic in 2020. Scientists are currently testing several drugs to eradicate the COVID-19 outbreak. However, there is no 100 % effective drug or vaccine against SARS-CoV-2 has been discovered so far. In this study, we explored the structure prediction and functional analysis of 75 Malaysia SARS-CoV-2 strain’s structural and accessory proteins without the presence of experimental models. Physiochemical analysis, secondary structure analysis, structure prediction, functional characterization, active site identification, and evolutionary analysis based on the amino acid sequences retrieved from National Centre for Biotechnology Information (NCBI). Three-dimensional (3-D) protein structures were built using the Swiss model. The quality of protein models was verified by ERRAT, PROCHECK, and Verify 3D tools. Active prediction analysis revealed the high potential active sites of proteins where the anti-viral drug or vaccine may bind and inhibit the viral activities. Molecular phylogenetic analysis of ORF10, ORF8, and ORF6 proteins from five different species was analyzed. The results from this analysis proved that Homo sapiens SARS-CoV-2 had high genetic similarity with the bat coronavirus. These analyses may help in designing structure-based anti-viral drugs or to develop potential vaccines for SARS-CoV-2.


2021 ◽  
Author(s):  
Prasun Kumar ◽  
Neil G. Paterson ◽  
Jonathan Clayden ◽  
Derek N. Woolfson

Compared with the iconic α helix, 310 helices occur much less frequently in protein structures. The different 310-helical parameters lead to energetically less favourable internal energies, and a reduced tendency to pack into defined higher-order structures. Consequently, in natural proteins, 310 helices rarely extend past 6 residues, and do not form regular supersecondary, tertiary, or quaternary interactions. Here, we show that despite their absence in nature, synthetic protein-like assemblies can be built from 310 helices. We report the rational design, solution-phase characterisation, and an X-ray crystal structure for water-soluble bundles of 310 helices with consolidated hydrophobic cores. The design uses 6-residue repeats informed by analysing natural 310 helices, and incorporates aminoisobutyric acid residues. Design iterations reveal a tipping point between α-helical and 310-helical folding, and identify features required for stabilising assemblies in this unexplored region of protein-structure space.


2021 ◽  
Vol 8 (3) ◽  
pp. 103-111
Author(s):  
Krishna R Gupta ◽  
Uttam Patle ◽  
Uma Kabra ◽  
P. Mishra ◽  
Milind J Umekar

Three-dimensional protein structure prediction from amino acid sequence has been a thought-provoking task for decades, but it of pivotal importance as it provides a better understanding of its function. In recent years, the methods for prediction of protein structures have advanced considerably. Computational techniques and increase in protein sequence and structure databases have influence the laborious protein structure determination process. Still there is no single method which can predict all the protein structures. In this review, we describe the four stages of protein structure determination. We have also explored the currenttechniques used to uncover the protein structure and highpoint best suitable method for a given protein.


Sign in / Sign up

Export Citation Format

Share Document