statistical potentials
Recently Published Documents


TOTAL DOCUMENTS

81
(FIVE YEARS 5)

H-INDEX

26
(FIVE YEARS 0)

Author(s):  
Takayuki Kimura ◽  
Nobuaki Yasuo ◽  
Masakazu Sekijima ◽  
Brooke Lustig

2021 ◽  
Author(s):  
Ya-Lan Tan ◽  
Xunxun Wang ◽  
Ya-Zhou Shi ◽  
Wenbing Zhang ◽  
Zhi-Jie Tan

Knowledge-based statistical potentials have been shown to be rather effective in protein 3-dimensional (3D) structure evaluation and prediction. Recently, several statistical potentials have been developed for RNA 3D structure evaluation, while their performances are either still at low level for the test datasets from structure prediction models or dependent on the "black-box" process through neural networks. In this work, we have developed an all-atom distance-dependent statistical potential based on residue separation for RNA 3D structure evaluation, namely rsRNASP, which is composed of short- and long-ranged potentials distinguished by residue separation. The extensive examinations against available RNA test datasets show that, rsRNASP has apparently higher performance than the existing statistical potentials for the realistic test datasets with large RNAs from structure prediction models including the newly released RNA-Puzzles dataset, and is comparable to the existing top statistical potentials for the test datasets with small RNAs or near-native decoys. Additionally, rsRNASP is also superior to RNA3DCNN, a recently developed scoring function through 3D convolutional neural networks. rsRNASP and the relevant databases are available at website https://github.com/Tan-group/rsRNASP.


2021 ◽  
Author(s):  
Stefano Zamuner ◽  
Flavio Seno ◽  
Antonio Trovato

Knowledge-based approaches use the statistics collected from protein data-bank structures to estimate effective interaction potentials between amino acid pairs. Empirical relations are typically employed that are based on the crucial choice of a reference state associated to the null interaction case. Despite their significant effectiveness, the physical interpretation of knowledge-based potentials has been repeatedly questioned, with no consensus on the choice of the reference state. Here we use the fact that the Flory theorem, originally derived for chains in a dense polymer melt, holds also for chain fragments within the core of globular proteins, if the average over buried fragments collected from different non-redundant native structures is considered. After verifying that the ensuing Gaussian statistics, a hallmark of effectively non-interacting polymer chains, holds for a wide range of fragment lengths, we use it to define a `bona fide' reference state. Notably, despite the latter does depend on fragment length, deviations from it do not. This allows to estimate an effective interaction potential which is not biased by the presence of correlations due to the connectivity of the protein chain. We show how different sequence-independent effective statistical potentials can be derived using this approach by coarse-graining the protein representation at varying levels. The possibility of defining sequence-dependent potentials is explored.


2021 ◽  
Author(s):  
Stefano Zamuner ◽  
Flavio Seno ◽  
Antonio Trovato

Abstract Knowledge-based approaches use the statistics collected from protein databank structures to estimate effective interaction potentials between amino acid pairs. Empirical relations are typically employed that are based on the crucial choice of a reference state associated to the null interaction case. Despite their significant effectiveness, the physical interpretation of knowledge-based potentials has been repeatedly questioned, with no consensus on the choice of the reference state. Here we use the fact that the Flory theorem, originally derived for chains in a dense polymer melt, holds also for chain fragments within the core of globular proteins. After verifying that the ensuing Gaussian statistics, a hallmark of effectively non-interacting polymer chains, holds for a wide range of fragment lengths, we use it to define a ‘bona fide’ reference state. Notably, despite the latter does depend on fragment length, deviations from it do not. This allows to estimate an effective interaction potential which is not biased by the presence of correlations due to the connectivity of the protein chain. We show how different sequence-independent effective statistical potentials can be derived using this approach by coarse-graining the protein representation at varying levels. The possibility of defining sequence-dependent potentials is explored.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Joaquim Aguirre-Plans ◽  
Alberto Meseguer ◽  
Ruben Molina-Fernandez ◽  
Manuel Alejandro Marín-López ◽  
Gaurav Jumde ◽  
...  

Abstract Background Statistical potentials, also named knowledge-based potentials, are scoring functions derived from empirical data that can be used to evaluate the quality of protein folds and protein–protein interaction (PPI) structures. In previous works we decomposed the statistical potentials in different terms, named Split-Statistical Potentials, accounting for the type of amino acid pairs, their hydrophobicity, solvent accessibility and type of secondary structure. These potentials have been successfully used to identify near-native structures in protein structure prediction, rank protein docking poses, and predict PPI binding affinities. Results Here, we present the SPServer, a web server that applies the Split-Statistical Potentials to analyze protein folds and protein interfaces. SPServer provides global scores as well as residue/residue-pair profiles presented as score plots and maps. This level of detail allows users to: (1) identify potentially problematic regions on protein structures; (2) identify disrupting amino acid pairs in protein interfaces; and (3) compare and analyze the quality of tertiary and quaternary structural models. Conclusions While there are many web servers that provide scoring functions to assess the quality of either protein folds or PPI structures, SPServer integrates both aspects in a unique easy-to-use web server. Moreover, the server permits to locally assess the quality of the structures and interfaces at a residue level and provides tools to compare the local assessment between structures. Server address https://sbi.upf.edu/spserver/.


2020 ◽  
Vol 117 (46) ◽  
pp. 28795-28805
Author(s):  
Suman Das ◽  
Yi-Hsuan Lin ◽  
Robert M. Vernon ◽  
Julie D. Forman-Kay ◽  
Hue Sun Chan

Endeavoring toward a transferable, predictive coarse-grained explicit-chain model for biomolecular condensates underlain by liquid–liquid phase separation (LLPS) of proteins, we conducted multiple-chain simulations of the N-terminal intrinsically disordered region (IDR) of DEAD-box helicase Ddx4, as a test case, to assess roles of electrostatic, hydrophobic, cation–π, and aromatic interactions in amino acid sequence-dependent LLPS. We evaluated three different residue–residue interaction schemes with a shared electrostatic potential. Neither a common hydrophobicity scheme nor one augmented with arginine/lysine-aromatic cation–π interactions consistently accounted for available experimental LLPS data on the wild-type, a charge-scrambled, a phenylalanine-to-alanine (FtoA), and an arginine-to-lysine (RtoK) mutant of Ddx4 IDR. In contrast, interactions based on contact statistics among folded globular protein structures reproduce the overall experimental trend, including that the RtoK mutant has a much diminished LLPS propensity. Consistency between simulation and experiment was also found for RtoK mutants of P-granule protein LAF-1, underscoring that, to a degree, important LLPS-driving π-related interactions are embodied in classical statistical potentials. Further elucidation is necessary, however, especially of phenylalanine’s role in condensate assembly because experiments on FtoA and tyrosine-to-phenylalanine mutants suggest that LLPS-driving phenylalanine interactions are significantly weaker than posited by common statistical potentials. Protein–protein electrostatic interactions are modulated by relative permittivity, which in general depends on aqueous protein concentration. Analytical theory suggests that this dependence entails enhanced interprotein interactions in the condensed phase but more favorable protein–solvent interactions in the dilute phase. The opposing trends lead to only a modest overall impact on LLPS.


2020 ◽  
Author(s):  
Ya-Lan Tan ◽  
Chen-Jie Feng ◽  
Xunxun Wang ◽  
Wenbing Zhang ◽  
Zhi-Jie Tan

Author(s):  
Guillaume Launay ◽  
Masahito Ohue ◽  
Julia Prieto Santero ◽  
Yuri Matsuzaki ◽  
Cécile Hilpert ◽  
...  

ABSTRACTScoring is a challenging step in protein-protein docking, where typically thousands of solutions are generated. Successful scoring is more often based on physicochemical evaluation of the generated interfaces and/or statistical potentials that reproduce known interface properties. Another route is offered by consensus-based rescoring, where the set of solutions is used to build statistics in order to identify recurrent solutions. We explore several ways to perform consensus-based rescoring on the ZDOCK decoy set for Benchmark 4. We show that the information of the interface size is critical for successful rescoring. We combine consensus-based rescoring with the ZDOCK native scoring function and show that this improves the initial results.


Sign in / Sign up

Export Citation Format

Share Document