Application of the maximum entropy principle to determine ensembles of intrinsically disordered proteins from residual dipolar couplings

M. Sanchez-Martinez; R. Crehuet

doi:10.1039/c4cp03114h

Measurement and Analysis of NMR Residual Dipolar Couplings for the Study of Intrinsically Disordered Proteins

Methods in Molecular Biology - Intrinsically Disordered Protein Analysis ◽

10.1007/978-1-61779-927-3_9 ◽

2012 ◽

pp. 115-125 ◽

Cited By ~ 8

Author(s):

Loïc Salmon ◽

Malene Ringkjøbing Jensen ◽

Pau Bernadó ◽

Martin Blackledge

Keyword(s):

Residual Dipolar Couplings ◽

Intrinsically Disordered Proteins ◽

Dipolar Couplings ◽

Disordered Proteins ◽

Intrinsically Disordered ◽

Measurement And Analysis ◽

NMR Residual Dipolar Couplings


Studying Partially Folded and Intrinsically Disordered Proteins Using NMR Residual Dipolar Couplings

Protein NMR Spectroscopy: Practical Techniques and Applications ◽

10.1002/9781119972006.ch9 ◽

2011 ◽

pp. 319-345 ◽

Cited By ~ 1

Author(s):

Malene Ringkjøbing Jensen ◽

Loïc Salmon ◽

Gabrielle Nodet ◽

Phineus Markwick ◽

Pau Bernadó ◽

...

Keyword(s):

Residual Dipolar Couplings ◽

Intrinsically Disordered Proteins ◽

Dipolar Couplings ◽

Disordered Proteins ◽

Intrinsically Disordered ◽

NMR Residual Dipolar Couplings


Residual dipolar couplings measured in unfolded proteins are sensitive to amino-acid-specific geometries as well as local conformational sampling

Biochemical Society Transactions ◽

10.1042/bst20120187 ◽

2012 ◽

Vol 40 (5) ◽

pp. 989-994 ◽

Cited By ~ 7

Author(s):

Jie-rong Huang ◽

Martin Gentner ◽

Navratna Vajpai ◽

Stephan Grzesiek ◽

Martin Blackledge

Keyword(s):

Amino Acid ◽

Residual Dipolar Couplings ◽

Intrinsically Disordered Proteins ◽

Dipolar Couplings ◽

Disordered Proteins ◽

Conformational Sampling ◽

Intrinsically Disordered ◽

Potential Energy Landscape ◽

Order Of Magnitude ◽

Folded Structures

Many functional proteins do not have well defined folded structures. In recent years, both experimental and computational approaches have been developed to study the conformational behaviour of this type of protein. It has been shown previously that experimental RDCs (residual dipolar couplings) can be used to study the backbone sampling of disordered proteins in some detail. In these studies, the backbone structure was modelled using a common geometry for all amino acids. In the present paper, we demonstrate that experimental RDCs are also sensitive to the specific geometry of each amino acid as defined by energy-minimized internal co-ordinates. We have modified the FM (flexible-Meccano) algorithm that constructs conformational ensembles on the basis of a statistical coil model, to account for these differences. The modified algorithm inherits the advantages of the FM algorithm to efficiently sample the potential energy landscape for coil conformations. The specific geometries incorporated in the new algorithm result in a better reproduction of experimental RDCs and are generally applicable for further studies to characterize the conformational properties of intrinsically disordered proteins. In addition, the internal-co-ordinate-based algorithm is an order of magnitude more efficient, and facilitates side-chain construction, surface osmolyte simulation, spin-label distribution sampling and proline cis/trans isomer simulation.


Quantitative Determination of the Conformational Properties of Partially Folded and Intrinsically Disordered Proteins Using NMR Dipolar Couplings

Structure ◽

10.1016/j.str.2009.08.001 ◽

2009 ◽

Vol 17 (9) ◽

pp. 1169-1185 ◽

Cited By ~ 127

Author(s):

Malene Ringkjøbing Jensen ◽

Phineus R.L. Markwick ◽

Sebastian Meier ◽

Christian Griesinger ◽

Markus Zweckstetter ◽

...

Keyword(s):

Quantitative Determination ◽

Intrinsically Disordered Proteins ◽

Dipolar Couplings ◽

Disordered Proteins ◽

Intrinsically Disordered ◽

Conformational Properties


Co-Evolution of Intrinsically Disordered Proteins with Folded Partners Witnessed by Evolutionary Couplings

International Journal of Molecular Sciences ◽

10.3390/ijms19113315 ◽

2018 ◽

Vol 19 (11) ◽

pp. 3315 ◽

Cited By ~ 10

Author(s):

Rita Pancsa ◽

Fruzsina Zsolyomi ◽

Peter Tompa

Keyword(s):

Large Scale ◽

Intrinsically Disordered Proteins ◽

Protein Structures ◽

Disordered Proteins ◽

Cellular Interaction ◽

Structural Constraints ◽

Protein Residues ◽

Intrinsically Disordered ◽

Evolutionary Changes ◽

Folded Proteins

Although improved strategies for the detection and analysis of evolutionary couplings (ECs) between protein residues already enable the prediction of protein structures and interactions, they are mostly restricted to conserved and well-folded proteins. Whereas intrinsically disordered proteins (IDPs) are central to cellular interaction networks, due to the lack of strict structural constraints, they undergo faster evolutionary changes than folded domains. This makes the reliable identification and alignment of IDP homologs difficult, which has led to IDPs being omitted from most large-scale residue co-variation analyses. By performing a dedicated analysis of phylogenetically widespread bacterial IDP–partner interactions, here we demonstrate that partner binding imposes constraints on IDP sequences that manifest in detectable interprotein ECs. These ECs were not detected for interactions mediated by short motifs, but rather for those with larger IDP–partner interfaces. Most identified coupled residue pairs reside close (<10 Å) to each other on the interface, with a third of them forming multiple direct atomic contacts. EC-carrying interfaces of IDPs are enriched in negatively charged residues, and the EC residues of both IDPs and partners preferentially reside in helices. Our analysis brings hope that IDP–partner interactions that are difficult to study could soon be successfully dissected through residue co-variation analysis.


SAXS-Restrained Ensemble Simulations of Intrinsically Disordered Proteins with Commitment to the Principle of Maximum Entropy

Journal of Chemical Theory and Computation ◽

10.1021/acs.jctc.9b00338 ◽

2019 ◽

Vol 15 (9) ◽

pp. 5103-5115 ◽

Cited By ~ 12

Author(s):

Markus R. Hermann ◽

Jochen S. Hub

Keyword(s):

Maximum Entropy ◽

Intrinsically Disordered Proteins ◽

Disordered Proteins ◽

Intrinsically Disordered ◽

Principle Of Maximum Entropy ◽

Ensemble Simulations ◽

Principle Of Maximum


CheSPI: Chemical shift Secondary structure Population Inference

10.1101/2021.02.20.432095 ◽

2021 ◽

Author(s):

Jakob Toudahl Nielsen ◽

Frans A.A. Mulder

Keyword(s):

Intrinsically Disordered Proteins ◽

Chemical Shifts ◽

Protein Structures ◽

Random Coil ◽

Disordered Proteins ◽

Residual Structure ◽

Intrinsically Disordered ◽

Population Inference ◽

Local Protein Structure ◽

Structural Classes

NMR chemical shifts (CSs) are delicate reporters of local protein structure, and recent advances in random coil CS (RCCS) prediction and interpretation now offer the compelling prospect of inferring small populations of structure from small deviations from RCCSs. Here, we present CheSPI, a simple and efficient method that provides unbiased and sensitive aggregate measures of local structure and disorder. It is demonstrated that CheSPI can predict even very small amounts of residual structure and robustly delineate subtle differences into four structural classes for intrinsically disordered proteins. For structured regions and proteins, CheSPI can assign up to eight structural classes, which coincide with the well-known DSSP classification. The program is freely available, and can either be used as a web implementation at www.protein-nmr.org or run locally from the command line as a Python program. CheSPI generates comprehensive numeric and graphical output for intuitive annotation and visualization of protein structures. A number of examples are provided.


Distance-based metrics for comparing conformational ensembles of intrinsically disordered proteins

10.1101/2020.04.06.027979 ◽

2020 ◽

Author(s):

Tamas Lazar ◽

Mainak Guharoy ◽

Wim Vranken ◽

Sarah Rauscher ◽

Shoshana J. Wodak ◽

...

Keyword(s):

Root Mean Square ◽

Root Mean Square Deviation ◽

Intrinsically Disordered Proteins ◽

Statistical Significance ◽

Protein Structures ◽

Disordered Proteins ◽

Mean Square ◽

Mean Square Deviation ◽

Conformational Ensembles ◽

Intrinsically Disordered

Intrinsically disordered proteins (IDPs) are proteins whose native functional states represent ensembles of highly diverse conformations. Such ensembles are a challenge for quantitative structure comparisons as their conformational diversity precludes optimal superimposition of the atomic coordinates, necessary for deriving common similarity measures such as the root-mean-square deviation (RMSD) of these coordinates. Here we introduce superimposition-free metrics, which are based on computing matrices of Cα-Cα distance distributions within ensembles and comparing these matrices between ensembles. Differences between two matrices yield information on the similarity between specific regions of the polypeptide, whereas the global structural similarity is captured by the ens_dRMS, defined as the root-mean-square difference between the medians of the Cα-Cα distance distributions of two ensembles. Together, our metrics enable rigorous investigations of structure-function relationships in conformational ensembles of IDPs derived using experimental restraints or by molecular simulations, and for proteins containing both structured and disordered regions.

Statement of Significance: Important biological insight is obtained from comparing the high-resolution structures of proteins. Such comparisons commonly involve superimposing two protein structures and computing the residual root-mean-square deviation of the atomic positions. This approach cannot be applied to intrinsically disordered proteins (IDPs) because IDPs do not adopt well-defined 3D structures; rather, their native functional state is defined by ensembles of heterogeneous conformations that cannot be meaningfully superimposed. We report new measures that quantify the local and global similarity between different conformational ensembles by evaluating differences between the distributions of residue-residue distances and their statistical significance. Applying these measures to IDP ensembles and to a protein containing both structured and intrinsically disordered domains provides deeper insights into how structural features relate to function.
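
The ens_dRMS definition in the abstract above (the root-mean-square difference between the medians of the Cα-Cα distance distributions of two ensembles) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the representation of a conformer as a list of (x, y, z) Cα coordinates, and the averaging over all residue pairs i < j are assumptions made for illustration.

```python
import math
from itertools import combinations
from statistics import median


def pairwise_distances(conformer):
    """Return {(i, j): distance} for all residue pairs i < j of one conformer.

    `conformer` is a list of (x, y, z) C-alpha coordinates (assumed layout).
    """
    return {
        (i, j): math.dist(conformer[i], conformer[j])
        for i, j in combinations(range(len(conformer)), 2)
    }


def median_distance_matrix(ensemble):
    """Median C-alpha distance for each residue pair across all conformers."""
    per_conformer = [pairwise_distances(c) for c in ensemble]
    pairs = per_conformer[0].keys()
    return {p: median(d[p] for d in per_conformer) for p in pairs}


def ens_drms(ensemble_a, ensemble_b):
    """Root-mean-square difference between the two median distance matrices.

    Averaging over all i < j pairs is an assumption of this sketch.
    """
    med_a = median_distance_matrix(ensemble_a)
    med_b = median_distance_matrix(ensemble_b)
    if med_a.keys() != med_b.keys():
        raise ValueError("ensembles must describe chains of equal length")
    squared = [(med_a[p] - med_b[p]) ** 2 for p in med_a]
    return math.sqrt(sum(squared) / len(squared))


# Toy example: two single-conformer "ensembles" of a 3-residue chain,
# the second one stretched by a factor of two along x.
compact = [[(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]]
extended = [[(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (4.0, 0.0, 0.0)]]
print(round(ens_drms(compact, extended), 3))  # 1.414
```

Because only intra-ensemble distances enter the calculation, no superimposition of conformers is needed, which is the property the abstract emphasises.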


Comparative roles of charge, π, and hydrophobic interactions in sequence-dependent phase separation of intrinsically disordered proteins

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2008122117 ◽

2020 ◽

Vol 117 (46) ◽

pp. 28795-28805

Author(s):

Suman Das ◽

Yi-Hsuan Lin ◽

Robert M. Vernon ◽

Julie D. Forman-Kay ◽

Hue Sun Chan

Keyword(s):

Phase Separation ◽

Electrostatic Interactions ◽

Intrinsically Disordered Proteins ◽

Protein Structures ◽

Coarse Grained ◽

Disordered Proteins ◽

Chain Model ◽

Statistical Potentials ◽

Intrinsically Disordered ◽

Sequence Dependent

Endeavoring toward a transferable, predictive coarse-grained explicit-chain model for biomolecular condensates underlain by liquid–liquid phase separation (LLPS) of proteins, we conducted multiple-chain simulations of the N-terminal intrinsically disordered region (IDR) of DEAD-box helicase Ddx4, as a test case, to assess roles of electrostatic, hydrophobic, cation–π, and aromatic interactions in amino acid sequence-dependent LLPS. We evaluated three different residue–residue interaction schemes with a shared electrostatic potential. Neither a common hydrophobicity scheme nor one augmented with arginine/lysine-aromatic cation–π interactions consistently accounted for available experimental LLPS data on the wild-type, a charge-scrambled, a phenylalanine-to-alanine (FtoA), and an arginine-to-lysine (RtoK) mutant of Ddx4 IDR. In contrast, interactions based on contact statistics among folded globular protein structures reproduce the overall experimental trend, including that the RtoK mutant has a much diminished LLPS propensity. Consistency between simulation and experiment was also found for RtoK mutants of P-granule protein LAF-1, underscoring that, to a degree, important LLPS-driving π-related interactions are embodied in classical statistical potentials. Further elucidation is necessary, however, especially of phenylalanine’s role in condensate assembly because experiments on FtoA and tyrosine-to-phenylalanine mutants suggest that LLPS-driving phenylalanine interactions are significantly weaker than posited by common statistical potentials. Protein–protein electrostatic interactions are modulated by relative permittivity, which in general depends on aqueous protein concentration. Analytical theory suggests that this dependence entails enhanced interprotein interactions in the condensed phase but more favorable protein–solvent interactions in the dilute phase. The opposing trends lead to only a modest overall impact on LLPS.


RFPR-IDP: reduce the false positive rates for intrinsically disordered protein and region prediction by incorporating both fully ordered proteins and disordered proteins

Briefings in Bioinformatics ◽

10.1093/bib/bbaa018 ◽

2020 ◽

Author(s):

Yumeng Liu ◽

Xiaolong Wang ◽

Bin Liu

Keyword(s):

Intrinsically Disordered Proteins ◽

Short Term Memory ◽

Protein Structures ◽

Intrinsically Disordered Protein ◽

Disordered Proteins ◽

Test Dataset ◽

Intrinsically Disordered ◽

Disordered Protein ◽

Real World Applications ◽

Long Short Term Memory

As an important type of protein, intrinsically disordered proteins/regions (IDPs/IDRs) are related to many crucial biological functions. Accurate prediction of IDPs/IDRs is beneficial to the prediction of protein structures and functions. Most of the existing methods ignore fully ordered proteins without IDRs during training and testing. As a result, the corresponding predictors tend to predict fully ordered proteins as disordered proteins. Unfortunately, these methods were only evaluated on datasets consisting of disordered proteins with few or no fully ordered proteins, and therefore this problem has escaped the attention of researchers. However, most newly sequenced proteins are fully ordered in nature, so these predictors fail to accurately distinguish ordered from disordered proteins in real-world applications. In this regard, we propose a new method called RFPR-IDP, trained with both fully ordered proteins and disordered proteins, which is based on the combination of a convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM). The experimental results show that although the existing predictors perform well for predicting disordered proteins, they tend to predict fully ordered proteins as disordered proteins. In contrast, the RFPR-IDP predictor can correctly predict fully ordered proteins and outperforms 10 other state-of-the-art methods when evaluated on a test dataset with both fully ordered proteins and disordered proteins. The web server and datasets of RFPR-IDP are freely available at http://bliulab.net/RFPR-IDP/server.
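
To make the quantity in the title concrete, the following minimal sketch computes a false positive rate for disorder prediction. It is not taken from RFPR-IDP. The per-residue binary labelling (1 = disordered, 0 = ordered) and the function name are assumptions for illustration, and the paper's own evaluation protocol may differ.

```python
def false_positive_rate(true_labels, predicted_labels):
    """Return FP / (FP + TN), treating 1 (disordered) as the positive class.

    Both arguments are equal-length sequences of 0/1 labels, one per residue
    (an assumed encoding, not taken from the paper).
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    pairs = list(zip(true_labels, predicted_labels))
    false_pos = sum(1 for t, p in pairs if t == 0 and p == 1)
    true_neg = sum(1 for t, p in pairs if t == 0 and p == 0)
    if false_pos + true_neg == 0:
        raise ValueError("no ordered residues: the rate is undefined")
    return false_pos / (false_pos + true_neg)


# A fully ordered 10-residue protein for which a hypothetical predictor
# flags three residues as disordered.
truth = [0] * 10
prediction = [0, 1, 0, 0, 1, 0, 0, 0, 1, 0]
print(false_positive_rate(truth, prediction))  # 0.3
```

On a test set that contains no ordered residues the rate is undefined, which is one way to see why datasets without fully ordered proteins can hide the problem the abstract describes.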
