scholarly journals Probing Protein Allostery as a Residue-specific Concept via Residue Perturbation Maps

2018 ◽  
Author(s):  
Hamed S Hayatshahi ◽  
Emilio Ahuactzin ◽  
Peng Tao ◽  
Shouyi Wang ◽  
Jin Liu

AbstractAllosteric regulation is a well-established phenomenon classically defined as conformational or dynamical change of a small number of allosteric residues of the protein upon allosteric effector binding at a distance. Here, we developed a novel approach to delineate allosteric effects in proteins. In this approach, we applied robust machine learning methods, including Deep Neural Network and Random Forest, on extensive molecular dynamics (MD) simulations to distinguish otherwise similar allosteric states of proteins. Using PDZ3 domain of PDS-95 as a model protein, we demonstrated that the allosteric effects could be represented as residue-specific properties through two-dimensional property-residue maps, which we refer as “residue perturbation maps”. These maps were constructed through two machine learning methods and could accurately describe how different properties of various residues are affected upon allosteric perturbation on protein. Based on the “residue perturbation maps”, we propose allostery as a residue-specific concept, suggesting all residues could be considered as allosteric residues because each residue “senses” the allosteric events through perturbation of its one or multiple attributes in a quantitatively unique way. The “residue perturbation maps” could be used to fingerprint a protein based on the unique patterns of residue perturbations upon binding events, providing a novel way to systematically describe the protein allosteric effects of each residue upon perturbation.Author SummaryAllostery is protein regulation at distance. A perturbation at one site of the protein could distantly affect another site. The residues involved in these sites are considered as allosteric residues. The allostery concept has been widely used to understand protein mechanisms and to design allosteric drugs. It is long believed only a small number of residues are allosteric residues. Here, we argue that all residues in a protein are allosteric residues. Upon the perturbation of the allosteric events, the different properties of each residue are affected at the distinct extend. We used hybrid models including molecular dynamics simulations and machine learning components to reveal that not only many properties of residues are affected upon ligand binding, but also each residue is affected through perturbation of its various properties, which makes the residue distinguishable from other residues. According to our findings in a model protein, we defined a “residue perturbation map” as a two-dimensional map that fingerprint a protein based on the extent of perturbation in different properties of all its residues in a quantitative fashion. This “residue perturbation map” provides a novel way to systematically describe the protein allosteric effects of each residue upon perturbation.

Author(s):  
V P Gromov ◽  
L I Lebedev ◽  
V E Turlapov

The development of the nominal sequence of steps for analyzing the HSI proposed by Landgrebe, which is necessary in the context of the appearance of reference signature libraries for environmental monitoring, is discussed. The approach is based on considering the HSI pixel as a signature that stores all spectral features of an object and its states, and the HSI as a whole - as a two-dimensional signature field. As a first step of the analysis, a procedure is proposed for detecting a linear dependence of signatures by the magnitude of the Pearson correlation coefficient. The main apparatus of analysis, as in Landgrebe sequence, is the method of principal component analysis, but it is no longer used to build classes and is applied to investigate the presence in the class of subclasses essential for the applied area. The experimental material includes such objects as water, swamps, soil, vegetation, concrete, pollution. Selection of object samples on the image is made by the user. From the studied images of HSI objects, a base of reference signatures for classes (subclasses) of objects is formed, which in turn can be used to automate HSI markup with the aim of applying machine learning methods to recognize HSI objects and their states.


Author(s):  
Michela Taufer ◽  
Trilce Estrada ◽  
Travis Johnston

This paper presents the survey of three algorithms to transform atomic-level molecular snapshots from molecular dynamics (MD) simulations into metadata representations that are suitable for in situ analytics based on machine learning methods. MD simulations studying the classical time evolution of a molecular system at atomic resolution are widely recognized in the fields of chemistry, material sciences, molecular biology and drug design; these simulations are one of the most common simulations on supercomputers. Next-generation supercomputers will have a dramatically higher performance than current systems, generating more data that needs to be analysed (e.g. in terms of number and length of MD trajectories). In the future, the coordination of data generation and analysis can no longer rely on manual, centralized analysis traditionally performed after the simulation is completed or on current data representations that have been defined for traditional visualization tools. Powerful data preparation phases (i.e. phases in which original row data is transformed to concise and still meaningful representations) will need to proceed data analysis phases. Here, we discuss three algorithms for transforming traditionally used molecular representations into concise and meaningful metadata representations. The transformations can be performed locally. The new metadata can be fed into machine learning methods for runtime in situ analysis of larger MD trajectories supported by high-performance computing. In this paper, we provide an overview of the three algorithms and their use for three different applications: protein–ligand docking in drug design; protein folding simulations; and protein engineering based on analytics of protein functions depending on proteins' three-dimensional structures. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.


Sign in / Sign up

Export Citation Format

Share Document