Identification of .beta.-sheet motifs, of .psi.-loops, and of patterns of amino acid residues in three-dimensional protein structures using a subgraph-isomorphism algorithm

Peter J. Artymiuk; Helen M. Grindley; Andrew R. Poirrette; David W. Rice; Elizabeth C. Ujah; Peter Willett

doi:10.1021/ci00017a007

COUPLED GENETIC ALGORITHM/KOHONEN NEURAL NETWORK (GANN) FOR PROJECTION OF THREE-DIMENSIONAL PROTEIN STRUCTURES ONTO THE PLANE

Journal of Theoretical and Computational Chemistry ◽

10.1142/s0219633602000051 ◽

2002 ◽

Vol 01 (01) ◽

pp. 45-52 ◽

Cited By ~ 1

Author(s):

KUTAY O. ALPER ◽

CAREY K. BAGDASSARIAN

Keyword(s):

Neural Network ◽

Genetic Algorithm ◽

Amino Acid ◽

Fitness Function ◽

Amino Acid Level ◽

Protein Structures ◽

Three Dimensional ◽

Spatial Relations ◽

Amino Acid Residues ◽

Kohonen Neural Network

An algorithm is presented for projecting — at the amino acid level — the three-dimensional crystal structure of a protein molecule onto a planar surface. The scheme is topologically consistent: if two amino acid residues are closely juxtaposed in three-dimensional space, they remain so upon projection. Through such projections, a single resulting picture captures the spatial relations amongst a protein molecule's amino acids. Operationally, a genetic algorithm is used to "evolve" a parameter set which serves as input for a self-organizing Kohonen neural network responsible for the projection itself. A fitness function characterizing the quality of the projections is defined and maximized via the genetic algorithm. The workings of both the genetic algorithm and neural network are discussed in detail. In this work, we seek to optimize projections resulting from the inherently "frustrated" task of collapsing a space-filling collection of amino acid residues onto a simpler surface. Ultimately, the chosen application is a testing ground for establishing the success of our coupled genetic algorithm/Kohonen neural network scheme which can easily be adapted for other uses.

Download Full-text

Structure of Looped Regions in β-α- and α-β-Arches in Abcd-Units of Globular Proteins

Математическая биология и биоинформатика ◽

10.17537/2016.11.159 ◽

2016 ◽

Vol 11 (2) ◽

pp. 159-169

Author(s):

Е.В. Бражников ◽

E.V. Brazhnikov

Keyword(s):

Amino Acid ◽

De Novo ◽

Protein Structures ◽

Three Dimensional ◽

Structural Motif ◽

Dimensional Structure ◽

Amino Acid Residues ◽

Homologous Proteins ◽

Reverse Turn ◽

Structure Of Proteins

Conformations of about 600 looped regions (loops) in β-α- and α-β-arches of a structural motif occurring in the abCd-unit of proteins were analyzed. On the whole, 258 abCd-units with a reverse turn of the polypeptide chain (236 PDB files) and 69 abCd-units with a direct turn (65 PDB files) were selected in non-homologous proteins. Four types of arches were studied: β-α- and α-β-ones at a direct turn of the chain; β-α- and α-β-ones at a reverse turn of the chain. For each type of arches, frequencies of loops occurrence of different lengths were determined and corresponding histograms were plotted. It was found that abCd-units with loops up to three amino acid residues long occur most frequently (57 %). In β-α-arches with a direct turn of the chain, loops consisting of two amino acid residues occur most often (44 %) and in 86% cases they have the βmαβαn - conformation. They have no Gly and Pro residues, and in position β there is an Asn residue. In such type of arches, the loops of one residue (βmεαn- or βmαLαn- conformation) contain the Gly residue most frequently. α-β-Arches with a direct turn of the chain have most commonly (18 %) loops of four amino acid residues. In this case, there is no predominant conformation of the loops. In β-α-arches with a reverse turn of the chain, most common are loops of seven amino acid residues (17%), and most part of them (88 %) have the βmαLββααββαn - conformation. α-β-Arches with a reverse turn of the chain contain most frequently (32%) loops of one amino acid residue (all Gly ones) with arch conformations αmεβn or αmαLβn. The above structural analysis of the abCd-unit has useful information for prediction of the three-dimensional structure of proteins and for molecular simulation of the de novo design of protein structures.

Download Full-text

Statistical Force-Field for Structural Modeling Using Chemical Cross-Linking/mass Spectrometry Distance Constraints

10.26434/chemrxiv.6030563 ◽

2018 ◽

Author(s):

Allan J. R. Ferrari ◽

Fabio C. Gozzo ◽

Leandro Martinez

Keyword(s):

Mass Spectrometry ◽

Amino Acid ◽

Force Field ◽

Protein Structures ◽

Protein Structure Determination ◽

Structural Modeling ◽

Cross Linking ◽

Amino Acid Residues ◽

Distance Constraints ◽

Chemical Cross Linking

<div><p>Chemical cross-linking/Mass Spectrometry (XLMS) is an experimental method to obtain distance constraints between amino acid residues, which can be applied to structural modeling of tertiary and quaternary biomolecular structures. These constraints provide, in principle, only upper limits to the distance between amino acid residues along the surface of the biomolecule. In practice, attempts to use of XLMS constraints for tertiary protein structure determination have not been widely successful. This indicates the need of specifically designed strategies for the representation of these constraints within modeling algorithms. Here, a force-field designed to represent XLMS-derived constraints is proposed. The potential energy functions are obtained by computing, in the database of known protein structures, the probability of satisfaction of a topological cross-linking distance as a function of the Euclidean distance between amino acid residues. The force-field can be easily incorporated into current modeling methods and software. In this work, the force-field was implemented within the Rosetta ab initio relax protocol. We show a significant improvement in the quality of the models obtained relative to current strategies for constraint representation. This force-field contributes to the long-desired goal of obtaining the tertiary structures of proteins using XLMS data. Force-field parameters and usage instructions are freely available at http://m3g.iqm.unicamp.br/topolink/xlff <br></p></div><p></p><p></p>

Download Full-text

Cloning, post-translational modifications, heterologous expression and ligand-binding of boar salivary lipocalin

Biochemical Journal ◽

10.1042/bj3500369 ◽

2000 ◽

Vol 350 (2) ◽

pp. 369-379 ◽

Cited By ~ 22

Author(s):

Dietrich LOEBEL ◽

Andrea SCALONI ◽

Sara PAOLINI ◽

Carlo FINI ◽

Lino FERRARA ◽

...

Keyword(s):

Amino Acid ◽

Sequence Similarity ◽

Three Dimensional ◽

Protein Isoforms ◽

Cysteine Residues ◽

Protein Chemistry ◽

Single Individual ◽

Amino Acid Residues ◽

Three Dimensional Model ◽

Submaxillary Glands

Boar submaxillary glands produce the sex-specific salivary lipocalin (SAL), which binds steroidal sex pheromones as endogenous ligands. The cDNA encoding SAL was cloned and sequenced. From a single individual, two protein isoforms, differing in three amino acid residues, were purified and structurally characterized by a combined Edman degradation/MS approach. These experiments ascertained that the mature polypeptide is composed of 168 amino acid residues, that one of the three putative glycosylation sites is post-translationally modified and the structure of the bound glycosidic moieties. Two of the cysteine residues are paired together in a disulphide bridge, whereas the remaining two occur as free thiols. SAL bears sequence similarity to other lipocalins; on this basis, a three-dimensional model of the protein has been built. A SAL isoform was expressed in Escherichiacoli in good yields. Protein chemistry and CD experiments verified that the recombinant product shows the same redox state at the cysteine residues and that the same conformation is observed as in the natural protein, thus suggesting similar folding. Binding experiments on natural and recombinant SAL were performed with the fluorescent probe 1-aminoanthracene, which was efficiently displaced by the steroidal sex pheromone, as well as by several odorants.

Download Full-text

Identifying Functional Relationships Via the Annotation and Comparison of Three-Dimensional Amino Acid Arrangements in Protein Structures

Encyclopedia of Bioinformatics and Computational Biology ◽

10.1016/b978-0-12-809633-8.20149-0 ◽

2019 ◽

pp. 620-630

Author(s):

Mohd Firdaus-Raih ◽

Nur Syatila Abdul Ghani

Keyword(s):

Amino Acid ◽

Protein Structures ◽

Three Dimensional ◽

Functional Relationships

Download Full-text

Characteristics of Two Forms of α-Amylases and Structural Implication

Applied and Environmental Microbiology ◽

10.1128/aem.65.10.4652-4658.1999 ◽

1999 ◽

Vol 65 (10) ◽

pp. 4652-4658 ◽

Cited By ~ 35

Author(s):

Kohji Ohdan ◽

Takashi Kuriki ◽

Hiroki Kaneko ◽

Jiro Shimada ◽

Toshikazu Takada ◽

...

Keyword(s):

Amino Acid ◽

Three Dimensional ◽

Amino Acid Sequences ◽

Dimensional Structure ◽

Amino Acid Residues ◽

Amylase Gene ◽

Raw Starch ◽

Raw Starch Binding ◽

Terminal Amino ◽

Carboxyl Terminal

ABSTRACT Complete (Ba-L) and truncated (Ba-S) forms of α-amylases fromBacillus subtilis X-23 were purified, and the amino- and carboxyl-terminal amino acid sequences of Ba-L and Ba-S were determined. The amino acid sequence deduced from the nucleotide sequence of the α-amylase gene indicated that Ba-S was produced from Ba-L by truncation of the 186 amino acid residues at the carboxyl-terminal region. The results of genomic Southern analysis and Western analysis suggested that the two enzymes originated from the same α-amylase gene and that truncation of Ba-L to Ba-S occurred during the cultivation of B. subtilis X-23 cells. Although the primary structure of Ba-S was approximately 28% shorter than that of Ba-L, the two enzyme forms had the same enzymatic characteristics (molar catalytic activity, amylolytic pattern, transglycosylation ability, effect of pH on stability and activity, optimum temperature, and raw starch-binding ability), except that the thermal stability of Ba-S was higher than that of Ba-L. An analysis of the secondary structure as well as the predicted three-dimensional structure of Ba-S showed that Ba-S retained all of the necessary domains (domains A, B, and C) which were most likely to be required for functionality as α-amylase.

Download Full-text

Insights into the quaternary association of proteins through structure graphs: a case study of lectins

Biochemical Journal ◽

10.1042/bj20050434 ◽

2005 ◽

Vol 391 (1) ◽

pp. 1-15 ◽

Cited By ~ 43

Author(s):

K. V. Brinda ◽

Avadhesha Surolia ◽

Sarawathi Vishveshwara

Keyword(s):

Amino Acid ◽

Protein Structures ◽

Present Review ◽

Dimensional Structure ◽

Protein Oligomerization ◽

Amino Acid Residues ◽

Sequence Motifs ◽

Structural Determinants ◽

Legume Lectin ◽

Quaternary Structures

The unique three-dimensional structure of both monomeric and oligomeric proteins is encoded in their sequence. The biological functions of proteins are dependent on their tertiary and quaternary structures, and hence it is important to understand the determinants of quaternary association in proteins. Although a large number of investigations have been carried out in this direction, the underlying principles of protein oligomerization are yet to be completely understood. Recently, new insights into this problem have been gained from the analysis of structure graphs of proteins belonging to the legume lectin family. The legume lectins are an interesting family of proteins with very similar tertiary structures but varied quaternary structures. Hence they have become a very good model with which to analyse the role of primary structures in determining the modes of quaternary association. The present review summarizes the results of a legume lectin study as well as those obtained from a similar analysis carried out here on the animal lectins, namely galectins, pentraxins, calnexin, calreticulin and rhesus rotavirus Vp4 sialic-acid-binding domain. The lectin structure graphs have been used to obtain clusters of non-covalently interacting amino acid residues at the intersubunit interfaces. The present study, performed along with traditional sequence alignment methods, has provided the signature sequence motifs for different kinds of quaternary association seen in lectins. Furthermore, the network representation of the lectin oligomers has enabled us to detect the residues which make extensive interactions (‘hubs’) across the oligomeric interfaces that can be targetted for interface-destabilizing mutations. The present review also provides an overview of the methodology involved in representing oligomeric protein structures as connected networks of amino acid residues. Further, it illustrates the potential of such a representation in elucidating the structural determinants of protein–protein association in general and will be of significance to protein chemists and structural biologists.

Download Full-text

Location of amino acid residues important for cobra venom factor function mapped on the three-dimensional structure of complement components C3 and C3c

Molecular Immunology ◽

10.1016/j.molimm.2006.07.062 ◽

2007 ◽

Vol 44 (1-3) ◽

pp. 172 ◽

Cited By ~ 2

Author(s):

David C. Fritzinger ◽

Brian E. Hew ◽

Bert J.C. Janssen ◽

Piet Gros ◽

Carl-Wilhelm Vogel

Keyword(s):

Amino Acid ◽

Three Dimensional ◽

Cobra Venom ◽

Dimensional Structure ◽

Amino Acid Residues ◽

Complement Components ◽

Cobra Venom Factor ◽

Three Dimensional Structure ◽

Factor Function

Download Full-text

Efficacy of Dideoxynucleosides against Human Foamy Virus and Relationship to Its Reverse Transcriptase Amino Acid Sequence and Structure

Journal of Virology ◽

10.1128/jvi.75.15.7184-7187.2001 ◽

2001 ◽

Vol 75 (15) ◽

pp. 7184-7187 ◽

Cited By ~ 7

Author(s):

Anne Yvon-Groussin ◽

Pierre Mugnier ◽

Philippe Bertin ◽

Marc Grandadam ◽

Henri Agut ◽

...

Keyword(s):

Amino Acid ◽

Reverse Transcriptase ◽

Foamy Virus ◽

Three Dimensional ◽

Retroviral Vectors ◽

Dimensional Structure ◽

Human Foamy Virus ◽

Amino Acid Residues ◽

Immunodeficiency Virus ◽

Hiv 1

ABSTRACT Human foamy virus (HFV), a retrovirus of simian origin which occasionally infects humans, is the basis of retroviral vectors in development for gene therapy. Clinical considerations of how to treat patients developing an uncontrolled infection by either HFV or HFV-based vectors need to be raised. We determined the susceptibility of the HFV to dideoxynucleosides and found that only zidovudine was equally efficient against the replication of human immunodeficiency virus type 1 (HIV-1) and HFV. By contrast, zalcitabine (ddC), lamivudine (3TC), stavudine (d4T), and didanosine (ddI) were 3-, 3-, 30-, and 46-fold less efficient against HFV than against HIV-1, respectively. Some amino acid residues known to be involved in HIV-1 resistance to ddC, 3TC, d4T, and ddI were found at homologous positions of HFV reverse transcriptase (RT). These critical amino acids are located at the same positions in the three-dimensional structure of HIV-1 and HFV RT, suggesting that both enzymes share common patterns of inhibition.

Download Full-text

PROTEIN METAL BINDING RESIDUE PREDICTION BASED ON NEURAL NETWORKS

International Journal of Neural Systems ◽

10.1142/s0129065705000116 ◽

2005 ◽

Vol 15 (01n02) ◽

pp. 71-84 ◽

Cited By ~ 37

Author(s):

CHIN-TENG LIN ◽

KEN-LI LIN ◽

CHIH-HSIEN YANG ◽

I-FANG CHUNG ◽

CHUEN-DER HUANG ◽

...

Keyword(s):

Neural Networks ◽

Amino Acid ◽

Metal Ions ◽

Metal Binding ◽

Protein Structures ◽

Direct Analysis ◽

Sequence Information ◽

Amino Acid Residues ◽

Binding Residue

Over one-third of protein structures contain metal ions, which are the necessary elements in life systems. Traditionally, structural biologists were used to investigate properties of metalloproteins (proteins which bind with metal ions) by physical means and interpreting the function formation and reaction mechanism of enzyme by their structures and observations from experiments in vitro. Most of proteins have primary structures (amino acid sequence information) only; however, the 3-dimension structures are not always available. In this paper, a direct analysis method is proposed to predict the protein metal-binding amino acid residues from its sequence information only by neural networks with sliding window-based feature extraction and biological feature encoding techniques. In four major bulk elements (Calcium, Potassium, Magnesium, and Sodium), the metal-binding residues are identified by the proposed method with higher than 90% sensitivity and very good accuracy under 5-fold cross validation. With such promising results, it can be extended and used as a powerful methodology for metal-binding characterization from rapidly increasing protein sequences in the future.

Download Full-text