Probability of Change in Life: amino acid changes in single nucleotide substitutions

Mapping Intimacies ◽

10.1101/729756 ◽

2019 ◽

Cited By ~ 1

Author(s):

Kwok-Fong Chan ◽

Stelios Koukouravas ◽

Joshua Yi Yeo ◽

Darius Wen-Shuo Koh ◽

Samuel Ken-En Gan

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Nucleotide Substitutions ◽

Universal Genetic Code ◽

Stop Codons ◽

Charged Amino Acids ◽

Wide Range ◽

Statistical Mechanism ◽

The Relationship

ABSTRACTMutations underpin the processes in life, be it beneficial or detrimental. While mutations are assumed to be random in the bereft of selection pressures, the genetic code has underlying computable probabilities in amino acid phenotypic changes. With a wide range of implications including drug resistance, understanding amino acid changes is important. In this study, we calculated the probabilities of substitutions mutations in the genetic code leading to the 20 amino acids and stop codons. Our calculations reveal an enigmatic in-built self-preserving organization of the genetic code that averts disruptive changes at the physicochemical properties level. These changes include changes to start, aromatic, negative charged amino acids and stop codons. Our findings thus reveal a statistical mechanism governing the relationship between amino acids and the universal genetic code.

Download Full-text

The Origin of Prebiotic Information System in the Peptide/RNA World: A Simulation Model of the Evolution of Translation and the Genetic Code

10.20944/preprints201812.0203.v1 ◽

2018 ◽

Author(s):

Sankar Chatterjee ◽

Surya Yadav

Keyword(s):

Amino Acids ◽

Information System ◽

Protein Synthesis ◽

Amino Acid ◽

Genetic Code ◽

Rna World ◽

Information Age ◽

Universal Genetic Code ◽

Wide Range ◽

Translation Machine

Information is the currency of life, but the origin of prebiotic information remains a mystery. We propose transitional pathways from the cosmic building blocks of life to the complex prebiotic organic chemistry that led to the origin of information systems. The prebiotic information system, specifically the genetic code, is segregated, linear, and digital and probably appeared during biogenesis four billion years ago. In the peptide/RNA world, lipid membranes randomly encapsulated amino acids, RNA, and protein molecules, drawn from the prebiotic soup, to initiate a molecular symbiosis inside the protocells. This endosymbiosis led to the hierarchical emergence of several requisite components of the translation machine: tRNAs, aaRS, mRNAs, and ribosomes. When assembled in the right order, the translation machine created biosynthetic polypeptides, a process that transferred information from mRNAs to proteins. This was the beginning of the prebiotic information age. The molecular attraction between tRNA and amino acids led to different stages of the translation machines and the genetic code. tRNA is an ancient molecule that designed and built mRNA for storing the information of its cognate amino acid. Each mRNA strand became the storage device for the genetic information that encoded the amino acid sequences in triplet nucleotides. As information appeared in the digital languages of the codon within mRNA, and the genetic code for protein synthesis evolved, the prebiotic chemistry then became more organized and directional. The origin of the genetic code is enigmatic; herein we propose an evolutionary explanation: the demand for a wide range of specific enzymes in the peptide/RNA world was the main selective pressure for the origin of information-directed protein synthesis. We review three main concepts on the origin and evolution of the genetic code: the stereochemical theory, the coevolution theory, and the adaptive theory. These three theories are compatible with our coevolution model of the translation machines and the genetic code. We suggest biosynthetic pathways as the origin of the specific translation machines which provided the framework for the origin of the genetic code. During translation, the genetic code developed in three stages coincident with the refinement of the translation machines: GNC code developed by the pre-tRNA/pre-aaRS /pre-mRNA machine, SNS code by the tRNA/aaRS/mRNA machine, and finally the universal genetic code by the tRNA/aaRS/mRNA/ribosome machine. Our hypothesis provides the logical and incremental steps for the origin of the programmed protein synthesis. In order to understand the prebiotic information system better, we converted letter codons into numerical codons in the Universal Genetic Code Table. We have developed a software called CATI (Codon-Amino Acid-Translator-Imitator) to translate randomly chosen numerical codons into corresponding amino acids and vice versa. This conversion has granted us insight into how the translation might have worked in the peptide/RNA world. There is great potential in the application of numerical codons to bioinformatics such as barcoding, DNA mining, or DNA fingerprinting. We constructed the likely biochemical pathways for the origin of translation and the genetic code using the Model-View-Controller (MVC) software framework, and the translation machinery step-by-step. Using AnyLogic software we were able to simulate and visualize the entire evolution of the translation machines and the genetic code. The results indicate that the emergence of the information age from the peptide/RNA world was a watershed event in the origin of life about four billion years ago.

Download Full-text

Modern diversification of the amino acid repertoire driven by oxygen

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1717100115 ◽

2017 ◽

Vol 115 (1) ◽

pp. 41-46 ◽

Cited By ~ 24

Author(s):

Matthias Granold ◽

Parvana Hajieva ◽

Monica Ioana Toşa ◽

Florin-Dan Irimie ◽

Bernd Moosmann

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Chemical Reactivity ◽

Protein Biosynthesis ◽

Building Blocks ◽

Peroxyl Radicals ◽

Basis Set ◽

Universal Genetic Code ◽

Amino Acid Properties

All extant life employs the same 20 amino acids for protein biosynthesis. Studies on the number of amino acids necessary to produce a foldable and catalytically active polypeptide have shown that a basis set of 7–13 amino acids is sufficient to build major structural elements of modern proteins. Hence, the reasons for the evolutionary selection of the current 20 amino acids out of a much larger available pool have remained elusive. Here, we have analyzed the quantum chemistry of all proteinogenic and various prebiotic amino acids. We find that the energetic HOMO–LUMO gap, a correlate of chemical reactivity, becomes incrementally closer in modern amino acids, reaching the level of specialized redox cofactors in the late amino acids tryptophan and selenocysteine. We show that the arising prediction of a higher reactivity of the more recently added amino acids is correct as regards various free radicals, particularly oxygen-derived peroxyl radicals. Moreover, we demonstrate an immediate survival benefit conferred by the enhanced redox reactivity of the modern amino acids tyrosine and tryptophan in oxidatively stressed cells. Our data indicate that in demanding building blocks with more versatile redox chemistry, biospheric molecular oxygen triggered the selective fixation of the last amino acids in the genetic code. Thus, functional rather than structural amino acid properties were decisive during the finalization of the universal genetic code.

Download Full-text

The Origin of Prebiotic Information System in the Peptide/RNA World: A Simulation Model of the Evolution of Translation and the Genetic Code

Life ◽

10.3390/life9010025 ◽

2019 ◽

Vol 9 (1) ◽

pp. 25 ◽

Cited By ~ 13

Author(s):

Sankar Chatterjee ◽

Surya Yadav

Keyword(s):

Amino Acids ◽

Information System ◽

Protein Synthesis ◽

Amino Acid ◽

Genetic Code ◽

Rna World ◽

Trna Synthetase ◽

Darwinian Evolution ◽

Universal Genetic Code ◽

Translation Machine

Information is the currency of life, but the origin of prebiotic information remains a mystery. We propose transitional pathways from the cosmic building blocks of life to the complex prebiotic organic chemistry that led to the origin of information systems. The prebiotic information system, specifically the genetic code, is segregated, linear, and digital, and it appeared before the emergence of DNA. In the peptide/RNA world, lipid membranes randomly encapsulated amino acids, RNA, and peptide molecules, which are drawn from the prebiotic soup, to initiate a molecular symbiosis inside the protocells. This endosymbiosis led to the hierarchical emergence of several requisite components of the translation machine: transfer RNAs (tRNAs), aminoacyl-tRNA synthetase (aaRS), messenger RNAs (mRNAs), ribosomes, and various enzymes. When assembled in the right order, the translation machine created proteins, a process that transferred information from mRNAs to assemble amino acids into polypeptide chains. This was the beginning of the prebiotic <i>information</i> age. The origin of the genetic code is enigmatic; herein, we propose an evolutionary explanation: the demand for a wide range of protein enzymes over peptides in the prebiotic reactions was the main selective pressure for the origin of information-directed protein synthesis. The molecular basis of the genetic code manifests itself in the interaction of aaRS and their cognate tRNAs. In the beginning, aminoacylated ribozymes used amino acids as a cofactor with the help of bridge peptides as a process for selection between amino acids and their cognate codons/anticodons. This process selects amino acids and RNA species for the next steps. The ribozymes would give rise to pre-tRNA and the bridge peptides to pre-aaRS. Later, variants would appear and evolution would produce different but specific aaRS-tRNA-amino acid combinations. Pre-tRNA designed and built pre-mRNA for the storage of information regarding its cognate amino acid. Each pre-mRNA strand became the storage device for the genetic information that encoded the amino acid sequences in triplet nucleotides. As information appeared in the digital languages of the codon within pre-mRNA and mRNA, and the genetic code for protein synthesis evolved, the prebiotic chemistry then became more organized and directional with the emergence of the translation and genetic code. The genetic code developed in three stages that are coincident with the refinement of the translation machines: the GNC code that was developed by the pre-tRNA/pre-aaRS /pre-mRNA machine, SNS code by the tRNA/aaRS/mRNA machine, and finally the universal genetic code by the tRNA/aaRS/mRNA/ribosome machine. We suggest the coevolution of translation machines and the genetic code. The emergence of the translation machines was the beginning of the Darwinian evolution, an interplay between information and its supporting structure. Our hypothesis provides the logical and incremental steps for the origin of the programmed protein synthesis. In order to better understand the prebiotic information system, we converted letter codons into numerical codons in the Universal Genetic Code Table. We have developed a software, called CATI (Codon-Amino Acid-Translator-Imitator), to translate randomly chosen numerical codons into corresponding amino acids and vice versa. This conversion has granted us insight into how the genetic code might have evolved in the peptide/RNA world. There is great potential in the application of numerical codons to bioinformatics, such as barcoding, DNA mining, or DNA fingerprinting. We constructed the likely biochemical pathways for the origin of translation and the genetic code using the Model-View-Controller (MVC) software framework, and the translation machinery step-by-step. While using AnyLogic software, we were able to simulate and visualize the entire evolution of the translation machines, amino acids, and the genetic code.

Download Full-text

The Origin of the Information System in the RNA/Protein World: A Simulation Model of the Evolution of Translation and the Genetic Code

10.20944/preprints201810.0127.v1 ◽

2018 ◽

Author(s):

Sankar Chatterjee ◽

Surya Yadav

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Origin Of Life ◽

Building Blocks ◽

Translation Machinery ◽

Crater Lakes ◽

Universal Genetic Code ◽

Protein World ◽

The Origin Of Life

The Late Heavy Bombardment Period (4.1 to 3.8 billion years ago) of heightened impact cratering activity on young Earth is likely the driving force for the origin of life. During the Eoarchean, asteroids such as carbonaceous chondrites delivered the building blocks of life and water to early Earth. Asteroid collisions created innumerable hydrothermal crater lakes in the Eoarchean crust which inadvertently became the perfect cradle for prebiotic chemistry. These hydrothermal crater lakes were filled with cosmic water and the building blocks of life. forming a thick prebiotic soup. The unique combination of exogenous delivery of extraterrestrial building blocks of life, and the endogenous biosynthesis in hydrothermal impact crater lakes very likely gave rise to life. A new symbiotic model for the origin of life within the hydrothermal crater lakes is here proposed. In this scenario, life arose around four billion years ago through five hierarchical stages of increasing molecular complexity: cosmic, geologic, chemical, information, and biological. During the prebiotic synthesis, membranes first appeared in the hydrothermal crater lakes, followed by the simultaneous origin of RNA and protein molecules, creating the RNA/protein world. These proteins were noncoded protein enzymes that facilitated chemical reactions. RNA molecules formed in the hydrothermal crater basin by polymerization of the nucleotides on the montmorillonite mineral substrate. Similarly, the initial synthesis of abiotic protein enzymes was mediated by the condensation of amino acids on pyrite surfaces. The regular wet-dry cycles within the crater lakes assisted further concentration, condensation, and polymerization of the RNAs and proteins. Lipid membranes randomly encapsulated amino acids, RNA, and protein molecules from the prebiotic soup to initiate a molecular symbiosis inside the protocells, this led to the hierarchical emergence of several cell components. As the role of protein enzymes became essential for catalytic process in the RNA/protein world, Darwinian selection from noncoded to coded protein synthesis led to translation systems and the genetic code, heralding the information stage. In this stage, the biochemical pathways suggest the successive emergence of translation machineries such as tRNAs, aaRS, mRNAs, and of ribosomes for protein synthesis. The molecular attraction between tRNA and amino acid led to the emergence of translation machinery and the genetic code.  tRNA is an ancient molecule that created mRNA for the purpose of storing amino acid information like a digital strip. Each mRNA strand became the storage device for genetic information that encoded the amino acid sequences in triplet nucleotides. As information became available in the digital languages of the codon within mRNA, biosynthesis became less random and more organized and directional. The original translation machinery was simpler before the emergence of the ribosome than that of today. We review three main concepts on the origin and evolution of the genetic code: the stereochemical theory, the coevolution theory, and adaptive theory. We believe that these three theories are not mutually exclusive, but are compatible with our coevolution model of translations machines and the genetic code. We suggest biosynthetic pathways as the origin of the translation machine that provided the framework for the origin of the genetic code. During translation, the genetic code developed in three stages coincident with the refinement of the translation machinery: GNC code with four codons and four amino acids during interactions of pre-tRNA/pre-aaRS /pre-mRNA, SNS code consisting of 16 codons and 10 amino acids appeared during the tRNA/aaRS/mRNA interaction, and finally the universal genetic code evolved with the emergence of the tRNA/aaRS/mRNA/ribosome machine. The universal code consists of 64 codons and 20 amino acids, with a redundancy that minimizes errors in translation. To address the question of the origin of the biological information system in the RNA/protein world, we converted letter codons into numerical codons in the Universal Genetic Code Table. We developed a software called CATI (Codon-Amino Acid-Translator-Imitator) to translate randomly chosen numerical codons into corresponding amino acids and vice versa, gaining insight into how translation might have worked in the RNA/protein world. We simulated the likely biochemical pathways for the origin of translation and the genetic code using the Model-View-Controller (MVC) software framework, and the translation machinery step-by-step. We used AnyLogic software to simulate and visualize the evolution of the translation machines and the genetic code. We conclude that the emergence of the information age from the RNA/protein world was a watershed event in the origin of life about four billion years ago.

Download Full-text

Transferability of N-terminal mutations of pyrrolysyl-tRNA synthetase in one species to that in another species on unnatural amino acid incorporation efficiency

Amino Acids ◽

10.1007/s00726-020-02927-z ◽

2020 ◽

Author(s):

Thomas L. Williams ◽

Debra J. Iskandar ◽

Alexander R. Nödling ◽

Yurong Tan ◽

Louis Y. P. Luk ◽

...

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Unnatural Amino Acids ◽

Trna Synthetase ◽

Unnatural Amino Acid ◽

Amino Acid Incorporation ◽

Acid Incorporation ◽

Genetic Code Expansion ◽

Incorporation Efficiency

AbstractGenetic code expansion is a powerful technique for site-specific incorporation of an unnatural amino acid into a protein of interest. This technique relies on an orthogonal aminoacyl-tRNA synthetase/tRNA pair and has enabled incorporation of over 100 different unnatural amino acids into ribosomally synthesized proteins in cells. Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA from Methanosarcina species are arguably the most widely used orthogonal pair. Here, we investigated whether beneficial effect in unnatural amino acid incorporation caused by N-terminal mutations in PylRS of one species is transferable to PylRS of another species. It was shown that conserved mutations on the N-terminal domain of MmPylRS improved the unnatural amino acid incorporation efficiency up to five folds. As MbPylRS shares high sequence identity to MmPylRS, and the two homologs are often used interchangeably, we examined incorporation of five unnatural amino acids by four MbPylRS variants at two temperatures. Our results indicate that the beneficial N-terminal mutations in MmPylRS did not improve unnatural amino acid incorporation efficiency by MbPylRS. Knowledge from this work contributes to our understanding of PylRS homologs which are needed to improve the technique of genetic code expansion in the future.

Download Full-text

A multivariate study of the relationship between the genetic code and the physical-chemical properties of amino acids

Journal of Molecular Evolution ◽

10.1007/bf02099756 ◽

1985 ◽

Vol 22 (3) ◽

pp. 272-277 ◽

Cited By ~ 70

Author(s):

Michael Sjöström ◽

Svante Wold

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Chemical Properties ◽

Multivariate Study ◽

Physical Chemical ◽

The Relationship

Download Full-text

Making Sense of “Nonsense” and More: Challenges and Opportunities in the Genetic Code Expansion, in the World of tRNA Modifications

International Journal of Molecular Sciences ◽

10.3390/ijms23020938 ◽

2022 ◽

Vol 23 (2) ◽

pp. 938

Author(s):

Olubodun Michael Lateef ◽

Michael Olawale Akintubosun ◽

Olamide Tosin Olaoba ◽

Sunday Ocholi Samson ◽

Malgorzata Adamczyk

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Protein Design ◽

Side Chains ◽

Vast Number ◽

Trna Modifications ◽

Wide Range ◽

Challenges And Opportunities ◽

Code Extension ◽

Genetic Code Expansion

The evolutional development of the RNA translation process that leads to protein synthesis based on naturally occurring amino acids has its continuation via synthetic biology, the so-called rational bioengineering. Genetic code expansion (GCE) explores beyond the natural translational processes to further enhance the structural properties and augment the functionality of a wide range of proteins. Prokaryotic and eukaryotic ribosomal machinery have been proven to accept engineered tRNAs from orthogonal organisms to efficiently incorporate noncanonical amino acids (ncAAs) with rationally designed side chains. These side chains can be reactive or functional groups, which can be extensively utilized in biochemical, biophysical, and cellular studies. Genetic code extension offers the contingency of introducing more than one ncAA into protein through frameshift suppression, multi-site-specific incorporation of ncAAs, thereby increasing the vast number of possible applications. However, different mediating factors reduce the yield and efficiency of ncAA incorporation into synthetic proteins. In this review, we comment on the recent advancements in genetic code expansion to signify the relevance of systems biology in improving ncAA incorporation efficiency. We discuss the emerging impact of tRNA modifications and metabolism in protein design. We also provide examples of the latest successful accomplishments in synthetic protein therapeutics and show how codon expansion has been employed in various scientific and biotechnological applications.

Download Full-text

Analysis of Htra Gene from Zebrafish (Danio Rerio)

Indonesian Journal of Biotechnology ◽

10.22146/ijbiotech.7554 ◽

2015 ◽

Vol 10 (2) ◽

Author(s):

M. Murwantoko ◽

Chio Oka ◽

Masashi Kawaichi

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Amino Acid Sequence ◽

Danio Rerio ◽

Serine Proteases ◽

Catalytic Domain ◽

Fruit Fly ◽

Wide Range ◽

Igf Binding ◽

Bacterial Homolog

HtrA which is characterized by the combination of a trypsin-like catalytic domain with at least one C-terminalPDZ domain is a highly conserved family of serine proteases found in a wide range of organisms. However theidentified HtrA family numbers varies among spesies, for example the number of mammalian, Eschericia coli,fruit fly-HtrA family are 4, 3 and 1 gene respectively. One gene is predicted exist in zebrafish. Since no completeinformation available on zebrafish HtrA, in this paper zebrafish HtrA (zHtrA) gene was analyzed. The zHtrA isbelonged to HtrA1 member and predicted encodes 478 amino acids with a signal peptide, a IGF binding domain,a Kazal-type inhibitor domain in the up stream of HtrA-bacterial homolog. At the amino acid sequence the zHtrA1showed the 69%, 69%, 68%, 54% and 54% with the rat HtrA1, mouse HtrA1, human HtrA1, human HtrA3 andmouse HtrA4 respectively. The zHtrA1 is firstly expressed at 60 hpf and mainly in the vertebral rudiments in thetail region.

Download Full-text

Charged Amino Acids Conserved in the Aromatic Acid/H+ Symporter Family of Permeases Are Required for 4-Hydroxybenzoate Transport by PcaK from Pseudomonas putida

Journal of Bacteriology ◽

10.1128/jb.184.5.1444-1448.2002 ◽

2002 ◽

Vol 184 (5) ◽

pp. 1444-1448 ◽

Cited By ~ 26

Author(s):

Jayna L. Ditty ◽

Caroline S. Harwood

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Pseudomonas Putida ◽

Essential Amino Acid ◽

Major Facilitator Superfamily ◽

Aromatic Acid ◽

Transmembrane Segment ◽

Amino Acid Motif ◽

Charged Amino Acids ◽

Major Facilitator

ABSTRACT Charged amino acids in the predicted transmembrane portion of PcaK, a permease from Pseudomonas putida that transports 4-hydroxybenzoate (4-HBA), were required for 4-HBA transport, and they were also required for P. putida to have a chemotactic response to 4-HBA. An essential amino acid motif (DGXD) containing aspartate residues is located in the first transmembrane segment of PcaK and is conserved in the aromatic acid/H+ symporter family of the major facilitator superfamily of transporters.

Download Full-text

An Automated Detection System for Most L-α-Amino Acids

Clinical Chemistry ◽

10.1093/clinchem/15.2.154 ◽

1969 ◽

Vol 15 (2) ◽

pp. 154-161 ◽

Cited By ~ 6

Author(s):

K Van Dyke ◽

C Szustkiewicz

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Detection System ◽

Automated System ◽

Acid Oxidase ◽

Enzymatic Reactions ◽

Amino Acid Oxidase ◽

Automated Method ◽

Wide Range

Abstract An automated system for the determination of the L-α form of the majority of amino acids is presented. The method is based upon oxidative deamination of the amino acid coupled with oxidation of o-dianisidine by hydrogen peroxide. This procedure can be used comparatively for the determination of a mixture of L-α-amino acids or for the majority of separated L-α-amino acids (especially in conjunction with column separations from urine and blood which give falsely positive identification with ninhydrin detection). The stereospecific nature of the L-α-amino acid oxidase enables the investigator to quantitate the amount of L-α-amino acid in the presence of the D-α form. From an academic viewpoint, the extreme sensitivity and wide range of the detection system make it advantageous for the study of the enzyme itself. This automated method also may be employed to follow enzymatic reactions—e.g., those catalyzed by peptidases or racemases. The methodology is extremely convenient with good reagent stability and is much more sensitive than manometric technics.

Download Full-text