Phylogeny Recapitulates Learning: Self-Optimization of Genetic Code

Mapping Intimacies ◽

10.1101/260877 ◽

2018 ◽

Author(s):

Oliver Attie ◽

Brian Sulkow ◽

Chong Di ◽

Wei-Gang Qiu

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Complex Adaptive Systems ◽

Hebbian Learning ◽

Learning Algorithm ◽

Living Environment ◽

Hopfield Network ◽

Functional Difference ◽

Genetic Codes ◽

Phylogenetic Learning

AbstractLearning algorithms have been proposed as a non-selective mechanism capable of creating complex adaptive systems in life. Evolutionary learning however has not been demonstrated to be a plausible cause for the origin of a specific molecular system. Here we show that genetic codes as optimal as the Standard Genetic Code (SGC) emerge readily by following a molecular analog of the Hebb’s rule (“neurons fire together, wire together”). Specifically, error-minimizing genetic codes are obtained by maximizing the number of physio-chemically similar amino acids assigned to evolutionarily similar codons. Formulating genetic code as a Traveling Salesman Problem (TSP) with amino acids as “cities” and codons as “tour positions” and implemented with a Hopfield neural network, the unsupervised learning algorithm efficiently finds an abundance of genetic codes that are more error-minimizing than SGC. Drawing evidence from molecular phylogenies of contemporary tRNAs and aminoacyl-tRNA synthetases, we show that co-diversification between gene sequences and gene functions, which cumulatively captures functional differences with sequence differences and creates a genomic “memory” of the living environment, provides the biological basis for the Hebbian learning algorithm. Like the Hebb’s rule, the locally acting phylogenetic learning rule, which may simply be stated as increasing phylogenetic divergence for increasing functional difference, could lead to complex and robust life systems. Natural selection, while essential for maintaining gene function, is not necessary to act at system levels. For molecular systems that are self-organizing through phylogenetic learning, the TSP model and its Hopfield network solution offer a promising framework for simulating emerging behavior, forecasting evolutionary trajectories, and designing optimal synthetic systems.

Download Full-text

Symmetrical Properties of Graph Representations of Genetic Codes: From Genotype to Phenotype

Symmetry ◽

10.3390/sym10090388 ◽

2018 ◽

Vol 10 (9) ◽

pp. 388 ◽

Cited By ~ 2

Author(s):

Marco José ◽

Gabriel Zamudio

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Graph Representation ◽

Symmetry Groups ◽

Standard Genetic Code ◽

Graph Representations ◽

Symmetrical Structure ◽

Genetic Codes ◽

Mitochondrial Genetic Code

It has long been claimed that the mitochondrial genetic code possesses more symmetries than the Standard Genetic Code (SGC). To test this claim, the symmetrical structure of the SGC is compared with noncanonical genetic codes. We analyzed the symmetries of the graphs of codons and their respective phenotypic graph representation spanned by the RNY (R purines, Y pyrimidines, and N any of them) code, two RNA Extended codes, the SGC, as well as three different mitochondrial genetic codes from yeast, invertebrates, and vertebrates. The symmetry groups of the SGC and their corresponding phenotypic graphs of amino acids expose the evolvability of the SGC. Indeed, the analyzed mitochondrial genetic codes are more symmetrical than the SGC.

Download Full-text

Reporter system architecture affects measurements of noncanonical amino acid incorporation efficiency and fidelity

10.1101/737197 ◽

2019 ◽

Author(s):

K.A. Potts ◽

J.T. Stieglitz ◽

M. Lei ◽

J.A. Van Deventer

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Fluorescent Protein ◽

Protein Translation ◽

Chemical Diversity ◽

Reporter System ◽

Noncanonical Amino Acids ◽

Genetic Codes ◽

Incorporation Efficiency

AbstractThe ability to genetically encode noncanonical amino acids (ncAAs) within proteins supports a growing number of applications ranging from fundamental biological studies to enhancing the properties of biological therapeutics. Currently, our quantitative understanding of ncAA incorporation systems is confounded by the diverse set of characterization and analysis approaches used to quantify ncAA incorporation events. While several effective reporter systems support such measurements, it is not clear how quantitative results from different reporters relate to one another, or which details influence measurements most strongly. Here, we evaluate the quantitative performance of single-fluorescent protein reporters, dual-fluorescent protein reporters, and cell surface displayed protein reporters of ncAA insertion in response to the TAG (amber) codon in yeast. While different reporters support varying levels of apparent readthough efficiencies, flow cytometry-based evaluations with dual reporters yielded measurements exhibiting consistent quantitative trends and precision across all evaluated conditions. Further investigations of dual-fluorescent protein reporter architecture revealed that quantitative outputs are influenced by stop codon location and N-and C-terminal fluorescent protein identity. Both dual-fluorescent protein reporters and a “drop-in” version of yeast display support quantification of ncAA incorporation in several single-gene knockout strains, revealing strains that enhance ncAA incorporation efficiency without compromising fidelity. Our studies reveal critical details regarding reporter system performance in yeast and how to effectively deploy such reporters. These findings have substantial implications for how to engineer ncAA incorporation systems—and protein translation apparatuses—to better accommodate alternative genetic codes for expanding the chemical diversity of biosynthesized proteins.Design, System, Application ParagraphOn earth, the genetic code provides nearly invariant instructions for generating the proteins present in all organisms using 20 primary amino acid building blocks. Scientists and engineers have long recognized the potential power of altering the genetic code to introduce amino acids that enhance the chemical versatility of proteins. Proteins containing such “noncanonical amino acids” (ncAAs) can be used to elucidate basic biological phenomena, discover new therapeutics, or engineer new materials. However, tools for measuring ncAA incorporation during protein translation (reporters) exhibit highly variable properties, severely limiting our ability to engineer improved ncAA incorporation systems. In this work, we sought to understand what properties of these reporters affect measurements of ncAA incorporation events. Using a series of ncAA incorporation systems in yeast, we evaluated reporter architecture, measurement techniques, and alternative data analysis methods. We identified key factors contributing to quantification of ncAA incorporation in all of these categories and demonstrated the immediate utility of our approach in identifying genomic knockouts that enhance ncAA incorporation efficiency. Our findings have important implications for how to evolve cells to better accommodate alternative genetic codes.

Download Full-text

On the Importance of Asymmetry in the Phenotypic Expression of the Genetic Code upon the Molecular Evolution of Proteins

Symmetry ◽

10.3390/sym12060997 ◽

2020 ◽

Vol 12 (6) ◽

pp. 997

Author(s):

Marco V. José ◽

Gabriel S. Zamudio

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Phenotypic Expression ◽

Standard Genetic Code ◽

Specific Amino Acid ◽

Trna Synthetases ◽

Probability Of Occurrence ◽

Genetic Codes ◽

Evolution Of Proteins

The standard genetic code (SGC) is a mapping between the 64 possible arrangements of the four RNA nucleotides (C, A, U, G) into triplets or codons, where 61 codons are assigned to a specific amino acid and the other three are stop codons for terminating protein synthesis. Aminoacyl-tRNA synthetases (aaRSs) are responsible for implementing the SGC by specifically amino-acylating only its cognate transfer RNA (tRNA), thereby linking an amino acid with its corresponding anticodon triplets. tRNAs molecules bind each codon with its anticodon. To understand the meaning of symmetrical/asymmetrical properties of the SGC, we designed synthetic genetic codes with known symmetries and with the same degeneracy of the SGC. We determined their impact on the substitution rates for each amino acid under a neutral model of protein evolution. We prove that the phenotypic graphs of the SGC for codons and anticodons for all the possible arrangements of nucleotides are asymmetric and the amino acids do not form orbits. In the symmetrical synthetic codes, the amino acids are grouped according to their codonicity, this is the number of triplets encoding a given amino acid. Both the SGC and symmetrical synthetic codes exhibit a probability of occurrence of the amino acids proportional to their degeneracy. Unlike the SGC, the synthetic codes display a constant probability of occurrence of the amino acid according to their codonicity. The asymmetry of the phenotypic graphs of codons and anticodons of the SGC, has important implications on the evolutionary processes of proteins.

Download Full-text

Dihedral Group in The Ancient Genetic

Jurnal Matematika Integratif ◽

10.24198/jmi.v16.n1.26646.13-18 ◽

2020 ◽

Vol 16 (1) ◽

pp. 13

Author(s):

Isah Aisah ◽

Eddy Djauhari ◽

Asep Singgih

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Dihedral Group ◽

Vector Spaces ◽

Symmetry Groups ◽

Standard Genetic Code ◽

Dihedral Groups ◽

Genetic Codes ◽

Living Things ◽

Nucleotide Bases

The standard genetic code consist of four nucleotide bases which encode genes to produce amino acids needed by living things. The addition of new base (Dummy) causes a sequence of bases to become five nucleotide bases called ancient genetic codes. The five base set is denoted by , where B forms group through matching , , , , and from set . Ancient genetic codes can be reviewed as algebraic structures as a vector spaces and other structures as symmetry groups. In this article, discussed the properties of symmetry groups from ancient genetic codes that will produce dihedral groups. The study began by constructing an expanded nucleotide base isomorphism with . The presence of base causes to have a cardinality of 24, denoted as with . isomorphic with which is denoted by . Group had three clasess of partitions based on strong-weak, purin-pyrimidin types, and amino-keto nucleotide groups which are denoted as , , and . All three classes are subgroups of . By using the rules of rotation and reflection in the four-side plane, it was found that only one group fulfilled the rule was named the dihedral group.

Download Full-text

Optimization of Expanded Genetic Codes via Genetic Algorithms

10.5753/eniac.2018.4440 ◽

2018 ◽

Author(s):

Maísa de Carvalho Silva ◽

Lariza Laura De Oliveira ◽

Renato Tinós

Keyword(s):

Amino Acids ◽

Genetic Algorithms ◽

Amino Acid ◽

Genetic Code ◽

Unnatural Amino Acids ◽

Genetically Modified Organisms ◽

Fitness Function ◽

Unnatural Amino Acid ◽

Standard Genetic Code ◽

Genetic Codes

In the last decades, researchers have proposed the use of genetically modified organisms that utilize unnatural amino acids, i.e., amino acids other than the 20 amino acids encoded in the standard genetic code. Unnatural amino acids have been incorporated into genetically engineered organisms for the development of new drugs, fuels and chemicals. When new amino acids are incorporated, it is necessary to modify the standard genetic code. Expanded genetic codes have been created without considering the robustness of the code. The objective of this work is the use of genetic algorithms (GAs) for the optimization of expanded genetic codes. The GA indicates which codons of the standard genetic code should be used to encode a new unnatural amino acid. The fitness function has two terms; one for robustness of the new code and another that takes into account the frequency of use of amino acids. Experiments show that, by controlling the weighting between the two terms, it is possible to obtain more or less amino acid substitutions at the same time that the robustness is minimized.

Download Full-text

Faculty Opinions recommendation of Genetic code supports targeted insertion of two amino acids by one codon.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1146994.604104 ◽

2009 ◽

Author(s):

Lucio Comai

Keyword(s):

Amino Acids ◽

Genetic Code

Download Full-text

A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design

Nature Chemistry ◽

10.1038/s41557-021-00764-5 ◽

2021 ◽

Cited By ~ 1

Author(s):

Daniel L. Dunkelmann ◽

Sebastian B. Oehm ◽

Adam T. Beattie ◽

Jason W. Chin

Keyword(s):

Amino Acids ◽

Genetic Code ◽

Mrna Design

Download Full-text

Transferability of N-terminal mutations of pyrrolysyl-tRNA synthetase in one species to that in another species on unnatural amino acid incorporation efficiency

Amino Acids ◽

10.1007/s00726-020-02927-z ◽

2020 ◽

Author(s):

Thomas L. Williams ◽

Debra J. Iskandar ◽

Alexander R. Nödling ◽

Yurong Tan ◽

Louis Y. P. Luk ◽

...

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Genetic Code ◽

Unnatural Amino Acids ◽

Trna Synthetase ◽

Unnatural Amino Acid ◽

Amino Acid Incorporation ◽

Acid Incorporation ◽

Genetic Code Expansion ◽

Incorporation Efficiency

AbstractGenetic code expansion is a powerful technique for site-specific incorporation of an unnatural amino acid into a protein of interest. This technique relies on an orthogonal aminoacyl-tRNA synthetase/tRNA pair and has enabled incorporation of over 100 different unnatural amino acids into ribosomally synthesized proteins in cells. Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA from Methanosarcina species are arguably the most widely used orthogonal pair. Here, we investigated whether beneficial effect in unnatural amino acid incorporation caused by N-terminal mutations in PylRS of one species is transferable to PylRS of another species. It was shown that conserved mutations on the N-terminal domain of MmPylRS improved the unnatural amino acid incorporation efficiency up to five folds. As MbPylRS shares high sequence identity to MmPylRS, and the two homologs are often used interchangeably, we examined incorporation of five unnatural amino acids by four MbPylRS variants at two temperatures. Our results indicate that the beneficial N-terminal mutations in MmPylRS did not improve unnatural amino acid incorporation efficiency by MbPylRS. Knowledge from this work contributes to our understanding of PylRS homologs which are needed to improve the technique of genetic code expansion in the future.

Download Full-text