Exploring the past and the future of protein evolution with ancestral sequence reconstruction: the ‘retro’ approach to protein engineering

2016 ◽  
Vol 474 (1) ◽  
pp. 1-19 ◽  
Author(s):  
Yosephine Gumulya ◽  
Elizabeth M.J. Gillam

A central goal in molecular evolution is to understand the ways in which genes and proteins evolve in response to changing environments. In the absence of intact DNA from fossils, ancestral sequence reconstruction (ASR) can be used to infer the evolutionary precursors of extant proteins. To date, ancestral proteins belonging to eubacteria, archaea, yeast and vertebrates have been inferred that have been hypothesized to date from between several million to over 3 billion years ago. ASR has yielded insights into the early history of life on Earth and the evolution of proteins and macromolecular complexes. Recently, however, ASR has developed from a tool for testing hypotheses about protein evolution to a useful means for designing novel proteins. The strength of this approach lies in the ability to infer ancestral sequences encoding proteins that have desirable properties compared with contemporary forms, particularly thermostability and broad substrate range, making them good starting points for laboratory evolution. Developments in technologies for DNA sequencing and synthesis and computational phylogenetic analysis have led to an escalation in the number of ancient proteins resurrected in the last decade and greatly facilitated the use of ASR in the burgeoning field of synthetic biology. However, the primary challenge of ASR remains in accurately inferring ancestral states, despite the uncertainty arising from evolutionary models, incomplete sequences and limited phylogenetic trees. This review will focus, firstly, on the use of ASR to uncover links between sequence and phenotype and, secondly, on the practical application of ASR in protein engineering.

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Ryutaro Furukawa ◽  
Wakako Toma ◽  
Koji Yamazaki ◽  
Satoshi Akanuma

Abstract Enzymes have high catalytic efficiency and low environmental impact, and are therefore potentially useful tools for various industrial processes. Crucially, however, natural enzymes do not always have the properties required for specific processes. It may be necessary, therefore, to design, engineer, and evolve enzymes with properties that are not found in natural enzymes. In particular, the creation of enzymes that are thermally stable and catalytically active at low temperature is desirable for processes involving both high and low temperatures. In the current study, we designed two ancestral sequences of 3-isopropylmalate dehydrogenase by an ancestral sequence reconstruction technique based on a phylogenetic analysis of extant homologous amino acid sequences. Genes encoding the designed sequences were artificially synthesized and expressed in Escherichia coli. The reconstructed enzymes were found to be slightly more thermally stable than the extant thermophilic homologue from Thermus thermophilus. Moreover, they had considerably higher low-temperature catalytic activity as compared with the T. thermophilus enzyme. Detailed analyses of their temperature-dependent specific activities and kinetic properties showed that the reconstructed enzymes have catalytic properties similar to those of mesophilic homologues. Collectively, our study demonstrates that ancestral sequence reconstruction can produce a thermally stable enzyme with catalytic properties adapted to low-temperature reactions.


Author(s):  
Milos Musil ◽  
Rayyan Tariq Khan ◽  
Andy Beier ◽  
Jan Stourac ◽  
Hannes Konegger ◽  
...  

Abstract There is a great interest in increasing proteins’ stability to widen their usability in numerous biomedical and biotechnological applications. However, native proteins cannot usually withstand the harsh industrial environment, since they are evolved to function under mild conditions. Ancestral sequence reconstruction is a well-established method for deducing the evolutionary history of genes. Besides its applicability to discover the most probable evolutionary ancestors of the modern proteins, ancestral sequence reconstruction has proven to be a useful approach for the design of highly stable proteins. Recently, several computational tools were developed, which make the ancestral reconstruction algorithms accessible to the community, while leaving the most crucial steps of the preparation of the input data on users’ side. FireProtASR aims to overcome this obstacle by constructing a fully automated workflow, allowing even the unexperienced users to obtain ancestral sequences based on a sequence query as the only input. FireProtASR is complemented with an interactive, easy-to-use web interface and is freely available at https://loschmidt.chemi.muni.cz/fireprotasr/.


2013 ◽  
Vol 9 (5) ◽  
pp. 20130608 ◽  
Author(s):  
Mathieu Groussin ◽  
Bastien Boussau ◽  
Sandrine Charles ◽  
Samuel Blanquart ◽  
Manolo Gouy

Several lines of evidence such as the basal location of thermophilic lineages in large-scale phylogenetic trees and the ancestral sequence reconstruction of single enzymes or large protein concatenations support the conclusion that the ancestors of the bacterial and archaeal domains were thermophilic organisms which were adapted to hot environments during the early stages of the Earth. A parsimonious reasoning would therefore suggest that the last universal common ancestor (LUCA) was also thermophilic. Various authors have used branch-wise non-homogeneous evolutionary models that better capture the variation of molecular compositions among lineages to accurately reconstruct the ancestral G + C contents of ribosomal RNAs and the ancestral amino acid composition of highly conserved proteins. They confirmed the thermophilic nature of the ancestors of Bacteria and Archaea but concluded that LUCA, their last common ancestor, was a mesophilic organism having a moderate optimal growth temperature. In this letter, we investigate the unknown nature of the phylogenetic signal that informs ancestral sequence reconstruction to support this non-parsimonious scenario. We find that rate variation across sites of molecular sequences provides information at different time scales by recording the oldest adaptation to temperature in slow-evolving regions and subsequent adaptations in fast-evolving ones.


2018 ◽  
Vol 35 (15) ◽  
pp. 2562-2568
Author(s):  
Asher Moshe ◽  
Tal Pupko

Abstract Motivation Ancestral sequence reconstruction (ASR) is widely used to understand protein evolution, structure and function. Current ASR methodologies do not fully consider differences in evolutionary constraints among positions imposed by the three-dimensional (3D) structure of the protein. Here, we developed an ASR algorithm that allows different protein sites to evolve according to different mixtures of replacement matrices. We show that assigning replacement matrices to protein positions based on their solvent accessibility leads to ASR with higher log-likelihoods compared to naïve models that assume a single replacement matrix for all sites. Improved ASR log-likelihoods are also demonstrated when solvent accessibility is predicted from protein sequences rather than inferred from a known 3D structure. Finally, we show that using such structure-aware mixture models results in substantial differences in the inferred ancestral sequences. Availability and implementation http://fastml.tau.ac.il. Supplementary information Supplementary data are available at Bioinformatics online.


Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
Matias Sebastian Carletti ◽  
Alexander Miguel Monzon ◽  
Emilio Garcia-Rios ◽  
Guillermo Benitez ◽  
Layla Hirsh ◽  
...  

Abstract Revenant is a database of resurrected proteins coming from extinct organisms. Currently, it contains a manually curated collection of 84 resurrected proteins derived from bibliographic data. Each protein is extensively annotated, including structural, biochemical and biophysical information. Revenant contains a browse capability designed as a timeline from where the different proteins can be accessed. The oldest Revenant entries are between 4200 and 3500 million years ago, while the younger entries are between 8.8 and 6.3 million years ago. These proteins have been resurrected using computational tools called ancestral sequence reconstruction techniques combined with wet-laboratory synthesis and expression. Resurrected proteins are commonly used, with a noticeable increase during the past years, to explore and test different evolutionary hypotheses such as protein stability, to explore the origin of new functions, to get biochemical insights into past metabolisms and to explore specificity and promiscuous behaviour of ancient proteins.


2015 ◽  
Author(s):  
Betul Kacar

Scientists have access to artifacts of evolutionary history (namely, the fossil record and genomic sequences of living organisms) but they have limited means with which to infer the exact evolutionary events that occurred to produce today s living world. An intriguing question to arise from this historical limitation is whether the evolutionary paths of organisms are dominated by internal or external controlled processes (i.e., Life as a factory) or whether they are inherently random and subject to completely different outcomes if repeated under identical conditions (i.e., Life as a casino parlor). Two experimental approaches, ancestral sequence reconstruction and experimental evolution with microorganisms, can be used to recapitulate ancient adaptive pathways and provide valuable insights into the mutational steps that constitute an organism s genetic heritage. Ancestral sequence reconstruction follows a backwards-from-present-day strategy in which various ancestral forms of a modern gene or protein are reconstructed and then studied mechanistically. Experimental evolution, by contrast, follows a forward-from-present day strategy in which microbial populations are evolved in the laboratory under defined conditions in which their evolutionary paths may be closely monitored. Here I describe a novel hybrid of these two methods, in which synthetic components constructed from inferred ancestral gene or protein sequences are placed into the genomes of modern organisms that are then experimentally evolved. Through this system, we aim to establish the comparative study of ancient phenotypes as a novel, statistically rigorous methodology with which to explore the respective impacts of biophysics and chance in evolution within the scope of the Extended Synthesis.


2021 ◽  
Vol 69 ◽  
pp. 131-141
Author(s):  
Matthew A. Spence ◽  
Joe A. Kaczmarski ◽  
Jake W. Saunders ◽  
Colin J. Jackson

2018 ◽  
Vol 35 (7) ◽  
pp. 1783-1797 ◽  
Author(s):  
Ricardo Assunção Vialle ◽  
Asif U Tamuri ◽  
Nick Goldman

Sign in / Sign up

Export Citation Format

Share Document