scholarly journals A novel unbiased test for molecular convergent evolution and discoveries in echolocating, aquatic and high-altitude mammals

2017 ◽  
Author(s):  
Amir Marcovitz ◽  
Yatish Turakhia ◽  
Michael Gloudemans ◽  
Benjamin A. Braun ◽  
Heidi I. Chen ◽  
...  

AbstractDistantly related species entering similar biological niches often adapt by evolving similar morphological and physiological characters. The extent to which genomic molecular convergence, and the extent to which coding mutations underlie this convergent phenotypic evolution remain unknown. Using a novel test, we ask which group of functionally coherent genes is most affected by convergent amino acid substitutions between phenotypically convergent lineages. This most affected sets reveals 75 novel coding convergences in important genes that pattern a highly adapted organ: the cochlea, skin and lung in echolocating, aquatic and high-altitude mammals, respectively. Our test explicitly requires the enriched converged term to not be simultaneously enriched for divergent mutations, and correctly dismisses relaxation-based signals, such as those produced by vision genes in subterranean mammals. This novel test can be readily applied to birds, fish, flies, worms etc., to discover more of the fascinating contribution of protein coding convergence to phenotype convergence.

2012 ◽  
Vol 279 (1740) ◽  
pp. 3075-3082 ◽  
Author(s):  
Evgeny V. Leushkin ◽  
Georgii A. Bazykin ◽  
Alexey S. Kondrashov

Maps that relate all possible genotypes or phenotypes to fitness—fitness landscapes—are central to the evolution of life, but remain poorly known. An insertion or a deletion (indel) of one or several amino acids constitutes a substantial leap of a protein within the space of amino acid sequences, and it is unlikely that after such a leap the new sequence corresponds precisely to a fitness peak. Thus, one can expect an indel in the protein-coding sequence that gets fixed in a population to be followed by some number of adaptive amino acid substitutions, which move the new sequence towards a nearby fitness peak. Here, we study substitutions that occur after a frame-preserving indel in evolving proteins of Drosophila . An insertion triggers 1.03 ± 0.75 amino acid substitutions within the protein region centred at the site of insertion, and a deletion triggers 4.77 ± 1.03 substitutions within such a region. The difference between these values is probably owing to a higher fraction of effectively neutral insertions. Almost all of the triggered amino acid substitutions can be attributed to positive selection, and most of them occur relatively soon after the triggering indel and take place upstream of its site. A high fraction of substitutions that follow an indel occur at previously conserved sites, suggesting that an indel substantially changes selection that shapes the protein region around it. Thus, an indel is often followed by an adaptive walk of length that is in agreement with the theory of molecular adaptation.


2021 ◽  
Author(s):  
Claudio Casola ◽  
Jingjia Li

AbstractBackgroundThe recurrent evolution of the C4 photosynthetic pathway in angiosperms represents one of the most extraordinary examples of convergent evolution of a complex trait. Comparative genomic analyses have unveiled some of the molecular changes associated with the C4 pathway. For instance, several key enzymes involved in the transition from C3 to C4 photosynthesis have been found to share convergent amino acid replacements along C4 lineages. However, the extent of convergent replacements potentially associated with the emergence of C4 plants remains to be fully assessed. Here, we introduced a robust empirical approach to test molecular convergence along a phylogeny including multiple C3 and C4 taxa. By analyzing proteins encoded by chloroplast genes, we tested if convergent replacements occurred more frequently than expected in C4 lineages compared to C3 lineages. Furthermore, we sought to determine if convergent evolution occurred in multiple chloroplast proteins beside the well-known case of the large RuBisCO subunit encoded by the chloroplast gene rbcL.MethodsOur study was based on the comparative analysis of 43 C4 and 21 C3 grass species belonging to the PACMAD clade, a focal taxonomic group in many investigations of C4 evolution. We first used protein sequences of 67 orthologous chloroplast genes to build an accurate phylogeny of these species. Then, we inferred amino acid replacements along 13 C4 lineages and 9 C3 lineages using reconstructed protein sequences of their ancestral branches, corresponding to the most recent common ancestor of each lineage. Pairwise comparisons between ancestral branches allowed us to identify both convergent and divergent amino acid replacements between C4-C4, C3-C3 and C3-C4 lineages.ResultsThe reconstructed phylogenetic tree of 64 PACMAD grasses was characterized by strong supports in all nodes used for analyses of convergence. We identified 217 convergent replacements and 201 divergent replacements in 45/67 chloroplast proteins in both C4 and C3 ancestral branches. Pairs of C4-C4 ancestral branches showed higher levels of convergent replacements than C3-C3 and C3-C4 pairs. Furthermore, we found that more proteins shared unique convergent replacements in C4 lineages, with both RbcL and RpoC1 (the RNA polymerase beta’ subunit 1) showing a significantly higher convergent/divergent replacements ratio in C4 branches. Notably, significantly more C4-C4 pairs of ancestral branches showed higher numbers of convergent vs. divergent replacements than C3-C3 and C3-C4 pairs. Our results demonstrated that, in the PACMAD clade, C4 grasses experienced higher levels of molecular convergence than C3 species across multiple chloroplast genes. These findings have important implications for both our understanding of the evolution of photosynthesis and the goal of engineering improved crop varieties that integrates components of the C4 pathway.


2016 ◽  
Vol 6 (2) ◽  
pp. 211-216
Author(s):  
Rebecca J. Stoll ◽  
Grace R. Thompson ◽  
Mohammad D. Samy ◽  
George Blanck

2018 ◽  
Vol 115 (8) ◽  
pp. 1865-1870 ◽  
Author(s):  
Xiaojia Zhu ◽  
Yuyan Guan ◽  
Anthony V. Signore ◽  
Chandrasekhar Natarajan ◽  
Shane G. DuBay ◽  
...  

When different species experience similar selection pressures, the probability of evolving similar adaptive solutions may be influenced by legacies of evolutionary history, such as lineage-specific changes in genetic background. Here we test for adaptive convergence in hemoglobin (Hb) function among high-altitude passerine birds that are native to the Qinghai-Tibet Plateau, and we examine whether convergent increases in Hb–O2 affinity have a similar molecular basis in different species. We documented that high-altitude parid and aegithalid species from the Qinghai-Tibet Plateau have evolved derived increases in Hb–O2 affinity in comparison with their closest lowland relatives in East Asia. However, convergent increases in Hb–O2 affinity and convergence in underlying functional mechanisms were seldom attributable to the same amino acid substitutions in different species. Using ancestral protein resurrection and site-directed mutagenesis, we experimentally confirmed two cases in which parallel substitutions contributed to convergent increases in Hb–O2 affinity in codistributed high-altitude species. In one case involving the ground tit (Parus humilis) and gray-crested tit (Lophophanes dichrous), parallel amino acid replacements with affinity-enhancing effects were attributable to nonsynonymous substitutions at a CpG dinucleotide, suggesting a possible role for mutation bias in promoting recurrent changes at the same site. Overall, most altitude-related changes in Hb function were caused by divergent amino acid substitutions, and a select few were caused by parallel substitutions that produced similar phenotypic effects on the divergent genetic backgrounds of different species.


2019 ◽  
Vol 374 (1777) ◽  
pp. 20180244 ◽  
Author(s):  
Fábio K. Mendes ◽  
Andrew P. Livera ◽  
Matthew W. Hahn

Accurate inferences of convergence require that the appropriate tree topology be used. If there is a mismatch between the tree a trait has evolved along and the tree used for analysis, then false inferences of convergence (‘hemiplasy’) can occur. To avoid problems of hemiplasy when there are high levels of gene tree discordance with the species tree, researchers have begun to construct tree topologies from individual loci. However, due to intralocus recombination, even locus-specific trees may contain multiple topologies within them. This implies that the use of individual tree topologies discordant with the species tree can still lead to incorrect inferences about molecular convergence. Here, we examine the frequency with which single exons and single protein-coding genes contain multiple underlying tree topologies, in primates and Drosophila , and quantify the effects of hemiplasy when using trees inferred from individual loci. In both clades, we find that there are most often multiple diagnosable topologies within single exons and whole genes, with 91% of Drosophila protein-coding genes containing multiple topologies. Because of this underlying topological heterogeneity, even using trees inferred from individual protein-coding genes results in 25% and 38% of substitutions falsely labelled as convergent in primates and Drosophila , respectively. While constructing local trees can reduce the problem of hemiplasy, our results suggest that it will be difficult to completely avoid false inferences of convergence. We conclude by suggesting several ways forward in the analysis of convergent evolution, for both molecular and morphological characters. This article is part of the theme issue ‘Convergent evolution in the genomics era: new insights and directions’.


2002 ◽  
Vol 76 (4) ◽  
pp. 1718-1730 ◽  
Author(s):  
Aaron C. Brault ◽  
Ann M. Powers ◽  
Edward C. Holmes ◽  
C. H. Woelk ◽  
Scott C. Weaver

ABSTRACT Epidemic-epizootic Venezuelan equine encephalitis (VEE) viruses (VEEV) have emerged repeatedly via convergent evolution from enzootic predecessors. However, previous sequence analyses have failed to identify common sets of nucleotide or amino acid substitutions associated with all emaergence events. During 1993 and 1996, VEEV subtype IE epizootics occurred on the Pacific Coast of the states of Chiapas and Oaxaca in southern Mexico. Like other epizootic VEEV strains, when inoculated into guinea pigs and mice, the Mexican isolates were no more virulent than closely related enzootic strains, complicating genetic studies of VEE emergence. Complete genomic sequences of 4 of the Mexican strains were determined and compared to those of closely related enzootic subtype IE isolates from Guatemala. The epizootic viruses were less than 2% different at the nucleotide sequence level, and phylogenetic relationships confirmed that the equine-virulent Mexican strains probably evolved from enzootic progenitors on the Pacific Coast of Mexico or Guatemala. Of 35 amino acids that varied among the Guatemalan and Mexican isolates, only 8 were predicted phylogenetically to have accompanied the phenotypic change. One mutation at position 117 of the E2 envelope glycoprotein, involving replacement of Glu by Lys, resulted in a small-plaque phenotype characteristic of epizootic VEEV strains. Analysis of additional E2 sequences from representative enzootic and epizootic VEEV isolates implicated similar surface charge changes in the emergence of previous South American epizootic phenotypes, indicating that E2 mutations are probably important determinants of the equine-virulent phenotype and of VEE emergence. Maximum-likelihood analysis indicated that one change at E2 position 213 has been influenced by positive selection and convergent evolution of the epizootic phenotype.


2019 ◽  
Vol 20 (2) ◽  
pp. 81-89 ◽  
Author(s):  
Yan Hao ◽  
Yanhua Qu ◽  
Gang Song ◽  
Fumin Lei

Adaptive convergent evolution, which refers to the same or similar phenotypes produced by species from independent lineages under similar selective pressures, has been widely examined for a long time. Accumulating studies on the adaptive convergent evolution have been reported from many different perspectives (cellular, anatomical, morphological, physiological, biochemical, and behavioral). Recent advances in the genomic technologies have demonstrated that adaptive convergence can arise from specific genetic mechanisms in different hierarchies, ranging from the same nucleotide or amino acid substitutions to the biological functions or pathways. Among these genetic mechanisms, the same amino acid changes in protein-coding genes play an important role in adaptive phenotypic convergence. Methods for detecting adaptive convergence at the protein sequence level have been constantly debated and developed. Here, we review recent progress on using genomic approaches to evaluate the genetic mechanisms of adaptive convergent evolution, summarize the research methods for identifying adaptive amino acid convergence, and discuss the future perspectives for researching adaptive convergent evolution.


2014 ◽  
Vol 8 ◽  
pp. BBI.S13076 ◽  
Author(s):  
Adam Zemla ◽  
Tanya Kostova ◽  
Rodion Gorchakov ◽  
Evgeniya Volkova ◽  
David W. C. Beasley ◽  
...  

A computational approach for identification and assessment of genomic sequence variability (GeneSV) is described. For a given nucleotide sequence, GeneSV collects information about the permissible nucleotide variability (changes that potentially preserve function) observed in corresponding regions in genomic sequences, and combines it with conservation/variability results from protein sequence and structure-based analyses of evaluated protein coding regions. GeneSV was used to predict effects (functional vs. non-functional) of 37 amino acid substitutions on the NS5 polymerase (RdRp) of dengue virus type 2 (DENV-2), 36 of which are not observed in any publicly available DENV-2 sequence. 32 novel mutants with single amino acid substitutions in the RdRp were generated using a DENV-2 reverse genetics system. In 81% (26 of 32) of predictions tested, GeneSV correctly predicted viability of introduced mutations. In 4 of 5 (80%) mutants with double amino acid substitutions proximal in structure to one another GeneSV was also correct in its predictions. Predictive capabilities of the developed system were illustrated on dengue RNA virus, but described in the manuscript a general approach to characterize real or theoretically possible variations in genomic and protein sequences can be applied to any organism.


2019 ◽  
Vol 374 (1777) ◽  
pp. 20180246 ◽  
Author(s):  
Lu Yang ◽  
Nitin Ravikanthachari ◽  
Ricardo Mariño-Pérez ◽  
Riddhi Deshmukh ◽  
Mariana Wu ◽  
...  

The repeated evolutionary specialization of distantly related insects to cardenolide-containing host plants provides a stunning example of parallel adaptation. Hundreds of herbivorous insect species have independently evolved insensitivity to cardenolides, which are potent inhibitors of the alpha-subunit of Na + ,K + -ATPase (ATPα). Previous studies investigating ATPα-mediated cardenolide insensitivity in five insect orders have revealed remarkably high levels of parallelism in the evolution of this trait, including the frequent occurrence of parallel amino acid substitutions at two sites and recurrent episodes of duplication followed by neo-functionalization. Here we add data for a sixth insect order, Orthoptera, which includes an ancient group of highly aposematic cardenolide-sequestering grasshoppers in the family Pyrgomorphidae. We find that Orthopterans exhibit largely predictable patterns of evolution of insensitivity established by sampling other insect orders. Taken together the data lend further support to the proposal that negative pleiotropic constraints are a key determinant in the evolution of cardenolide insensitivity in insects. Furthermore, analysis of our expanded taxonomic survey implicates positive selection acting on site 111 of cardenolide-sequestering species with a single-copy of ATPα, and sites 115, 118 and 122 in lineages with neo-functionalized duplicate copies, all of which are sites of frequent parallel amino acid substitution. This article is part of the theme issue ‘Convergent evolution in the genomics era: new insights and directions’.


Author(s):  
Yan Hao ◽  
Ying Xiong ◽  
Yalin Cheng ◽  
Gang Song ◽  
Chenxi Jia ◽  
...  

High-altitude environments present strong stresses for living organisms, which have driven striking phenotypic and genetic adaptations. While previous studies have revealed multiple genetic adaptations in high-altitude species, how evolutionary history (i.e., phylogenetic background) contributes to similarity in genetic adaptations to high-altitude environments is largely unknown, in particular in a group of birds. We explored this in 3 high-altitude passerine birds from the Qinghai-Tibet Plateau and their low-altitude relatives in lowland eastern China. We generated transcriptomic data for 5 tissues across these species and compared sequence changes and expression shifts between high- and low-altitude pairs. Sequence comparison revealed that similarity in all 3 high-altitude species was high for genes under positive selection (218 genes) but low in amino acid substitutions (only 4 genes sharing identical amino acid substitutions). Expression profiles for all genes identified a tissue-specific expression pattern (i.e., all species clustered by tissue). By contrast, an altitude-related pattern was observed in genes differentially expressed between all 3 species pairs and genes associated with altitude, suggesting that the high-altitude environment may drive similar expression shifts in the 3 high-altitude species. Gene expression level, gene connectivity, and the interactions of these 2 factors with altitude were correlated with evolutionary rates. Our results provide evidence for how gene sequence changes and expression shifts work in a concerted way in a group of high-altitude birds, leading to similar evolution routes in response to high-altitude environmental stresses.


Sign in / Sign up

Export Citation Format

Share Document