scholarly journals Combing Transcriptomes for Secrets of Deep-Sea Survival: Environmental Diversity Drives Patterns of Protein Evolution

2019 ◽  
Vol 59 (4) ◽  
pp. 786-798 ◽  
Author(s):  
J R Winnikoff ◽  
W R Francis ◽  
E V Thuesen ◽  
S H D Haddock

AbstractCtenophores, also known as comb jellies, live across extremely broad ranges of temperature and hydrostatic pressure in the ocean. Because various ctenophore lineages adapted independently to similar environmental conditions, Phylum Ctenophora is an ideal system for the study of protein adaptation to extreme environments in a comparative framework. We present such a study here, using a phylogenetically-informed method to compare sequences of four essential metabolic enzymes across gradients of habitat depth and temperature. This method predicts convergent adaptation to these environmental parameters at the amino acid level, providing a novel view of protein adaptation to extreme environments and demonstrating the power and relevance of phylogenetic comparison applied to multi-species transcriptomic datasets from early-diverging metazoa. Across all four enzymes analyzed, 46 amino acid sites were associated with depth-adaptation, 59 with temperature-adaptation, and 56 with both. Sites predicted to be depth- and temperature-adaptive occurred consistently near Rossmann fold cofactor binding motifs and disproportionately in solvent-exposed regions of the protein. These results suggest that the hydrophobic effect and ligand binding may mediate efficient enzyme function at different hydrostatic pressures and temperatures. Using predicted adaptive site maps, such mechanistic hypotheses can now be tested via mutagenesis.

BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Bin Lu ◽  
Hong Jin ◽  
Jinzhong Fu

Abstract Background To date, evidence for the relative prevalence or rarity of molecular convergent and parallel evolution is conflicting, and understanding of how these processes contribute to adaptation is limited. We compared four high-elevation anuran species (Bufo tibetanus, Nanorana parkeri, Rana kukunoris and Scutiger boulengeri) from the Tibetan region, and examined convergent and parallel amino acid substitutions between them and how they may have contributed to high-elevation adaptation. Results Genomic data of the four high-elevation species and eight of their low-elevation close relatives were gathered. A total of 1098 orthologs shared by all species were identified. We first conducted pairwise comparisons using Zhang and Kumar’s test. Then, the Rconv index was calculated and convergence/divergence correlation plotting was conducted. Furthermore, genes under positive selection and with elevated evolutionary rate were examined. We detected a large number of amino acid sites with convergent or parallel substitutions. Several pairs of high-elevation species, in particular, R. kukunoris vs N. parkeri and B. tibetanus vs S. boulengeri, had excessive amounts of convergent substitutions compared to neutral expectation. Nevertheless, these sites were mostly concentrated in a small number of genes (3–32), and no genome-wide convergence was detected. Furthermore, the majority of these convergent genes were neither under detectable positive selection nor had elevated evolutionary rates, although functional prediction analysis suggested some of the convergent genes could potentially contribute to high-elevation adaptation. Conclusions There is a substantial amount of convergent evolution at the amino-acid level among high-elevation amphibians, although these sites are concentrated in a few genes, not widespread across the genomes. This may attribute to the fact that all the target species are from the same environment. The relative prevalence of convergent substitutions among high-elevation amphibians provides an excellent opportunity for further study of molecular convergent evolution.


Genetics ◽  
1999 ◽  
Vol 152 (1) ◽  
pp. 393-400 ◽  
Author(s):  
Hugh Salamon ◽  
William Klitz ◽  
Simon Easteal ◽  
Xiaojiang Gao ◽  
Henry A Erlich ◽  
...  

Abstract Analysis of the highly polymorphic β1 domains of the HLA class II molecules encoded by the DRB1, DQB1, and DPB1 loci reveals contrasting levels of diversity at the allele and amino acid site levels. Statistics of allele frequency distributions, based on Watterson’s homozygosity statistic F, reveal distinct evolutionary patterns for these loci in ethnically diverse samples (26 populations for DQB1 and DRB1 and 14 for DPB1). When examined over all populations, the DQB1 locus allelic variation exhibits striking balanced polymorphism (P < 10-4), DRB1 shows some evidence of balancing selection (P < 0.06), and while there is overall very little evidence for selection of DPB1 allele frequencies, there is a trend in the direction of balancing selection (P < 0.08). In contrast, at the amino acid level all three loci show strong evidence of balancing selection at some sites. Averaged over polymorphic amino acid sites, DQB1 and DPB1 show similar deviation from neutrality expectations, and both exhibit more balanced polymorphic amino acid sites than DRB1. Across ethnic groups, polymorphisms at many codons show evidence for balancing selection, yet data consistent with directional selection were observed at other codons. Both antigen-binding pocket- and non-pocket-forming amino acid sites show overall deviation from neutrality for all three loci. Only in the case of DRB1 was there a significant difference between pocket- and non-pocket-forming amino acid sites. Our findings indicate that balancing selection at the MHC occurs at the level of polymorphic amino acid residues, and that in many cases this selection is consistent across populations.


2020 ◽  
Author(s):  
YiXuan Li ◽  
Kaifa Wei

Abstract Background: The cytochrome P450s (CYP450s) as the largest enzyme family of plant metabolism participate in various physiological processes, whereas no study has demonstrated interest in comprehensive comparison of the genes in wheat and maize. Genome-wide survey, characterization and comparison of wheat and maize CYP450 gene superfamily are useful for genetic manipulation of the Gramineae crops.Results: In total, 1285 and 263 full-length CYP450s were identified in wheat and maize, respectively. According to standard nomenclature, wheat CYP450s (TaCYP450s) were categorized into 45 families, while maize CYP450s (ZmCYP450s) into 43 families. A comprehensive analysis of wheat and maize CYP450s, involved in functional domains, conserved motifs, phylogeny, gene structures, chromosome locations and duplicated events was performed. The result showed that each family/subfamily in both species exhibited characteristic features, suggesting their phylogenetic relationship and the potential divergence in their functions. Functional divergence analysis at the amino acid level of representative clans CYP51, CYP74 and CYP97 in wheat, maize and rice identified some critical amino acid sites that are responsible for functional divergence of a gene family. Expression profiles of Ta-, ZmCYP450s were investigated using RNA-seq data, which contribute to infer the potential functions of the genes during development and stress responses. We found in both species CYP450s had preferential expression in specific tissues, and many tissue-specific genes were identified. Under water-deficit condition, 82 and 39 significantly differentially expressed CYP450s were respectively detected in wheat and maize. These genes may have some role in protecting plants against drought damage. Thereinto, fourteen CYP450s were selected to validate their expression level through qRT-PCR. To further elucidating molecular mechanisms of CYP450 action, gene co-expression network was constructed. In total, 477 TaCYP450s were distributed in 22 co-expression modules, and some co-expressed genes that likely take part in the same biochemical pathway were identified. For instance, the expression of TaCYP74A98_4D was highly correlated with TaLOX9, TaLOX36, TaLOX39, TaLOX44 and TaOPR8, and all of them may be involved in jasmonate (JA) biosynthesis. TaCYP73A201_3A showed coexpression with TaPAL1.25, TaCCoAOMT1.2, TaCOMT.1, TaCCR1.6 and TaLAC5, which probably act in the wheat stem and/or root lignin synthesis pathway. Conclusion: Our study first established systematic information about evolutionary relationship, expression pattern and function characterization of CYP450s in wheat and maize.


2019 ◽  
Author(s):  
kaifa wei ◽  
YiXuan Li

Abstract The cytochrome P450s (CYP450s) as the largest enzyme family of plant metabolism participate in various physiological processes, whereas no study has demonstrated interest in comprehensive comparison of the genes in wheat and maize. Genome-wide survey, characterization and comparison of wheat and maize CYP450 gene superfamily are useful for genetic manipulation of the Gramineae crops. Results In total, 1285 and 263 full-length CYP450s were identified in wheat and maize, respectively. According to standard nomenclature, wheat CYP450s (TaCYP450s) were categorized into 45 families, while maize CYP450s (ZmCYP450s) into 43 families. Of maize CYP450s, there are 108 duplicated gene pairs and 134 genes had their syntenic counterparts in rice genome, and a total of 140 duplication events were predicted in wheat CYP450s. Functional divergence analysis at the amino acid level of representative clans CYP51, CYP74 and CYP97 in wheat, maize and rice identified some critical amino acid sites that are responsible for functional divergence of a gene family. Expression profiles of Ta-, ZmCYP450s were investigated using RNA-seq data, and we found the majority of them showed tissue-specific expression patterns such as TaCYP51H39_5B in root; TaCYP74A98_4A in leaf at cotyledon emergence stage (LCE); TaCYP71C162_5A in fruit at whole plant fruit ripening stage (FR); ZmCYP707A5 in root; ZmCYP76B18 in tassel; ZmCYP78A130 in embryo. Under water-deficit condition, 82 and 39 significantly differentially expressed CYP450s were respectively detected in wheat and maize like TaCYP71Y18_1D, TaCYP707A5_6A, TaCYP71R11_1A, TaCYP97B4_6D, ZmCYP81N4, ZmCYP81A1 and ZmCYP71C62, of which TaCYP71Y18_1D and ZmCYP81N4 were the genes most highly induced by drought treatment. Fourteen CYP450s were selected to validate their expression level through qRT-PCR. In total, 477 TaCYP450s were distributed in 22 co-expression modules, and some co-expressed genes that likely take part in the same biochemical pathway were identified. For instance, the expression of TaCYP74A98_4D was highly correlated with TaLOX9, TaLOX36, TaLOX39, TaLOX44 and TaOPR8, and all of them may be involved in jasmonate (JA) biosynthesis. TaCYP73A201_3A showed coexpression with TaPAL1.25, TaCCoAOMT1.2, TaCOMT.1, TaCCR1.6 and TaLAC5, which probably act in the wheat stem and/or root lignin synthesis pathway.


2020 ◽  
Author(s):  
Michele Larocca

<p>Protein folding is strictly related to the determination of the backbone dihedral angles and depends on the information contained in the amino acid sequence as well as on the hydrophobic effect. To date, the type of information embedded in the amino acid sequence has not yet been revealed. The present study deals with these problematics and aims to furnish a possible explanation of the information contained in the amino acid sequence, showing and reporting rules to calculate the backbone dihedral angles φ. The study is based on the development of mechanical forces once specific chemical interactions are established among the side chain of the residues in a polypeptide chain. It aims to furnish a theoretical approach to predict backbone dihedral angles which, in the future, may be applied to computational developments focused on the prediction of polypeptide structures.</p>


Genetics ◽  
2000 ◽  
Vol 155 (1) ◽  
pp. 431-449 ◽  
Author(s):  
Ziheng Yang ◽  
Rasmus Nielsen ◽  
Nick Goldman ◽  
Anne-Mette Krabbe Pedersen

AbstractComparison of relative fixation rates of synonymous (silent) and nonsynonymous (amino acid-altering) mutations provides a means for understanding the mechanisms of molecular sequence evolution. The nonsynonymous/synonymous rate ratio (ω = dN/dS) is an important indicator of selective pressure at the protein level, with ω = 1 meaning neutral mutations, ω &lt; 1 purifying selection, and ω &gt; 1 diversifying positive selection. Amino acid sites in a protein are expected to be under different selective pressures and have different underlying ω ratios. We develop models that account for heterogeneous ω ratios among amino acid sites and apply them to phylogenetic analyses of protein-coding DNA sequences. These models are useful for testing for adaptive molecular evolution and identifying amino acid sites under diversifying selection. Ten data sets of genes from nuclear, mitochondrial, and viral genomes are analyzed to estimate the distributions of ω among sites. In all data sets analyzed, the selective pressure indicated by the ω ratio is found to be highly heterogeneous among sites. Previously unsuspected Darwinian selection is detected in several genes in which the average ω ratio across sites is &lt;1, but in which some sites are clearly under diversifying selection with ω &gt; 1. Genes undergoing positive selection include the β-globin gene from vertebrates, mitochondrial protein-coding genes from hominoids, the hemagglutinin (HA) gene from human influenza virus A, and HIV-1 env, vif, and pol genes. Tests for the presence of positively selected sites and their subsequent identification appear quite robust to the specific distributional form assumed for ω and can be achieved using any of several models we implement. However, we encountered difficulties in estimating the precise distribution of ω among sites from real data sets.


2000 ◽  
Vol 17 (6) ◽  
pp. 847-854 ◽  
Author(s):  
JAMES C. RYAN ◽  
SERGEY ZNOIKO ◽  
LIN XU ◽  
ROSALIE K. CROUCH ◽  
JIAN-XING MA

The mammalian retina is known to contain two distinct transducins that interact with their respective rod and cone pigments. However, there are no reports of a nonmammalian species having two distinct transducins. In the present study, we report the cloning and cellular localization of two transducin α subunits (Gαt) from the tiger salamander. Through degenerate polymerase chain reaction (PCR) and subsequent screening of a salamander retina cDNA library, we have identified two forms of Gαt. When compared to existing sequences in GenBank, the cloned subunits showed high similarity to rod and cone transducins. The salamander Gαt-1 has 91.2–93.7% amino acid sequence identity to mammalian rod Gαt subunits and 79.7–80.9% to mammalian cone Gαts. The salamander Gαt-2 has 86.2–87.9% sequence identity to mammalian cone Gαts and 78.9–80.9% to mammalian rod Gαts at the amino acid level. The Gαt-1 cDNA encodes 350 amino acids while the Gαt-2 cDNA encodes 354 residues, which is typical for rod and cone Gαts, respectively, and we thus identified the Gαt-1 as rod and Gαt-2 as cone Gαt. Sequences identified as effector binding sites and GTPase activity regions are highly conserved between the two subunits. Genomic Southern blot analysis showed that rod and cone Gαt subunits are both encoded by single-copy genes. Northern blot analysis identified retina-specific transcripts of 3.0 kb for rod Gαt and 2.6 kb for cone Gαt. Immunohistochemistry in the flat-mounted salamander retina demonstrated that rod Gαt is localized to rods, predominantly in the outer segments; similarly, cone Gαt is localized to cone outer segments. The results confirm that the two sequences encode rod and cone transducins and demonstrate that this lower vertebrate contains two distinct transducins that are localized specifically to rod and cone photoreceptors.


2017 ◽  
Vol 8 (9) ◽  
pp. 5992-6004 ◽  
Author(s):  
Tiia Kittilä ◽  
Claudia Kittel ◽  
Julien Tailhades ◽  
Diane Butz ◽  
Melanie Schoppet ◽  
...  

Halogenase enzymes involved in glycopeptide antibiotic biosynthesis accept aminoacyl-carrier protein substrates.


2004 ◽  
Vol 91 (01) ◽  
pp. 38-42 ◽  
Author(s):  
Christof Geisen ◽  
Erhard Seifried ◽  
Johannes Oldenburg ◽  
Matthias Watzka

SummaryFactorVIII acts as an essential compound of the tenase complex of the coagulation system. Herein we report the cDNA of the rat factor VIII. The rat cDNA comprises 6777 nucleotides and encodes a protein of 2258 amino acids, 61 amino acids less than mouse and 92 amino acids less than human factor VIII. The overall identity compared to human cDNA is 61% on the cDNA and 51% on the amino acid level. In cDNA, highest levels of sequence identity can be observed in the A and C domains (ranging between 68% and 73%), whereas B domain and the small acidic regions are more divergent (34%-49%). Compared to mouse and human most sites for posttranslational modifications such as sulfatation and glycosylation as well as thrombin and protein C cleavage sites are conserved in rat. Alternative transcripts lacking exon 17 and/or comprising additional 26 bp due to alternative splicing of exon 20 were found. Furthermore, 13 polymorphisms (seven in exon 14, one in exon 20, 23, 24, and 25, two in the 3’UTR) three of which lead to an amino acid exchange could be detected. Our findings might provide new insights into the structure-function analysis of the factor VIII protein and might prove useful for future animal models addressing the function of factor VIII.


Sign in / Sign up

Export Citation Format

Share Document