Combing Transcriptomes for Secrets of Deep-Sea Survival: Environmental Diversity Drives Patterns of Protein Evolution

J R Winnikoff; W R Francis; E V Thuesen; S H D Haddock

doi:10.1093/icb/icz063

Combing Transcriptomes for Secrets of Deep-Sea Survival: Environmental Diversity Drives Patterns of Protein Evolution

Integrative and Comparative Biology ◽

10.1093/icb/icz063 ◽

2019 ◽

Vol 59 (4) ◽

pp. 786-798 ◽

Cited By ~ 3

Author(s):

J R Winnikoff ◽

W R Francis ◽

E V Thuesen ◽

S H D Haddock

Keyword(s):

Amino Acid ◽

Amino Acid Level ◽

Hydrophobic Effect ◽

Extreme Environments ◽

Environmental Parameters ◽

Acid Sites ◽

Temperature Adaptation ◽

Environmental Diversity ◽

Cofactor Binding ◽

Ideal System

AbstractCtenophores, also known as comb jellies, live across extremely broad ranges of temperature and hydrostatic pressure in the ocean. Because various ctenophore lineages adapted independently to similar environmental conditions, Phylum Ctenophora is an ideal system for the study of protein adaptation to extreme environments in a comparative framework. We present such a study here, using a phylogenetically-informed method to compare sequences of four essential metabolic enzymes across gradients of habitat depth and temperature. This method predicts convergent adaptation to these environmental parameters at the amino acid level, providing a novel view of protein adaptation to extreme environments and demonstrating the power and relevance of phylogenetic comparison applied to multi-species transcriptomic datasets from early-diverging metazoa. Across all four enzymes analyzed, 46 amino acid sites were associated with depth-adaptation, 59 with temperature-adaptation, and 56 with both. Sites predicted to be depth- and temperature-adaptive occurred consistently near Rossmann fold cofactor binding motifs and disproportionately in solvent-exposed regions of the protein. These results suggest that the hydrophobic effect and ligand binding may mediate efficient enzyme function at different hydrostatic pressures and temperatures. Using predicted adaptive site maps, such mechanistic hypotheses can now be tested via mutagenesis.

Download Full-text

Molecular convergent and parallel evolution among four high-elevation anuran species from the Tibetan region

BMC Genomics ◽

10.1186/s12864-020-07269-4 ◽

2020 ◽

Vol 21 (1) ◽

Author(s):

Bin Lu ◽

Hong Jin ◽

Jinzhong Fu

Keyword(s):

Amino Acid ◽

Positive Selection ◽

Convergent Evolution ◽

Parallel Evolution ◽

Amino Acid Level ◽

High Elevation ◽

Acid Sites ◽

Anuran Species ◽

Close Relatives ◽

Relative Prevalence

Abstract Background To date, evidence for the relative prevalence or rarity of molecular convergent and parallel evolution is conflicting, and understanding of how these processes contribute to adaptation is limited. We compared four high-elevation anuran species (Bufo tibetanus, Nanorana parkeri, Rana kukunoris and Scutiger boulengeri) from the Tibetan region, and examined convergent and parallel amino acid substitutions between them and how they may have contributed to high-elevation adaptation. Results Genomic data of the four high-elevation species and eight of their low-elevation close relatives were gathered. A total of 1098 orthologs shared by all species were identified. We first conducted pairwise comparisons using Zhang and Kumar’s test. Then, the Rconv index was calculated and convergence/divergence correlation plotting was conducted. Furthermore, genes under positive selection and with elevated evolutionary rate were examined. We detected a large number of amino acid sites with convergent or parallel substitutions. Several pairs of high-elevation species, in particular, R. kukunoris vs N. parkeri and B. tibetanus vs S. boulengeri, had excessive amounts of convergent substitutions compared to neutral expectation. Nevertheless, these sites were mostly concentrated in a small number of genes (3–32), and no genome-wide convergence was detected. Furthermore, the majority of these convergent genes were neither under detectable positive selection nor had elevated evolutionary rates, although functional prediction analysis suggested some of the convergent genes could potentially contribute to high-elevation adaptation. Conclusions There is a substantial amount of convergent evolution at the amino-acid level among high-elevation amphibians, although these sites are concentrated in a few genes, not widespread across the genomes. This may attribute to the fact that all the target species are from the same environment. The relative prevalence of convergent substitutions among high-elevation amphibians provides an excellent opportunity for further study of molecular convergent evolution.

Download Full-text

Evolution of HLA Class II Molecules: Allelic and Amino Acid Site Variability Across Populations

Genetics ◽

10.1093/genetics/152.1.393 ◽

1999 ◽

Vol 152 (1) ◽

pp. 393-400 ◽

Cited By ~ 1

Author(s):

Hugh Salamon ◽

William Klitz ◽

Simon Easteal ◽

Xiaojiang Gao ◽

Henry A Erlich ◽

...

Keyword(s):

Amino Acid ◽

Acid Site ◽

Balancing Selection ◽

Amino Acid Level ◽

Class Ii ◽

Acid Sites ◽

Hla Class Ii ◽

Amino Acid Site ◽

Significant Difference ◽

Hla Class Ii Molecules

Abstract Analysis of the highly polymorphic β1 domains of the HLA class II molecules encoded by the DRB1, DQB1, and DPB1 loci reveals contrasting levels of diversity at the allele and amino acid site levels. Statistics of allele frequency distributions, based on Watterson’s homozygosity statistic F, reveal distinct evolutionary patterns for these loci in ethnically diverse samples (26 populations for DQB1 and DRB1 and 14 for DPB1). When examined over all populations, the DQB1 locus allelic variation exhibits striking balanced polymorphism (P < 10-4), DRB1 shows some evidence of balancing selection (P < 0.06), and while there is overall very little evidence for selection of DPB1 allele frequencies, there is a trend in the direction of balancing selection (P < 0.08). In contrast, at the amino acid level all three loci show strong evidence of balancing selection at some sites. Averaged over polymorphic amino acid sites, DQB1 and DPB1 show similar deviation from neutrality expectations, and both exhibit more balanced polymorphic amino acid sites than DRB1. Across ethnic groups, polymorphisms at many codons show evidence for balancing selection, yet data consistent with directional selection were observed at other codons. Both antigen-binding pocket- and non-pocket-forming amino acid sites show overall deviation from neutrality for all three loci. Only in the case of DRB1 was there a significant difference between pocket- and non-pocket-forming amino acid sites. Our findings indicate that balancing selection at the MHC occurs at the level of polymorphic amino acid residues, and that in many cases this selection is consistent across populations.

Download Full-text

Comparative functional genomics analysis of cytochrome P450 gene superfamily in wheat and maize

10.21203/rs.2.11228/v2 ◽

2020 ◽

Author(s):

YiXuan Li ◽

Kaifa Wei

Keyword(s):

Amino Acid ◽

Stress Responses ◽

Molecular Mechanisms ◽

Genetic Manipulation ◽

Expression Profiles ◽

Functional Divergence ◽

Amino Acid Level ◽

Evolutionary Relationship ◽

Acid Sites ◽

Preferential Expression

Abstract Background: The cytochrome P450s (CYP450s) as the largest enzyme family of plant metabolism participate in various physiological processes, whereas no study has demonstrated interest in comprehensive comparison of the genes in wheat and maize. Genome-wide survey, characterization and comparison of wheat and maize CYP450 gene superfamily are useful for genetic manipulation of the Gramineae crops.Results: In total, 1285 and 263 full-length CYP450s were identified in wheat and maize, respectively. According to standard nomenclature, wheat CYP450s (TaCYP450s) were categorized into 45 families, while maize CYP450s (ZmCYP450s) into 43 families. A comprehensive analysis of wheat and maize CYP450s, involved in functional domains, conserved motifs, phylogeny, gene structures, chromosome locations and duplicated events was performed. The result showed that each family/subfamily in both species exhibited characteristic features, suggesting their phylogenetic relationship and the potential divergence in their functions. Functional divergence analysis at the amino acid level of representative clans CYP51, CYP74 and CYP97 in wheat, maize and rice identified some critical amino acid sites that are responsible for functional divergence of a gene family. Expression profiles of Ta-, ZmCYP450s were investigated using RNA-seq data, which contribute to infer the potential functions of the genes during development and stress responses. We found in both species CYP450s had preferential expression in specific tissues, and many tissue-specific genes were identified. Under water-deficit condition, 82 and 39 significantly differentially expressed CYP450s were respectively detected in wheat and maize. These genes may have some role in protecting plants against drought damage. Thereinto, fourteen CYP450s were selected to validate their expression level through qRT-PCR. To further elucidating molecular mechanisms of CYP450 action, gene co-expression network was constructed. In total, 477 TaCYP450s were distributed in 22 co-expression modules, and some co-expressed genes that likely take part in the same biochemical pathway were identified. For instance, the expression of TaCYP74A98_4D was highly correlated with TaLOX9, TaLOX36, TaLOX39, TaLOX44 and TaOPR8, and all of them may be involved in jasmonate (JA) biosynthesis. TaCYP73A201_3A showed coexpression with TaPAL1.25, TaCCoAOMT1.2, TaCOMT.1, TaCCR1.6 and TaLAC5, which probably act in the wheat stem and/or root lignin synthesis pathway. Conclusion: Our study first established systematic information about evolutionary relationship, expression pattern and function characterization of CYP450s in wheat and maize.

Download Full-text

Comparative functional genomics analysis of cytochrome P450 gene superfamily in wheat and maize

10.21203/rs.2.11228/v1 ◽

2019 ◽

Author(s):

kaifa wei ◽

YiXuan Li

Keyword(s):

Amino Acid ◽

Genetic Manipulation ◽

Expression Profiles ◽

Functional Divergence ◽

Expression Patterns ◽

Amino Acid Level ◽

Rice Genome ◽

Acid Sites ◽

Drought Treatment ◽

Specific Expression

Abstract The cytochrome P450s (CYP450s) as the largest enzyme family of plant metabolism participate in various physiological processes, whereas no study has demonstrated interest in comprehensive comparison of the genes in wheat and maize. Genome-wide survey, characterization and comparison of wheat and maize CYP450 gene superfamily are useful for genetic manipulation of the Gramineae crops. Results In total, 1285 and 263 full-length CYP450s were identified in wheat and maize, respectively. According to standard nomenclature, wheat CYP450s (TaCYP450s) were categorized into 45 families, while maize CYP450s (ZmCYP450s) into 43 families. Of maize CYP450s, there are 108 duplicated gene pairs and 134 genes had their syntenic counterparts in rice genome, and a total of 140 duplication events were predicted in wheat CYP450s. Functional divergence analysis at the amino acid level of representative clans CYP51, CYP74 and CYP97 in wheat, maize and rice identified some critical amino acid sites that are responsible for functional divergence of a gene family. Expression profiles of Ta-, ZmCYP450s were investigated using RNA-seq data, and we found the majority of them showed tissue-specific expression patterns such as TaCYP51H39_5B in root; TaCYP74A98_4A in leaf at cotyledon emergence stage (LCE); TaCYP71C162_5A in fruit at whole plant fruit ripening stage (FR); ZmCYP707A5 in root; ZmCYP76B18 in tassel; ZmCYP78A130 in embryo. Under water-deficit condition, 82 and 39 significantly differentially expressed CYP450s were respectively detected in wheat and maize like TaCYP71Y18_1D, TaCYP707A5_6A, TaCYP71R11_1A, TaCYP97B4_6D, ZmCYP81N4, ZmCYP81A1 and ZmCYP71C62, of which TaCYP71Y18_1D and ZmCYP81N4 were the genes most highly induced by drought treatment. Fourteen CYP450s were selected to validate their expression level through qRT-PCR. In total, 477 TaCYP450s were distributed in 22 co-expression modules, and some co-expressed genes that likely take part in the same biochemical pathway were identified. For instance, the expression of TaCYP74A98_4D was highly correlated with TaLOX9, TaLOX36, TaLOX39, TaLOX44 and TaOPR8, and all of them may be involved in jasmonate (JA) biosynthesis. TaCYP73A201_3A showed coexpression with TaPAL1.25, TaCCoAOMT1.2, TaCOMT.1, TaCCR1.6 and TaLAC5, which probably act in the wheat stem and/or root lignin synthesis pathway.

Download Full-text

Polypeptides Folding: Rules for the Calculation of the Backbone Dihedral Angles φ starting from the Amino Acid Sequence

10.26434/chemrxiv.12000132 ◽

2020 ◽

Author(s):

Michele Larocca

Keyword(s):

Amino Acid ◽

Amino Acid Sequence ◽

Theoretical Approach ◽

Polypeptide Chain ◽

Hydrophobic Effect ◽

Side Chain ◽

Dihedral Angles ◽

Mechanical Forces ◽

Specific Chemical

<p>Protein folding is strictly related to the determination of the backbone dihedral angles and depends on the information contained in the amino acid sequence as well as on the hydrophobic effect. To date, the type of information embedded in the amino acid sequence has not yet been revealed. The present study deals with these problematics and aims to furnish a possible explanation of the information contained in the amino acid sequence, showing and reporting rules to calculate the backbone dihedral angles φ. The study is based on the development of mechanical forces once specific chemical interactions are established among the side chain of the residues in a polypeptide chain. It aims to furnish a theoretical approach to predict backbone dihedral angles which, in the future, may be applied to computational developments focused on the prediction of polypeptide structures.</p>

Download Full-text

Involvement of Cys69 residue in the catalytic mechanism of N-hydroxyarylamine O-acetyltransferase of Salmonella typhimurium. Sequence similarity at the amino acid level suggests a common catalytic mechanism of acetyltransferase for S. typhimurium and higher organisms.

Journal of Biological Chemistry ◽

10.1016/s0021-9258(18)42462-3 ◽

1992 ◽

Vol 267 (12) ◽

pp. 8429-8436 ◽

Cited By ~ 1

Author(s):

M Watanabe ◽

T Sofuni ◽

T Nohmi

Keyword(s):

Amino Acid ◽

Salmonella Typhimurium ◽

Acid Level ◽

Catalytic Mechanism ◽

Sequence Similarity ◽

Amino Acid Level

Download Full-text

Codon-Substitution Models for Heterogeneous Selection Pressure at Amino Acid Sites

Genetics ◽

10.1093/genetics/155.1.431 ◽

2000 ◽

Vol 155 (1) ◽

pp. 431-449 ◽

Cited By ~ 41

Author(s):

Ziheng Yang ◽

Rasmus Nielsen ◽

Nick Goldman ◽

Anne-Mette Krabbe Pedersen

Keyword(s):

Amino Acid ◽

Positive Selection ◽

Selective Pressure ◽

Acid Sites ◽

Data Sets ◽

Protein Coding ◽

Important Indicator ◽

Diversifying Selection ◽

Codon Substitution ◽

Neutral Mutations

AbstractComparison of relative fixation rates of synonymous (silent) and nonsynonymous (amino acid-altering) mutations provides a means for understanding the mechanisms of molecular sequence evolution. The nonsynonymous/synonymous rate ratio (ω = dN/dS) is an important indicator of selective pressure at the protein level, with ω = 1 meaning neutral mutations, ω < 1 purifying selection, and ω > 1 diversifying positive selection. Amino acid sites in a protein are expected to be under different selective pressures and have different underlying ω ratios. We develop models that account for heterogeneous ω ratios among amino acid sites and apply them to phylogenetic analyses of protein-coding DNA sequences. These models are useful for testing for adaptive molecular evolution and identifying amino acid sites under diversifying selection. Ten data sets of genes from nuclear, mitochondrial, and viral genomes are analyzed to estimate the distributions of ω among sites. In all data sets analyzed, the selective pressure indicated by the ω ratio is found to be highly heterogeneous among sites. Previously unsuspected Darwinian selection is detected in several genes in which the average ω ratio across sites is <1, but in which some sites are clearly under diversifying selection with ω > 1. Genes undergoing positive selection include the β-globin gene from vertebrates, mitochondrial protein-coding genes from hominoids, the hemagglutinin (HA) gene from human influenza virus A, and HIV-1 env, vif, and pol genes. Tests for the presence of positively selected sites and their subsequent identification appear quite robust to the specific distributional form assumed for ω and can be achieved using any of several models we implement. However, we encountered difficulties in estimating the precise distribution of ω among sites from real data sets.

Download Full-text

Salamander rods and cones contain distinct transducin alpha subunits

Visual Neuroscience ◽

10.1017/s0952523800176047 ◽

2000 ◽

Vol 17 (6) ◽

pp. 847-854 ◽

Cited By ~ 5

Author(s):

JAMES C. RYAN ◽

SERGEY ZNOIKO ◽

LIN XU ◽

ROSALIE K. CROUCH ◽

JIAN-XING MA

Keyword(s):

Amino Acid ◽

Blot Analysis ◽

Cellular Localization ◽

Amino Acid Level ◽

Single Copy ◽

Lower Vertebrate ◽

Rods And Cones ◽

Sequence Identity ◽

Outer Segments ◽

Cone Pigments

The mammalian retina is known to contain two distinct transducins that interact with their respective rod and cone pigments. However, there are no reports of a nonmammalian species having two distinct transducins. In the present study, we report the cloning and cellular localization of two transducin α subunits (Gαt) from the tiger salamander. Through degenerate polymerase chain reaction (PCR) and subsequent screening of a salamander retina cDNA library, we have identified two forms of Gαt. When compared to existing sequences in GenBank, the cloned subunits showed high similarity to rod and cone transducins. The salamander Gαt-1 has 91.2–93.7% amino acid sequence identity to mammalian rod Gαt subunits and 79.7–80.9% to mammalian cone Gαts. The salamander Gαt-2 has 86.2–87.9% sequence identity to mammalian cone Gαts and 78.9–80.9% to mammalian rod Gαts at the amino acid level. The Gαt-1 cDNA encodes 350 amino acids while the Gαt-2 cDNA encodes 354 residues, which is typical for rod and cone Gαts, respectively, and we thus identified the Gαt-1 as rod and Gαt-2 as cone Gαt. Sequences identified as effector binding sites and GTPase activity regions are highly conserved between the two subunits. Genomic Southern blot analysis showed that rod and cone Gαt subunits are both encoded by single-copy genes. Northern blot analysis identified retina-specific transcripts of 3.0 kb for rod Gαt and 2.6 kb for cone Gαt. Immunohistochemistry in the flat-mounted salamander retina demonstrated that rod Gαt is localized to rods, predominantly in the outer segments; similarly, cone Gαt is localized to cone outer segments. The results confirm that the two sequences encode rod and cone transducins and demonstrate that this lower vertebrate contains two distinct transducins that are localized specifically to rod and cone photoreceptors.

Download Full-text

Halogenation of glycopeptide antibiotics occurs at the amino acid level during non-ribosomal peptide synthesis

Chemical Science ◽

10.1039/c7sc00460e ◽

2017 ◽

Vol 8 (9) ◽

pp. 5992-6004 ◽

Cited By ~ 27

Author(s):

Tiia Kittilä ◽

Claudia Kittel ◽

Julien Tailhades ◽

Diane Butz ◽

Melanie Schoppet ◽

...

Keyword(s):

Amino Acid ◽

Peptide Synthesis ◽

Acid Level ◽

Amino Acid Level ◽

Antibiotic Biosynthesis ◽

Carrier Protein ◽

Glycopeptide Antibiotics ◽

Glycopeptide Antibiotic ◽

Protein Substrates

Halogenase enzymes involved in glycopeptide antibiotic biosynthesis accept aminoacyl-carrier protein substrates.

Download Full-text

Sequence of the rat factor VIII cDNA

Thrombosis and Haemostasis ◽

10.1160/th03-06-0336 ◽

2004 ◽

Vol 91 (01) ◽

pp. 38-42 ◽

Cited By ~ 6

Author(s):

Christof Geisen ◽

Erhard Seifried ◽

Johannes Oldenburg ◽

Matthias Watzka

Keyword(s):

Amino Acids ◽

Amino Acid ◽

Factor Viii ◽

Posttranslational Modifications ◽

Function Analysis ◽

Amino Acid Level ◽

Coagulation System ◽

Cleavage Sites ◽

Human Factor Viii ◽

Exon 20

SummaryFactorVIII acts as an essential compound of the tenase complex of the coagulation system. Herein we report the cDNA of the rat factor VIII. The rat cDNA comprises 6777 nucleotides and encodes a protein of 2258 amino acids, 61 amino acids less than mouse and 92 amino acids less than human factor VIII. The overall identity compared to human cDNA is 61% on the cDNA and 51% on the amino acid level. In cDNA, highest levels of sequence identity can be observed in the A and C domains (ranging between 68% and 73%), whereas B domain and the small acidic regions are more divergent (34%-49%). Compared to mouse and human most sites for posttranslational modifications such as sulfatation and glycosylation as well as thrombin and protein C cleavage sites are conserved in rat. Alternative transcripts lacking exon 17 and/or comprising additional 26 bp due to alternative splicing of exon 20 were found. Furthermore, 13 polymorphisms (seven in exon 14, one in exon 20, 23, 24, and 25, two in the 3’UTR) three of which lead to an amino acid exchange could be detected. Our findings might provide new insights into the structure-function analysis of the factor VIII protein and might prove useful for future animal models addressing the function of factor VIII.

Download Full-text