scholarly journals Evolutionary and codon usage preference insights into spike glycoprotein of SARS-CoV-2

Author(s):  
Yashpal Singh Malik ◽  
Mohd Ikram Ansari ◽  
Jobin Jose Kattoor ◽  
Rahul Kaushik ◽  
Shubhankar Sircar ◽  
...  

Abstract Interaction of SARS-CoV-2 spike glycoprotein with the ACE2 cell receptor is very crucial for virus attachment to human cells. Selected mutations in SARS-CoV-2 S-protein are reported to strengthen its binding affinity to mammalian ACE2. The N501T mutation in SARS-CoV-2-CTD furnishes better support to hotspot 353 in comparison with SARS-CoV and shows higher affinity for receptor binding. Recombination analysis exhibited higher recombination events in SARS-CoV-2 strains, irrespective of their geographical origin or hosts. Investigation further supports a common origin among SARS-CoV-2 and its predecessors, SARS-CoV and bat-SARS-like-CoV. The recombination events suggest a constant exchange of genetic material among the co-infecting viruses in possible reservoirs and human hosts before SARS-CoV-2 emerged. Furthermore, a comprehensive analysis of codon usage bias (CUB) in SARS-CoV-2 revealed significant CUB among the S-genes of different beta-coronaviruses governed majorly by natural selection and mutation pressure. Various indices of codon usage of S-genes helped in quantifying its adaptability in other animal hosts. These findings might help in identifying potential experimental animal models for investigating pathogenicity for drugs and vaccine development experiments.

2016 ◽  
Vol 49 (4) ◽  
pp. 477-486 ◽  
Author(s):  
Sheng-Fan Wang ◽  
Ming-Wei Su ◽  
Sung-Pin Tseng ◽  
Ming-Chun Li ◽  
Ching-Han Tsao ◽  
...  

2003 ◽  
Vol 11 (04) ◽  
pp. 407-418
Author(s):  
Fei Ma ◽  
Yonglong Zhuang ◽  
Yanda Li ◽  
Xiaofeng Xu ◽  
Xueping Chen

Genetic code is one of the most important biological languages in communications between DNA and protein, so peoples have been paying a great attention to the usage bias of synonymous codons. Based on Grosjean and Ikemura's "optimum combination of codon-anticodon complex" and "translation efficiency" hypotheses, in this paper, we put forward that a biased codon usage is identical to its corresponding complementary codon usage preference. To testify the hypothesis and reveal usage patterns between codons and corresponding complementary codons among different cellular organisms and organelles, the usage data of both codons and corresponding complementary ones from 28 cellular organisms and 20 organelles were analyzed. The results showed that: (1) there is a significantly positive correlation between codons and their complementary ones in most cellular organisms, chloroplasts and mitochondria; (2) all 32 single pairs codon versus complementary codon shared the likely usage correlation patterns, with the significantly positive, unrelated and significantly negative pair number of 18, 12 and 2 within 28 cellular organisms as well as 11, 17 and 4 within 20 organelles respectively, and some usage patterns of 32 single pairs codon versus complementary codon of cellular organisms are highly consistency with two kinds of organelles, which strongly implied that their codon usage has undergone the similar evolutionary selection in their wobbling and modification; (3) the codon-frequency tree agreed fairly well with the traditional one. These results demonstrated the validity of our hypothesis, and indicated the usefulness of correlation between codon and complementary codon in elucidating molecular evolutionary mechanisms.


2021 ◽  
Vol 12 ◽  
Author(s):  
Zhaobin Deng ◽  
Jun Wang ◽  
Wenjie Zhang ◽  
Yi Geng ◽  
Mingde Zhao ◽  
...  

The members of the family Iridoviridae are large, double-stranded DNA viruses that infect various hosts, including both vertebrates and invertebrates. Although great progress has been made in genomic and phylogenetic analyses, the adequacy of the existing criteria for classification within the Iridoviridae family remains unknown. In this study, we redetermined 23 Iridoviridae core genes by re-annotation, core-pan analysis and local BLASTN search. The phylogenetic tree based on the 23 re-annotated core genes (Maximum Likelihood, ML-Tree) and amino acid sequences (composition vector, CV-Tree) were found to be consistent with previous reports. Furthermore, the information provided by synteny analysis and codon usage preference (relative synonymous codon usage, correspondence analysis, ENC-plot and Neutrality plot) also supports the phylogenetic relationship. Collectively, our results will be conducive to understanding the genera demarcation within the Iridoviridae family based on genomic synteny and component (codon usage preference) and contribute to the existing taxonomy methods for the Iridoviridae family.


2021 ◽  
Vol 9 (6) ◽  
pp. 1111
Author(s):  
Jaedeok Kim ◽  
Aamir Lal ◽  
Eui-Joon Kil ◽  
Hae-Ryun Kwak ◽  
Hwan-Su Yoon ◽  
...  

Apple stem grooving virus (ASGV; genus Capillovirus) is an economically important virus. It has an approx. 6.5 kb, monopartite, linear, positive-sense, single-stranded RNA genome. The present study includes identification of 24 isolates—13 isolates from apple (Pyrus malus L.) and 11 isolates from pear (Pyrus communis L.)—from different agricultural fields in South Korea. The coat protein (CP) gene of the corresponding 23 isolates were amplified, sequenced, and analyzed. The CP sequences showed phylogenetic separation based on their host species, and not on the geography, indicating host adaptation. Further analysis showed that the ASGV isolated in this study followed host adaptation influenced and preferred by the host codon-usage.


2021 ◽  
Vol 9 (1) ◽  
pp. 81-89
Author(s):  
Robert Penner

Abstract Tools developed by Moderna, BioNTech/Pfizer, and Oxford/Astrazeneca, among others, provide universal solutions to previously problematic aspects of drug or vaccine delivery, uptake and toxicity, portending new tools across the medical sciences. A novel method is presented based on estimating protein backbone free energy via geometry to predict effective antiviral targets, antigens and vaccine cargos that are resistant to viral mutation. This method is reviewed and reformulated in light of the recent proliferation of structural data on the SARS-CoV-2 spike glycoprotein and its mutations in multiple lineages. Key findings include: collections of mutagenic residues reoccur across strains, suggesting cooperative convergent evolution; most mutagenic residues do not participate in backbone hydrogen bonds; metastability of the glyco-protein limits the change of free energy through mutation thereby constraining selective pressure; and there are mRNA or virus-vector cargos targeting low free energy peptides proximal to conserved high free energy peptides providing specific recipes for vaccines with greater specificity than the full-spike approach. These results serve to limit peptides in the spike glycoprotein with high mutagenic potential and thereby provide a priori constraints on viral and attendant vaccine evolution. Scientific and regulatory challenges to nucleic acid therapeutic and vaccine development and deployment are finally discussed.


Biomolecules ◽  
2021 ◽  
Vol 11 (6) ◽  
pp. 912
Author(s):  
Saadullah Khattak ◽  
Mohd Ahmar Rauf ◽  
Qamar Zaman ◽  
Yasir Ali ◽  
Shabeen Fatima ◽  
...  

The ongoing outbreak of coronavirus disease COVID-19 is significantly implicated by global heterogeneity in the genome organization of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The causative agents of global heterogeneity in the whole genome of SARS-CoV-2 are not well characterized due to the lack of comparative study of a large enough sample size from around the globe to reduce the standard deviation to the acceptable margin of error. To better understand the SARS-CoV-2 genome architecture, we have performed a comprehensive analysis of codon usage bias of sixty (60) strains to get a snapshot of its global heterogeneity. Our study shows a relatively low codon usage bias in the SARS-CoV-2 viral genome globally, with nearly all the over-preferred codons’ A.U. ended. We concluded that the SARS-CoV-2 genome is primarily shaped by mutation pressure; however, marginal selection pressure cannot be overlooked. Within the A/U rich virus genomes of SARS-CoV-2, the standard deviation in G.C. (42.91% ± 5.84%) and the GC3 value (30.14% ± 6.93%) points towards global heterogeneity of the virus. Several SARS-CoV-2 viral strains were originated from different viral lineages at the exact geographic location also supports this fact. Taking all together, these findings suggest that the general root ancestry of the global genomes are different with different genome’s level adaptation to host. This research may provide new insights into the codon patterns, host adaptation, and global heterogeneity of SARS-CoV-2.


Author(s):  
Amrita Banerjee ◽  
Dipannita Santra ◽  
Smarajit Maiti

AbstractThe recent outbreak by SARS-CoV-2 has generated a chaos in global health and economy and claimed/infected a large number of lives. Closely resembling with SARS CoV, the present strain has manifested exceptionally higher degree of spreadability, virulence and stability possibly due to some unidentified mutations. The viral spike glycoprotein is very likely to interact with host Angiotensin-Converting Enzyme 2 (ACE2) and transmits its genetic materials and hijacks host machinery with extreme fidelity for self propagation. Few attempts have been made to develop a suitable vaccine or ACE2 blocker or virus-receptor inhibitor within this short period of time. Here, attempt was taken to develop some therapeutic and vaccination strategies with a comparison of spike glycoproteins among SARS-CoV, MERS-CoV and the SARS-CoV-2. We verified their structure quality (SWISS-MODEL, Phyre2, Pymol) topology (ProFunc), motifs (MEME Suite, GLAM2Scan), gene ontology based conserved domain (InterPro database) and screened several epitopes (SVMTrip) of SARS CoV-2 based on their energetics, IC50 and antigenicity with regard to their possible glycosylation and MHC/paratopic binding (Vaxigen v2.0, HawkDock, ZDOCK Server) effects. We screened here few pairs of spike protein epitopic regions and selected their energetic, IC50, MHC II reactivity and found some of those to be very good target for vaccination. A possible role of glycosylation on epitopic region showed profound effects on epitopic recognition. The present work might be helpful for the urgent development of a suitable vaccination regimen against SARS CoV-2.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 145 ◽  
Author(s):  
Chloe Hyun-Jung Lee ◽  
Hashem Koohy

Background: The newly identified coronavirus known as 2019-nCoV has posed a serious global health threat. According to the latest report (18-February-2020), it has infected more than 72,000 people globally and led to deaths of more than 1,016 people in China. Methods: The 2019 novel coronavirus proteome was aligned to a curated database of viral immunogenic peptides. The immunogenicity of detected peptides and their binding potential to HLA alleles was predicted by immunogenicity predictive models and NetMHCpan 4.0. Results: We report in silico identification of a comprehensive list of immunogenic peptides that can be used as potential targets for 2019 novel coronavirus (2019-nCoV) vaccine development. First, we found 28 nCoV peptides identical to Severe acute respiratory syndrome-related coronavirus (SARS CoV) that have previously been characterized immunogenic by T cell assays. Second, we identified 48 nCoV peptides having a high degree of similarity with immunogenic peptides deposited in The Immune Epitope Database (IEDB). Lastly, we conducted a de novo search of 2019-nCoV 9-mer peptides that i) bind to common HLA alleles in Chinese and European population and ii) have T Cell Receptor (TCR) recognition potential by positional weight matrices and a recently developed immunogenicity algorithm, iPred, and identified in total 63 peptides with a high immunogenicity potential. Conclusions: Given the limited time and resources to develop vaccine and treatments for 2019-nCoV, our work provides a shortlist of candidates for experimental validation and thus can accelerate development pipeline.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8251 ◽  
Author(s):  
Zhanjun Wang ◽  
Beibei Xu ◽  
Bao Li ◽  
Qingqing Zhou ◽  
Guiyi Wang ◽  
...  

Euphorbiaceae plants are important as suppliers of biodiesel. In the current study, the codon usage patterns and sources of variance in chloroplast genome sequences of six different Euphorbiaceae plant species have been systematically analyzed. Our results revealed that the chloroplast genomes of six Euphorbiaceae plant species were biased towards A/T bases and A/T-ending codons, followed by detection of 17 identical high-frequency codons including GCT, TGT, GAT, GAA, TTT, GGA, CAT, AAA, TTA, AAT, CCT, CAA, AGA, TCT, ACT, TAT and TAA. It was found that mutation pressure was a minor factor affecting the variation of codon usage, however, natural selection played a significant role. Comparative analysis of codon usage frequencies of six Euphorbiaceae plant species with four model organisms reflected that Arabidopsis thaliana, Populus trichocarpa, and Saccharomyces cerevisiae should be considered as suitable exogenous expression receptor systems for chloroplast genes of six Euphorbiaceae plant species. Furthermore, it is optimal to choose Saccharomyces cerevisiae as the exogenous expression receptor. The outcome of the present study might provide important reference information for further understanding the codon usage patterns of chloroplast genomes in other plant species.


2021 ◽  
Author(s):  
Yanan Fu ◽  
Yanping Huang ◽  
Jingjing Rao ◽  
Feng Zeng ◽  
Ruiping Yang ◽  
...  

Abstract The outbreak of COVID-19, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections, spread across hosts from humans to animals, transmitting particularly effectively in mink. How SARS-CoV-2 selects and evolves in the host, and the differences in the evolution of different animals are still unclear. To analysis the mutation and codon usage bias of SARS-CoV-2 in infected humans and animals. The SARS-CoV-2 sequence in mink (Mink-SARS2) and binding energy with receptor were calculated compared with human. The relative synonymous codon usage of viral encoded gene was analyzed to characterize the differences and the evolutionary characteristics. A synonymous codon usage analysis showed that SARS-CoV-2 is optimized to adapt in the animals in which it is currently reported, and all of the animals showed decreased adaptability relative to that of humans, except for mink. The neutrality plot showed that the effect of natural selection on different SARS-CoV-2 sequences is stronger than mutation pressure. A binding affinity analysis indicated that the spike protein of the SARS-CoV-2 variant in mink showed a greater preference for binding with the mink receptor ACE2 than with the human receptor, especially as the mutation Y453F and N501T in Mink-SARS2 lead to improvement of binding affinity for mink receptor. In summary, mutations Y453F and N501T in Mink-SARS2 lead to improvement of binding affinity with mink receptor, indicating possible natural selection and current host adaptation. Monitoring the variation and codon bias of SARS-CoV-2 provides a theoretical basis for tracing the epidemic, evolution and cross-species spread of SARS-CoV-2.


Sign in / Sign up

Export Citation Format

Share Document