Independent evolution of tetraloop in enterovirus oriL replicative element and its putative binding partners in protein 3C
Background. Enteroviruses are small non-enveloped viruses with (+) ssRNA genome with one open reading frame. Enterovirus protein 3C (or 3CD for some species) binds replicative element oriL to initiate replication. The replication of enteroviruses features low fidelity that allows virus to adapt to changing environment on the one hand, and requires additional mechanisms to maintain the genome stability on the other. Structural disturbances in the apical region of oriL domain d can be compensated by amino acid substitutions in positions 154 or 156 of 3C (amino acid numeration corresponds to poliovirus 3C), thus suggesting co-evolution of these interacting sequences in nature. The aim of this work was to understand co-evolution patterns of two interacting replication machinery elements in enteroviruses, the apical region of oriL domain d and its putative binding partners in the 3C protein. Methods.To evaluate the variability of the domain d loop sequence we retrieved all available full enterovirus sequences (>6400 nucleotides) that were present in the NCBI database on February 2017 and analyzed variety and abundance of sequences in domain d of replicative element oriL and in the protein 3C. Results. A total of 2842 full genome sequences were analyzed. Majority of domain d apical loops were tetraloops, which belonged to consensus YNHG (Y=U/C, N=any nucleotide, H=A/C/U). Putative RNA-binding tripeptide 154-156 (Enterovirus C 3C protein numeration) was less diverse than the apical domain d loop region and, in contrast to it, was species-specific. Discussion. Despite RNA-binding tripeptide is suggested to interact with apical region of domain d, they evolve independently in nature. Together, our data indicates plastic evolution of both interplayers of 3C-oriL recognition.