scholarly journals Genome-Wide Patterns of Nucleotide Substitution Reveal Stringent Functional Constraints on the Protein Sequences of Thermophiles

Genetics ◽  
2004 ◽  
Vol 167 (3) ◽  
pp. 1507-1512 ◽  
Author(s):  
Robert Friedman ◽  
John W. Drake ◽  
Austin L. Hughes
2021 ◽  
Vol 49 (3) ◽  
pp. 1497-1516
Author(s):  
Wilfried M Guiblet ◽  
Marzia A Cremona ◽  
Robert S Harris ◽  
Di Chen ◽  
Kristin A Eckert ◽  
...  

Abstract Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.


Microbiology ◽  
2003 ◽  
Vol 149 (7) ◽  
pp. 1849-1858 ◽  
Author(s):  
Emily A. L. Thompson ◽  
Ian M. Feavers ◽  
Martin C. J. Maiden

Meningococcal FetA (FrpB), an iron-regulated outer-membrane protein and vaccine component, was shown to be highly diverse: a total of 60 fetA alleles, encoding 56 protein sequences, were identified from 107 representative Neisseria meningitidis isolates. Phylogenetic analysis established that the allelic variants had been generated by both point mutation and horizontal genetic exchange. Nucleotide substitution was unevenly distributed in the gene, which contained both conserved and variable sequence regions. The most conserved region of the translated peptide sequence corresponded to an amino-terminal domain of the protein and the most diverse region to a previously identified variable region (VR). A nomenclature system for the peptides encoded by the VR was devised which classified 24 variants into 5 FetA variant families. On the basis of these data, murine polyclonal sera specific for four FetA variants were generated. The reactivities of these sera in whole-cell ELISA experiments were consistent with the hypothesis that the VR encoded an immunodominant epitope and indicated that the sera reacted mainly with variants against which they were raised. The diversity of this protein is likely to limit its effectiveness as a vaccine component.


2019 ◽  
Vol 35 (16) ◽  
pp. 2790-2795 ◽  
Author(s):  
Jan Ludwiczak ◽  
Aleksander Winski ◽  
Krzysztof Szczepaniak ◽  
Vikram Alva ◽  
Stanislaw Dunin-Horkawicz

Abstract Motivation Coiled coils are protein structural domains that mediate a plethora of biological interactions, and thus their reliable annotation is crucial for studies of protein structure and function. Results Here, we report DeepCoil, a new neural network-based tool for the detection of coiled-coil domains in protein sequences. In our benchmarks, DeepCoil significantly outperformed current state-of-the-art tools, such as PCOILS and Marcoil, both in the prediction of canonical and non-canonical coiled coils. Furthermore, in a scan of the human genome with DeepCoil, we detected many coiled-coil domains that remained undetected by other methods. This higher sensitivity of DeepCoil should make it a method of choice for accurate genome-wide detection of coiled-coil domains. Availability and implementation DeepCoil is written in Python and utilizes the Keras machine learning library. A web server is freely available at https://toolkit.tuebingen.mpg.de/#/tools/deepcoil and a standalone version can be downloaded at https://github.com/labstructbioinf/DeepCoil. Supplementary information Supplementary data are available at Bioinformatics online.


2021 ◽  
Author(s):  
Nima Wang ◽  
Dalei Wang

ABSTRACTOne gene could be transcribed to different RNA isoforms, and then produce various forms of protein sequences. This mechanism largely diversifies the cellular pool and allows natural selection to select from a wider range of substrates. Most of the deleterious changes should be either purged or only be observed in patients with deficiencies or diseases. In the cancer field, the “intra-gene” changes between tumor and normal tissues such as the alternative splicing, stop codon read-through, or protein domestication could not be captured by differential expression analyses. In this work, we collected public transcriptome and translatome data from ten patients with liver cancer, and performed genome-wide comparison on the stop codon read-through and protein domestication events. Both events could diversify the proteome without changing the genome sequence. Surprisingly, we found that compared to normal tissues, the tumor tissues globally have significantly higher occurrence of stop codon read-through events. Similarly, the translation signals of non-coding repetitive elements (protein domestication) are elevated in tumor samples. These read-through and domestication events show limited overlapping across the ten patients, suggesting the randomness of the occurrence. It also indicates that these tumor-specific read-through and domestication events should be deleterious, and should be purged by natural selection if they are not collected timely. Our work manifests the role of protein extension and domestication in liver cancer oncogenesis, and adds new aspects to the cancer field.


2021 ◽  
Author(s):  
Zhang Zhang

KaKs_Calculator 3.0 is an updated toolkit that is capable for calculating selective pressure on both coding and non-coding sequences. Similar to the nonsynonymous/synonymous substitution rate ratio for coding sequences, selection on non-coding sequences can be quantified as non-coding nucleotide substitution rate normalized by synonymous substitution rate of adjacent coding sequences. As testified on empirical data, it shows effectiveness to detect the strength and mode of selection operated on molecular sequences, accordingly demonstrating its great potential to achieve genome-wide scan of natural selection on diverse sequences and identification of potentially functional elements at whole genome scale. The package of KaKs_Calculator 3.0 is freely available for academic use only at https://ngdc.cncb.ac.cn/biocode/tools/BT000001.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Yannick Mesrouze ◽  
Gustavo Aguilar ◽  
Fedir Bokhovchuk ◽  
Typhaine Martin ◽  
Clara Delaunay ◽  
...  

Abstract The most downstream elements of the Hippo pathway, the TEAD transcription factors, are regulated by several cofactors, such as Vg/VGLL1-3. Earlier findings on human VGLL1 and here on human VGLL3 show that these proteins interact with TEAD via a conserved amino acid motif called the TONDU domain. Surprisingly, our studies reveal that the TEAD-binding domain of Drosophila Vg and of human VGLL2 is more complex and contains an additional structural element, an Ω-loop, that contributes to TEAD binding. To explain this unexpected structural difference between proteins from the same family, we propose that, after the genome-wide duplications at the origin of vertebrates, the Ω-loop present in an ancestral VGLL gene has been lost in some VGLL variants. These findings illustrate how structural and functional constraints can guide the evolution of transcriptional cofactors to preserve their ability to compete with other cofactors for binding to transcription factors.


Author(s):  
Naureen Z ◽  
◽  
Maqsood H ◽  
Mazhar MW ◽  
Mehmood J ◽  
...  

The BR-responsive genes are then regulated by BRASSINAZOLERESISTANT (BZR) Transcription Factors (TFs). As BRs possess numerous stress-resistant functions, BZR TFs also show activities of stress-resistance along with other developmental functions. Up to 88% similarity of protein sequences has been observed between BZR1 and BZR2 genes. Many positive roles of BZR TFs have been revealed by many studies in positively regulating the BR signal transduction in rice and maize family, but there is a very limited research is available on the BZR gene family of Zea mays. The aim of this study is to perform a wide-genome analysis of BZR1 transcription factor in Zea mays so that regulatory role of BZR TFs in BR-induced signaling pathway can be revealed. Gene structure analysis revealed the information about exons and introns. Phylogenetic analysis was done to identify the maximum likelihood among different families of BZR genes. Restriction analysis provided the information about the presence of restriction sites in Zea mays genome.


Sign in / Sign up

Export Citation Format

Share Document