Toward High-Resolution de Novo Structure Prediction for Small Proteins

Science ◽  
2005 ◽  
Vol 309 (5742) ◽  
pp. 1868-1871 ◽  
Author(s):  
P. Bradley
2016 ◽  
Author(s):  
Ling-Hong Hung ◽  
Ram Samudrala

AbstractBackgroundMany rice protein sequences are very different from the sequence of proteins with known structures. Homology modeling is not possible for many rice proteins. However, it is possible to use computational intensive de novo techniques to obtain protein models when the protein sequence cannot be mapped to a protein of known structure. The Nutritious Rice for the World project generated 10 billion models encompassing more than 60,000 small proteins and protein domains for the rice strains Oryza sativa and Oryza japonica.FindingsOver a period of 1.5 years, the volunteers of World Community Grid supported by IBM generated 10 billion candidate structures, a task that would have taken a single CPU on the order of 10 millennia. For each protein sequence, 5 top structures were chosen using a novel clustering methodology developed for analyzing large datasets. These are provided along with the entire set of 10 billion conformers.ConclusionsWe anticipate that the centroid models will be of use in visualizing and determining the role of rice proteins where the function is unknown. The entire set of conformers is unique in terms of size and that they were derived from sequences that lack detectable homologs. Large sets of de novo conformers are rare and we anticipate that this set will be useful for benchmarking and developing new protein structure prediction methodologies.


2021 ◽  
Vol 22 (9) ◽  
pp. 4493
Author(s):  
Harrison Ndung’u Mwangi ◽  
Edward Kirwa Muge ◽  
Peter Waiganjo Wagacha ◽  
Albert Ndakala ◽  
Francis Jackim Mulaa

The development of novel anti-infectives against Kinetoplastids pathogens targeting proteins is a big problem occasioned by the antigenic variation in these parasites. This is also a global concern due to the zoonosis of these parasites, as they infect both humans and animals. Therefore, we need not only to create novel antibiotics, but also to speed up the development pipeline for these antibiotics. This may be achieved by using novel drug targets for Kinetoplastids drug discovery. In this study, we focused our attention on motifs of rRNA molecules that have been created using homology modeling. The RNA is the most ambiguous biopolymer in the kinetoplatid, which carries many different functions. For instance, tRNAs, rRNAs, and mRNAs are essential for gene expression both in the pro-and eukaryotes. However, all these types of RNAs have sequences with unique 3D structures that are specific for kinetoplastids only and can be used to shut down essential biochemical processes in kinetoplastids only. All these features make RNA very potent targets for antibacterial drug development. Here, we combine in silico methods combined with both computational biology and structure prediction tools to address our hypothesis. In this study, we outline a systematic approach for identifying kinetoplastid rRNA-ligand interactions and, more specifically, techniques that can be used to identify small molecules that target particular RNA. The high-resolution optimized model structures of these kineoplastids were generated using RNA 123, where all the stereochemical conflicts were solved and energies minimized to attain the best biological qualities. The high-resolution optimized model’s structures of these kinetoplastids were generated using RNA 123 where all the stereochemical conflicts were solved and energies minimized to attain the best biological qualities. These models were further analyzed to give their docking assessment reliability. Docking strategies, virtual screening, and fishing approaches successfully recognized novel and myriad macromolecular targets for the myxobacterial natural products with high binding affinities to exploit the unmet therapeutic needs. We demonstrate a sensible exploitation of virtual screening strategies to 18S rRNA using natural products interfaced with classical maximization of their efficacy in phamacognosy strategies that are well established. Integration of these virtual screening strategies in natural products chemistry and biochemistry research will spur the development of potential interventions to these tropical neglected diseases.


Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Hannes Petruschke ◽  
Christian Schori ◽  
Sebastian Canzler ◽  
Sarah Riesbeck ◽  
Anja Poehlein ◽  
...  

Abstract Background The intestinal microbiota plays a crucial role in protecting the host from pathogenic microbes, modulating immunity and regulating metabolic processes. We studied the simplified human intestinal microbiota (SIHUMIx) consisting of eight bacterial species with a particular focus on the discovery of novel small proteins with less than 100 amino acids (= sProteins), some of which may contribute to shape the simplified human intestinal microbiota. Although sProteins carry out a wide range of important functions, they are still often missed in genome annotations, and little is known about their structure and function in individual microbes and especially in microbial communities. Results We created a multi-species integrated proteogenomics search database (iPtgxDB) to enable a comprehensive identification of novel sProteins. Six of the eight SIHUMIx species, for which no complete genomes were available, were sequenced and de novo assembled. Several proteomics approaches including two earlier optimized sProtein enrichment strategies were applied to specifically increase the chances for novel sProtein discovery. The search of tandem mass spectrometry (MS/MS) data against the multi-species iPtgxDB enabled the identification of 31 novel sProteins, of which the expression of 30 was supported by metatranscriptomics data. Using synthetic peptides, we were able to validate the expression of 25 novel sProteins. The comparison of sProtein expression in each single strain versus a multi-species community cultivation showed that six of these sProteins were only identified in the SIHUMIx community indicating a potentially important role of sProteins in the organization of microbial communities. Two of these novel sProteins have a potential antimicrobial function. Metabolic modelling revealed that a third sProtein is located in a genomic region encoding several enzymes relevant for the community metabolism within SIHUMIx. Conclusions We outline an integrated experimental and bioinformatics workflow for the discovery of novel sProteins in a simplified intestinal model system that can be generically applied to other microbial communities. The further analysis of novel sProteins uniquely expressed in the SIHUMIx multi-species community is expected to enable new insights into the role of sProteins on the functionality of bacterial communities such as those of the human intestinal tract.


Author(s):  
Bertrand Chesneau ◽  
Aurélie Plancke ◽  
Guillaume Rolland ◽  
Nicolas Chassaing ◽  
Christine Coubes ◽  
...  

AbstractMarfan syndrome (MFS) is a heritable connective tissue disorder (HCTD) caused by pathogenic variants in FBN1 that frequently occur de novo. Although individuals with somatogonadal mosaicisms have been reported with respect to MFS and other HCTD, the overall frequency of parental mosaicism in this pathology is unknown. In an attempt to estimate this frequency, we reviewed all the 333 patients with a disease-causing variant in FBN1. We then used direct sequencing, combined with High Resolution Melting Analysis, to detect mosaicism in their parents, complemented by NGS when a mosaicism was objectivized. We found that (1) the number of apparently de novo events is much higher than the classically admitted number (around 50% of patients and not 25% as expected for FBN1) and (2) around 5% of the FBN1 disease-causing variants were not actually de novo as anticipated, but inherited in a context of somatogonadal mosaicisms revealed in parents from three families. High Resolution Melting Analysis and NGS were more efficient at detecting and evaluating the level of mosaicism compared to direct Sanger sequencing. We also investigated individuals with a causal variant in another gene identified through our “aortic diseases genes” NGS panel and report, for the first time, on an individual with a somatogonadal mosaicism in COL5A1. Our study shows that parental mosaicism is not that rare in Marfan syndrome and should be investigated with appropriate methods given its implications in patient’s management.


PLoS ONE ◽  
2015 ◽  
Vol 10 (4) ◽  
pp. e0123998 ◽  
Author(s):  
Saulo H. P. de Oliveira ◽  
Jiye Shi ◽  
Charlotte M. Deane

2009 ◽  
Vol 393 (1) ◽  
pp. 249-260 ◽  
Author(s):  
David E. Kim ◽  
Ben Blum ◽  
Philip Bradley ◽  
David Baker

Sign in / Sign up

Export Citation Format

Share Document