orphan proteins Latest Research Papers

Quality control of mislocalized and orphan proteins

Experimental Cell Research ◽

10.1016/j.yexcr.2021.112617 ◽

2021 ◽

Vol 403 (2) ◽

pp. 112617

Author(s):

Ka-Yiu Edwin Kong ◽

João P.L. Coelho ◽

Matthias J. Feige ◽

Anton Khmelinskii

Keyword(s):

Quality Control ◽

Orphan Proteins

Download Full-text

Genome-wide Prediction of Small Molecule Binding to Remote Orphan Proteins Using Distilled Sequence Alignment Embedding

10.1101/2020.08.04.236729 ◽

2020 ◽

Author(s):

Tian Cai ◽

Hansaim Lim ◽

Kyra Alyssa Abbu ◽

Yue Qiu ◽

Ruth Nussinov ◽

...

Keyword(s):

Sequence Alignment ◽

Protein Interactions ◽

Domain Knowledge ◽

Training Data ◽

Fine Tuning ◽

Data Set ◽

Vast Number ◽

Model Generalization ◽

Bioassay Data ◽

Orphan Proteins

AbstractEndogenous or surrogate ligands of a vast number of proteins remain unknown. Identification of small molecules that bind to these orphan proteins will not only shed new light into their biological functions but also provide new opportunities for drug discovery. Deep learning plays an increasing role in the prediction of chemical-protein interactions, but it faces several challenges in protein deorphanization. Bioassay data are highly biased to certain proteins, making it difficult to train a generalizable machine learning model for the proteins that are dissimilar from the ones in the training data set. Pre-training offers a general solution to improving the model generalization, but needs incorporation of domain knowledge and customization of task-specific supervised learning. To address these challenges, we develop a novel protein pre-training method, DIstilled Sequence Alignment Embedding (DISAE), and a module-based fine-tuning strategy for the protein deorphanization. In the benchmark studies, DISAE significantly improves the generalizability and outperforms the state-of-the-art methods with a large margin. The interpretability analysis of pre-trained model suggests that it learns biologically meaningful information. We further use DISAE to assign ligands to 649 human orphan G-Protein Coupled Receptors (GPCRs) and to cluster the human GPCRome by integrating their phylogenetic and ligand relationships. The promising results of DISAE open an avenue for exploring the chemical landscape of entire sequenced genomes.

Download Full-text

Removing orphan proteins from the system

Science ◽

10.1126/science.357.6350.467-m ◽

2017 ◽

Vol 357 (6350) ◽

pp. 467.13-469

Author(s):

Stella M. Hurtley

Keyword(s):

Orphan Proteins

Download Full-text

High GC content causes orphan proteins to be intrinsically disordered

PLoS Computational Biology ◽

10.1371/journal.pcbi.1005375 ◽

2017 ◽

Vol 13 (3) ◽

pp. e1005375 ◽

Cited By ~ 26

Author(s):

Walter Basile ◽

Oxana Sachenkova ◽

Sara Light ◽

Arne Elofsson

Keyword(s):

Gc Content ◽

Intrinsically Disordered ◽

Orphan Proteins

Download Full-text

High GC Content Causes Orphan Proteins to be Intrinsically Disordered

10.1101/103739 ◽

2017 ◽

Author(s):

Walter Basile ◽

Oxana Sachenkova ◽

Sara Light ◽

Arne Elofsson

Keyword(s):

Amino Acids ◽

Structural Properties ◽

De Novo ◽

Gc Content ◽

Large Degree ◽

Protein Coding ◽

Intrinsically Disordered ◽

A Genome ◽

Short Orfs ◽

Orphan Proteins

AbstractDe novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the populationThese orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not aggregate. Therefore, although the creation of short ORFs could be truly random, the fixation should be subjected to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge no valid explanation for this difference has been proposed.To solve this riddle we studied structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in the properties between proteins of different ages. However, when we take the GC content into account we noted that it could explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead these proteins largely resemble random proteins given a particular GC level. During evolution the properties of a protein change faster than the GC level causing the relationship between disorder and GC to gradually weaken.Author SummaryWe show that the GC content of a genome is of great importance for the properties of an orphan protein. GC content affects the frequency of the codons and this affects the probability for each amino acid to be included in a de novo created protein. The codons encoding for Ala, Pro and Gly contain 80% GC, while codons for Lys, Phe, Asn, Tyr and Ile contain 20% or less. The three high GC amino acids are all disorder promoting, while Phe, Tyr and Ile are order promoting. Therefore, random protein sequences at a high GC will be more disordered than the ones created at a low GC. The structural properties of the youngest proteins match to a large degree the properties of random proteins when the GC content is taken into account. In contrast, structural properties of ancient proteins only show a weak correlation with GC content. This suggests that even after fixation in the population, proteins largely resemble random proteins given a certain GC content. Thereafter, during evolution the correlation between structural properties and GC weakens.

Download Full-text

Orphan proteins of unknown function in the mitochondrial intermembrane space proteome: New pathways and metabolic cross-talk

Biochimica et Biophysica Acta (BBA) - Molecular Cell Research ◽

10.1016/j.bbamcr.2016.07.004 ◽

2016 ◽

Vol 1863 (11) ◽

pp. 2613-2623 ◽

Cited By ~ 7

Author(s):

Esther Nuebel ◽

Phanee Manganas ◽

Kostas Tokatlidis

Keyword(s):

Cross Talk ◽

Unknown Function ◽

Intermembrane Space ◽

Mitochondrial Intermembrane Space ◽

Orphan Proteins

Download Full-text

Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

BMC Research Notes ◽

10.1186/1756-0500-4-325 ◽

2011 ◽

Vol 4 (1) ◽

Author(s):

Mohammad Tauqeer Alam ◽

Eriko Takano ◽

Rainer Breitling

Keyword(s):

Gene Expression ◽

Streptomyces Coelicolor ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Orphan Proteins

Download Full-text

Functional Insights from Computational Modeling of Orphan Proteins Expressed in a Microbial Community

Journal of Proteomics & Bioinformatics ◽

10.4172/jpb.1000150 ◽

2010 ◽

Vol 03 (09) ◽

pp. 266-274 ◽

Cited By ~ 1

Author(s):

Korin E. Wheeler ◽

Adam Zemla ◽

Yongqin Jiao ◽

Daniela S. Aliaga Goltsman ◽

Steven W.Singer

Keyword(s):

Microbial Community ◽

Computational Modeling ◽

Orphan Proteins

Download Full-text

orphan proteins
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Quality control of mislocalized and orphan proteins

Genome-wide Prediction of Small Molecule Binding to Remote Orphan Proteins Using Distilled Sequence Alignment Embedding

Removing orphan proteins from the system

High GC content causes orphan proteins to be intrinsically disordered

High GC Content Causes Orphan Proteins to be Intrinsically Disordered

Orphan proteins of unknown function in the mitochondrial intermembrane space proteome: New pathways and metabolic cross-talk

Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

Functional Insights from Computational Modeling of Orphan Proteins Expressed in a Microbial Community

Export Citation Format

orphan proteinsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Quality control of mislocalized and orphan proteins

Genome-wide Prediction of Small Molecule Binding to Remote Orphan Proteins Using Distilled Sequence Alignment Embedding

Removing orphan proteins from the system

High GC content causes orphan proteins to be intrinsically disordered

High GC Content Causes Orphan Proteins to be Intrinsically Disordered

Orphan proteins of unknown function in the mitochondrial intermembrane space proteome: New pathways and metabolic cross-talk

Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

Functional Insights from Computational Modeling of Orphan Proteins Expressed in a Microbial Community

orphan proteins
Recently Published Documents