Predicting Gram-Positive Bacterial Protein Subcellular Location by Using Combined Features

BioMed Research International ◽

10.1155/2020/9701734 ◽

2020 ◽

Vol 2020 ◽

pp. 1-8 ◽

Cited By ~ 1

Author(s):

Feng-Min Li ◽

Xiao-Wei Gao

Keyword(s):

Amino Acid ◽

Protein Function ◽

Subcellular Location ◽

Support Vector ◽

Bacterial Protein ◽

Dipeptide Composition ◽

Gram Positive ◽

Jackknife Test ◽

Gram Positive Bacteria ◽

Protein Subcellular Location

Bacteria are abundant in the environment, and Gram-positive bacteria are the most common. Some Gram-positive bacteria are very harmful to the human body, so predicting the subcellular location of Gram-positive bacterial proteins is important, particularly for developing effective drugs. In this paper, a new Gram-positive bacterial protein subcellular location dataset was established. Five kinds of information were selected as characteristic parameters and then combined: the amino acid composition, the gene ontology annotation, the hydropathy dipeptide composition, the amino acid dipeptide composition, and the autocovariance average chemical shift. The locations of Gram-positive bacterial proteins were predicted with the Support Vector Machine (SVM) algorithm, and the overall accuracy (OA) reached 86.1% under the jackknife test, which was higher than that of existing methods. This improved method may be helpful for protein function prediction.
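As a rough illustration of how two of the sequence-derived descriptors named above can be combined and evaluated, the following sketch computes the amino acid composition and the amino acid dipeptide composition, concatenates them, and runs a jackknife (leave-one-out) test. It is a minimal sketch under stated assumptions, not the authors' pipeline: the function names, the toy sequences and the labels are invented for illustration, the other three descriptors are omitted, and a 1-nearest-neighbour rule stands in for the SVM so that the example needs only the standard library.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def amino_acid_composition(seq):
    """Return the 20 residue frequencies of seq (fractions summing to 1)."""
    residues = [r for r in seq if r in AMINO_ACIDS]
    total = len(residues) or 1
    return [residues.count(a) / total for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return the 400 frequencies of adjacent residue pairs in seq."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    pairs = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    total = len(pairs) or 1
    return [pairs.count(d) / total for d in DIPEPTIDES]


def combined_features(seq):
    """Concatenate both descriptors into one 420-dimensional vector."""
    return amino_acid_composition(seq) + dipeptide_composition(seq)


def jackknife_accuracy(vectors, labels):
    """Leave-one-out accuracy with a 1-nearest-neighbour stand-in classifier."""
    correct = 0
    for i, query in enumerate(vectors):
        best_label, best_dist = None, float("inf")
        for j, other in enumerate(vectors):
            if i == j:
                continue  # the held-out sample never votes for itself
            dist = sum((a - b) ** 2 for a, b in zip(query, other))
            if dist < best_dist:
                best_label, best_dist = labels[j], dist
        correct += best_label == labels[i]
    return correct / len(vectors)


# Toy data: made-up sequences and made-up location labels.
toy_sequences = ["AAAA", "AAAC", "WWWW", "WWWY"]
toy_labels = ["cytoplasm", "cytoplasm", "cell wall", "cell wall"]
toy_vectors = [combined_features(s) for s in toy_sequences]
toy_accuracy = jackknife_accuracy(toy_vectors, toy_labels)
```

In the jackknife test each protein is held out once and predicted from all the others, so the reported accuracy is the fraction of held-out proteins assigned to their annotated location.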

Identifying protein subcellular location with embedding features learned from networks

Current Proteomics ◽

10.2174/1570164617999201124142950 ◽

2020 ◽

Vol 17 ◽

Author(s):

Hongwei Liu ◽

Bin Hu ◽

Lei Chen ◽

Lin Lu

Keyword(s):

Computational Methods ◽

Protein Function ◽

Cross Validation ◽

Learning Algorithm ◽

Subcellular Location ◽

Support Vector ◽

Network Embedding ◽

Validation Method ◽

Protein Subcellular Location ◽

The Cross

Background: Identifying protein subcellular location is an important problem because subcellular location is closely related to protein function. Determining locations through biological experiments is fundamental, but such experiments are costly and time-consuming. The alternative is to design effective computational methods. Objective: To date, several computational methods have been proposed for this task. However, these methods mainly adopted features derived from the proteins themselves. Meanwhile, with the development of network techniques, several embedding algorithms have been proposed that encode the nodes of a network into feature vectors. Such algorithms connect networks with traditional classification algorithms and thus provide a new way to construct models for predicting protein subcellular location. Method: In this study, we analyzed features produced by three network embedding algorithms (DeepWalk, Node2vec and Mashup) applied to one or multiple protein networks. The resulting features were learned by a machine learning algorithm (support vector machine or random forest) to construct the model. Cross-validation was adopted to evaluate all constructed models. Results: Under cross-validation, the embedding features yielded by Mashup on multiple networks were quite informative for predicting protein subcellular location. The model based on these features was superior to some classic models. Conclusion: Embedding features yielded by a proper and powerful network embedding algorithm were effective for building models that predict protein subcellular location, providing new pipelines for building more efficient models.
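To make the embedding idea more concrete: DeepWalk first samples truncated random walks over the network and then trains a skip-gram model on those walks to obtain node vectors. The sketch below covers only the walk-sampling stage, using the standard library. The network, the protein identifiers and the parameter values are hypothetical, and the skip-gram and classification stages are deliberately left out.

```python
import random


def random_walks(adjacency, walk_length=5, walks_per_node=2, seed=0):
    """Generate truncated random walks starting from every node."""
    rng = random.Random(seed)
    walks = []
    for _ in range(walks_per_node):
        nodes = list(adjacency)
        rng.shuffle(nodes)  # visit start nodes in a random order on each pass
        for start in nodes:
            walk = [start]
            # Extend the walk by hopping to a uniformly chosen neighbour.
            while len(walk) < walk_length and adjacency[walk[-1]]:
                walk.append(rng.choice(adjacency[walk[-1]]))
            walks.append(walk)
    return walks


# Hypothetical protein-protein interaction network (identifiers are made up).
network = {
    "P1": ["P2", "P3"],
    "P2": ["P1", "P3"],
    "P3": ["P1", "P2", "P4"],
    "P4": ["P3"],
}
walks = random_walks(network)
```

Each walk plays the role of a sentence and each protein the role of a word, which is what lets a word-embedding model turn network neighbourhoods into feature vectors for a downstream classifier.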

Integrating Second-order Moving Average and Over-sampling Algorithm to Predict Apoptosis Protein Subcellular Localization

Current Bioinformatics ◽

10.2174/1574893614666190902155811 ◽

2020 ◽

Vol 15 (6) ◽

pp. 517-527

Author(s):

Yunyun Liang ◽

Shengli Zhang

Keyword(s):

Subcellular Localization ◽

Moving Average ◽

Subcellular Location ◽

Second Order ◽

Test Method ◽

Support Vector ◽

Protein Subcellular Localization ◽

Protein Subcellular Location ◽

Apoptosis Protein ◽

Leibler Divergence

Background: Apoptosis proteins play a key role in the development and homeostasis of the organism and are very important for understanding the mechanism of cell proliferation and death. The function of an apoptosis protein is closely related to its subcellular location. Objective: Prediction of apoptosis protein subcellular localization is a meaningful task. Methods: In this study, we predict apoptosis protein subcellular location by using a PSSM-based second-order moving average descriptor, nonnegative matrix factorization based on Kullback-Leibler divergence, and over-sampling algorithms. The model, named SOMAPKLNMF-OS, is constructed on the ZD98, ZW225 and CL317 benchmark datasets. The support vector machine is then adopted as the classifier, and the bias-free jackknife test is used to evaluate the accuracy. Results: Our prediction system achieves favorable overall accuracy on the three datasets and outperforms the other listed models. Conclusion: The results show that our model offers a high-throughput tool for the identification of apoptosis protein subcellular localization.
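The abstract does not say which over-sampling algorithm is used, so the following is only a sketch of the general idea, with plain random over-sampling swapped in as an assumption: samples of the smaller classes are duplicated at random until every class is as large as the biggest one. The function name and the toy data are hypothetical.

```python
import random
from collections import Counter


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples at random until all classes are equal in size."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_samples, out_labels = list(samples), list(labels)
    for cls, n in counts.items():
        pool = [s for s, lab in zip(samples, labels) if lab == cls]
        for _ in range(target - n):
            out_samples.append(rng.choice(pool))  # draw with replacement
            out_labels.append(cls)
    return out_samples, out_labels


# Toy imbalanced data: made-up feature vectors and made-up location labels.
X = [[0.1], [0.2], [0.3], [0.9]]
y = ["cytoplasm", "cytoplasm", "cytoplasm", "nucleus"]
X_balanced, y_balanced = random_oversample(X, y)
```

When over-sampling is combined with a jackknife test, it is usually applied to the training portion only, since copies of the held-out sample in the training set would inflate the accuracy.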

iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences

International Journal of Molecular Sciences ◽

10.3390/ijms19041190 ◽

2018 ◽

Vol 19 (4) ◽

pp. 1190 ◽

Cited By ~ 1

Author(s):

Yadong Tang ◽

Lu Xie ◽

Lanming Chen

Keyword(s):

Amino Acid ◽

Subcellular Location ◽

Amino Acid Sequences ◽

Protein Subcellular Location ◽

Apoptosis Protein

Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition

Journal of Cellular Biochemistry ◽

10.1002/jcb.10719 ◽

2003 ◽

Vol 90 (6) ◽

pp. 1250-1260 ◽

Cited By ~ 111

Author(s):

Kuo-Chen Chou ◽

Yu-Dong Cai

Keyword(s):

Amino Acid ◽

Amino Acid Composition ◽

Acid Composition ◽

Subcellular Location ◽

Order Effect ◽

Pseudo Amino Acid Composition ◽

Protein Subcellular Location

Prediction of protein subcellular location using hydrophobic patterns of amino acid sequence

Computational Biology and Chemistry ◽

10.1016/j.compbiolchem.2006.08.003 ◽

2006 ◽

Vol 30 (5) ◽

pp. 367-371 ◽

Cited By ~ 50

Author(s):

Tongliang Zhang ◽

Yongsheng Ding ◽

Kuo-Chen Chou

Keyword(s):

Amino Acid ◽

Amino Acid Sequence ◽

Subcellular Location ◽

Protein Subcellular Location ◽

Hydrophobic Patterns

Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition

BMC Bioinformatics ◽

10.1186/1471-2105-8-466 ◽

2007 ◽

Vol 8 (1) ◽

Cited By ~ 36

Author(s):

Takeyuki Tamura ◽

Tatsuya Akutsu

Keyword(s):

Amino Acid ◽

Support Vector Machines ◽

Amino Acid Composition ◽

Acid Composition ◽

Subcellular Location ◽

Support Vector ◽

Location Prediction ◽

Subcellular Location Prediction ◽

Vector Machines

Antibacterial Action of Structurally Diverse Cationic Peptides on Gram-Positive Bacteria

Antimicrobial Agents and Chemotherapy ◽

10.1128/aac.44.8.2086-2092.2000 ◽

2000 ◽

Vol 44 (8) ◽

pp. 2086-2092 ◽

Cited By ~ 325

Author(s):

Carol L. Friedrich ◽

Dianne Moyles ◽

Terry J. Beveridge ◽

Robert E. W. Hancock

Keyword(s):

Amino Acid ◽

Mechanism Of Action ◽

Cytoplasmic Membrane ◽

Membrane Depolarization ◽

Membrane Permeabilization ◽

Cationic Peptides ◽

Nuclear Condensation ◽

Gram Positive ◽

Gram Positive Bacteria ◽

Helical Peptide

ABSTRACT Antimicrobial cationic peptides are ubiquitous in nature and are thought to be a component of the first line of defense against infectious agents. It is widely believed that the killing mechanism of these peptides on bacteria involves an interaction with the cytoplasmic membrane. Cationic peptides from different structural classes were used in experiments with Staphylococcus aureus and other medically important gram-positive bacteria to gain insight into the mechanism of action. The membrane potential-sensitive fluorophore dipropylthiacarbocyanine was used to assess the interactions of selected antimicrobial peptides with the cytoplasmic membrane of S. aureus. Study of the kinetics of killing and membrane depolarization showed that, at early time points, membrane depolarization was incomplete, even when 90% or more of the bacteria had been killed. CP26, a 26-amino-acid α-helical peptide with a high MIC against S. aureus, still had the ability to permeabilize the membrane. Cytoplasmic-membrane permeabilization was a widespread ability and an action that may be necessary for reaching an intracellular target but in itself did not appear to be the killing mechanism. Transmission electron microscopy of S. aureus and Staphylococcus epidermidis treated with CP29 (a 26-amino-acid α-helical peptide), CP11CN (a 13-amino-acid, proline- and tryptophan-rich peptide), and Bac2A-NH2 (a linearized version of the 12-amino-acid loop peptide bactenecin) showed variability in effects on bacterial structure. Mesosome-like structures were seen to develop in S. aureus, whereas cell wall effects and mesosomes were seen with S. epidermidis. Nuclear condensation and aberrant septation were occasionally seen in S. epidermidis. Our experiments indicated that these peptides vary in their mechanisms of action and that the mechanism of action likely does not solely involve cytoplasmic-membrane permeabilization.

The Influence of Dipeptide Composition on Protein Folding Rates

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.378-379.157 ◽

2011 ◽

Vol 378-379 ◽

pp. 157-160

Author(s):

Jian Xiu Guo ◽

Ni Ni Rao

Keyword(s):

Protein Folding ◽

Amino Acid ◽

Amino Acid Sequences ◽

Dipeptide Composition ◽

Coupling Effects ◽

Jackknife Test ◽

Folding Rates ◽

Important Challenge ◽

The Relationship

Understanding the relationship between amino acid sequences and folding rates of proteins is an important challenge in computational and molecular biology. Existing algorithms for predicting protein folding rates have not taken sequence coupling effects into account. In this work, a novel algorithm was developed for predicting protein folding rates from amino acid sequences. The prediction is based on dipeptide composition, in which the sequence coupling effects are explicitly included through a series of conditional probability elements. On a non-redundant dataset of 99 proteins, the proposed method gave excellent agreement between the predicted and experimental folding rates when evaluated with the jackknife test. The correlation coefficient was 87.7% and the standard error was 2.04, which indicates that sequence coupling effects contribute importantly to the determination of protein folding rates.
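One plausible reading of "conditional probability elements" is the probability that a given residue follows another, estimated from adjacent pairs in the sequence. The sketch below computes that quantity. The paper's exact formulation is not given in the abstract, so the definition P(b | a) = count(ab) / count(a as first residue of a pair), the function name and the toy sequences are assumptions made for illustration.

```python
from collections import Counter


def conditional_dipeptide_probabilities(seq):
    """Estimate P(next residue = b | current residue = a) from adjacent pairs."""
    pair_counts = Counter(zip(seq, seq[1:]))
    first_counts = Counter(seq[:-1])  # how often each residue starts a pair
    return {a + b: n / first_counts[a] for (a, b), n in pair_counts.items()}


# Toy sequences, chosen only to keep the arithmetic easy to follow.
probs_aab = conditional_dipeptide_probabilities("AAB")
probs_abab = conditional_dipeptide_probabilities("ABAB")
```

Unlike plain dipeptide frequencies, these values are normalised per leading residue, so they describe how strongly one position constrains the next rather than how common a pair is overall.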

Protein Subcellular Location Prediction Based on Pseudo Amino Acid Composition and PSI-Blast Profile

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2015.4272 ◽

2015 ◽

Vol 12 (10) ◽

pp. 3756-3762 ◽

Cited By ~ 2

Author(s):

Huimin Xu ◽

Shoujiang Yan ◽

Qi Dai ◽

Ping-An He ◽

Bo Liao ◽

...

Keyword(s):

Amino Acid ◽

Amino Acid Composition ◽

Acid Composition ◽

Subcellular Location ◽

Location Prediction ◽

Pseudo Amino Acid Composition ◽

Protein Subcellular Location ◽

Subcellular Location Prediction ◽

Protein Subcellular Location Prediction

Prediction of protein function using a deep convolutional neural network ensemble

10.7287/peerj.preprints.2778 ◽

2017 ◽

Author(s):

Evangelia I Zacharaki

Keyword(s):

Neural Network ◽

Amino Acid ◽

Convolutional Neural Network ◽

Protein Function ◽

Protein Structures ◽

Function Prediction ◽

Deep Convolutional Neural Network ◽

Supervised Machine Learning ◽

Support Vector ◽

Feature Maps

Background. The availability of large databases containing high resolution three-dimensional (3D) models of proteins in conjunction with functional annotation allows the exploitation of advanced supervised machine learning techniques for automatic protein function prediction. Methods. In this work, novel shape features are extracted representing protein structure in the form of local (per amino acid) distribution of angles and amino acid distances, respectively. Each of the multi-channel feature maps is introduced into a deep convolutional neural network (CNN) for function prediction and the outputs are fused through Support Vector Machines (SVM) or a correlation-based k-nearest neighbor classifier. Two different architectures are investigated employing either one CNN per multi-channel feature set, or one CNN per image channel. Results. Cross validation experiments on enzymes (n = 44,661) from the PDB database achieved 90.1% correct classification demonstrating the effectiveness of the proposed method for automatic function annotation of protein structures. Discussion. The automatic prediction of protein function can provide quick annotations on extensive datasets opening the path for relevant applications, such as pharmacological target identification.
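The fusion step mentions a correlation-based k-nearest neighbor classifier. As a hedged illustration of what such a classifier can look like, the sketch below ranks training vectors by their Pearson correlation with a query vector and takes a majority vote among the top k. The vectors stand in for concatenated CNN outputs. Their values, the labels, the function names and the choice of k are all hypothetical, and the abstract does not confirm that the method works exactly this way.

```python
from collections import Counter
from math import sqrt


def pearson(u, v):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv) if su and sv else 0.0


def knn_predict(query, train_vectors, train_labels, k=3):
    """Majority vote among the k training vectors most correlated with query."""
    ranked = sorted(
        zip(train_vectors, train_labels),
        key=lambda pair: pearson(query, pair[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Made-up class-score vectors standing in for fused CNN outputs.
train_vectors = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.0], [0.1, 0.1, 0.8], [0.0, 0.2, 0.8]]
train_labels = ["class A", "class A", "class B", "class B"]
prediction = knn_predict([0.7, 0.2, 0.1], train_vectors, train_labels)
```

Correlation compares the shape of two score profiles rather than their absolute magnitudes, which can be convenient when the outputs of different networks are on different scales.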