Predicting Protein Subcellular Localization Using the Algorithm of Increment Of Diversity Combined with Weighted K-Nearest Neighbor

Predicting Protein Subcellular Localization Using the Algorithm of Increment of Diversity Combined with Weighted K-Nearest Neighbor

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.765-767.3099 ◽

2013 ◽

Vol 765-767 ◽

pp. 3099-3103 ◽

Cited By ~ 1

Author(s):

Ze Yue Wu ◽

Yue Hui Chen

Keyword(s):

Subcellular Localization ◽

Cross Validation ◽

Nearest Neighbor ◽

Research Field ◽

Important Research ◽

Protein Subcellular Localization ◽

K Nearest Neighbor ◽

Prediction Ability ◽

New Approach ◽

Increment Of Diversity

Protein subcellular localization is an important research field of bioinformatics. In this paper, we use the algorithm of the increment of diversity combined with weighted K nearest neighbor to predict protein in SNL6 which has six subcelluar localizations and SNL9 which has nine subcelluar localizations. We use the increment of diversity to extract diversity finite coefficient as new features of proteins. And the basic classifier is weighted K-nearest neighbor. The prediction ability was evaluated by 5-jackknife cross-validation. Its predicted result is 83.3% for SNL6 and 87.6 % for SNL9. By comparing its results with other methods, it indicates the new approach is feasible and effective.

Download Full-text

MpsLDA-ProSVM: predicting multi-label protein subcellular localization by wMLDAe dimensionality reduction and ProSVM classifier

10.1101/2020.04.19.049478 ◽

2020 ◽

Author(s):

Qi Zhang ◽

Shan Li ◽

Bin Yu ◽

Yang Li ◽

Yandan Zhang ◽

...

Keyword(s):

Subcellular Localization ◽

Nearest Neighbor ◽

Chemical Information ◽

Sequence Information ◽

Feature Subset ◽

Protein Subcellular Localization ◽

K Nearest Neighbor ◽

Entropy Weight ◽

Linear Discriminant ◽

Optimal Feature Subset

ABSTRACTProteins play a significant part in life processes such as cell growth, development, and reproduction. Exploring protein subcellular localization (SCL) is a direct way to better understand the function of proteins in cells. Studies have found that more and more proteins belong to multiple subcellular locations, and these proteins are called multi-label proteins. They not only play a key role in cell life activities, but also play an indispensable role in medicine and drug development. This article first presents a new prediction model, MpsLDA-ProSVM, to predict the SCL of multi-label proteins. Firstly, the physical and chemical information, evolution information, sequence information and annotation information of protein sequences are fused. Then, for the first time, use a weighted multi-label linear discriminant analysis framework based on entropy weight form (wMLDAe) to refine and purify features, reduce the difficulty of learning. Finally, input the optimal feature subset into the multi-label learning with label-specific features (LIFT) and multi-label k-nearest neighbor (ML-KNN) algorithms to obtain a synthetic ranking of relevant labels, and then use Prediction and Relevance Ordering based SVM (ProSVM) classifier to predict the SCLs. This method can rank and classify related tags at the same time, which greatly improves the efficiency of the model. Tested by jackknife method, the overall actual accuracy (OAA) on virus, plant, Gram-positive bacteria and Gram-negative bacteria datasets are 98.06%, 98.97%, 99.81% and 98.49%, which are 0.56%-9.16%, 5.37%-30.87%, 3.51%-6.91% and 3.99%-8.59% higher than other advanced methods respectively. The source codes and datasets are available at https://github.com/QUST-AIBBDRC/MpsLDA-ProSVM/.

Download Full-text

Predicting Viral Protein Subcellular Localization with Chou's Pseudo Amino Acid Composition and Imbalance-Weighted Multi-Label K-Nearest Neighbor Algorithm

Protein and Peptide Letters ◽

10.2174/092986612803216999 ◽

2012 ◽

Vol 19 (11) ◽

pp. 1163-1169 ◽

Cited By ~ 16

Author(s):

Jun-Zhe Cao ◽

Wen-Qi Liu ◽

Hong Gu

Keyword(s):

Amino Acid ◽

Subcellular Localization ◽

Amino Acid Composition ◽

Acid Composition ◽

Viral Protein ◽

Nearest Neighbor ◽

Protein Subcellular Localization ◽

K Nearest Neighbor ◽

Pseudo Amino Acid Composition ◽

K Nearest Neighbor Algorithm

Download Full-text

Predicting Protein Subcellular Localization Using the Algorithm of Diversity Finite Coefficient Combined with Artificial Neural Network

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.756-759.3760 ◽

2013 ◽

Vol 756-759 ◽

pp. 3760-3765 ◽

Cited By ~ 1

Author(s):

Ze Yue Wu ◽

Yue Hui Chen

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Subcellular Localization ◽

Classification Problem ◽

Research Field ◽

Protein Subcellular Localization ◽

Classification Problems ◽

Prediction Ability ◽

Increment Of Diversity ◽

Artificial Neural

Protein subcellular localization is an important research field of bioinformatics. The subcellular localization of proteins classification problem is transformed into several two classification problems with error-correcting output codes. In this paper, we use the algorithm of the increment of diversity combined with artificial neural network to predict protein in SNL6 which has six subcelluar localizations. The prediction ability was evaluated by 5-jackknife cross-validation. Its predicted result is 81.3%. By com-paring its results with other methods, it indicates the new approach is feasible and effective.

Download Full-text

Predicting protein subcellular localization by approximate nearest neighbor searching

2017 29th Chinese Control And Decision Conference (CCDC) ◽

10.1109/ccdc.2017.7978996 ◽

2017 ◽

Author(s):

Wei Xue ◽

Xiao-yu Hong ◽

Nan Zhao ◽

Rong-li Yang ◽

Liang Zhang

Keyword(s):

Subcellular Localization ◽

Nearest Neighbor ◽

Protein Subcellular Localization ◽

Approximate Nearest Neighbor ◽

Nearest Neighbor Searching

Download Full-text

Machine Learning Verdict of EEG Signals in Brain Computer Interface

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit1838114 ◽

2018 ◽

pp. 429-441

Author(s):

M. Jeyanthi ◽

C. Velayutham

Keyword(s):

Nearest Neighbor ◽

Technology Development ◽

Vital Role ◽

Svm Classifier ◽

K Nearest Neighbor ◽

Data Mining Technique ◽

Data Set ◽

Eeg Data ◽

Irrelevant Attributes

In Science and Technology Development BCI plays a vital role in the field of Research. Classification is a data mining technique used to predict group membership for data instances. Analyses of BCI data are challenging because feature extraction and classification of these data are more difficult as compared with those applied to raw data. In this paper, We extracted features using statistical Haralick features from the raw EEG data . Then the features are Normalized, Binning is used to improve the accuracy of the predictive models by reducing noise and eliminate some irrelevant attributes and then the classification is performed using different classification techniques such as Naïve Bayes, k-nearest neighbor classifier, SVM classifier using BCI dataset. Finally we propose the SVM classification algorithm for the BCI data set.

Download Full-text

PENENTUAN DAERAH PRIORITAS PELAYANAN AKTA KELAHIRAN DENGAN METODE K-NN DAN K-MEANS

Komputasi: Jurnal Ilmiah Ilmu Komputer dan Matematika ◽

10.33751/komputasi.v17i1.1735 ◽

2020 ◽

Vol 17 (1) ◽

pp. 319-328

Author(s):

Ade Muchlis Maulana Anwar ◽

Prihastuti Harsani ◽

Aries Maesya

Keyword(s):

Nearest Neighbor ◽

Information Gain ◽

Birth Certificate ◽

Population Data ◽

Community Services ◽

Birth Certificates ◽

Similar Data ◽

K Nearest Neighbor ◽

Civil Registration ◽

The Family

Population Data is individual data or aggregate data that is structured as a result of Population Registration and Civil Registration activities. Birth Certificate is a Civil Registration Deed as a result of recording the birth event of a baby whose birth is reported to be registered on the Family Card and given a Population Identification Number (NIK) as a basis for obtaining other community services. From the total number of integrated birth certificate reporting for the 2018 Population Administration Information System (SIAK) totaling 570,637 there were 503,946 reported late and only 66,691 were reported publicly. Clustering is a method used to classify data that is similar to others in one group or similar data to other groups. K-Nearest Neighbor is a method for classifying objects based on learning data that is the closest distance to the test data. k-means is a method used to divide a number of objects into groups based on existing categories by looking at the midpoint. In data mining preprocesses, data is cleaned by filling in the blank data with the most dominating data, and selecting attributes using the information gain method. Based on the k-nearest neighbor method to predict delays in reporting and the k-means method to classify priority areas of service with 10,000 birth certificate data on birth certificates in 2019 that have good enough performance to produce predictions with an accuracy of 74.00% and with K = 2 on k-means produces a index davies bouldin of 1,179.

Download Full-text

A Scalable K-Nearest Neighbor Algorithm for Recommendation System Problems

2020 43rd International Convention on Information, Communication and Electronic Technology (MIPRO) ◽

10.23919/mipro48935.2020.9245195 ◽

2020 ◽

Author(s):

A. Sagdic ◽

C. Tekinbas ◽

E. Arslan ◽

T. Kucukyilmaz

Keyword(s):

Recommendation System ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm

Download Full-text

Reuse Recipe Document for: A robust fractionation method for protein subcellular localization studies in Escherichia coli

10.23942/biotechniques.1559047365000 ◽

2019 ◽

Keyword(s):

Escherichia Coli ◽

Subcellular Localization ◽

Protein Subcellular Localization ◽

Fractionation Method

Download Full-text

Optimizing Error Rate in Intrusion Detection System Using Artificial Neural Network Algorithm

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i9.102 ◽

2018 ◽

Vol 6 (9) ◽

pp. 152

Author(s):

S. Vijaya Rani ◽

G. N. K. Suresh Babu

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Intrusion Detection ◽

Error Rate ◽

Learning Process ◽

Nearest Neighbor ◽

Detection System ◽

Support Vector ◽

K Nearest Neighbor ◽

Artificial Neural

The illegal hackers penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.

Download Full-text