Improving K-NN Internet Traffic Classification Using Clustering and Principle Component Analysis

Adi Suryaputra Paramita

doi:10.11591/eei.v6i2.608

Improving K-NN Internet Traffic Classification Using Clustering and Principle Component Analysis

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v6i2.608 ◽

2017 ◽

Vol 6 (2) ◽

pp. 159-165

Author(s):

Adi Suryaputra Paramita

Keyword(s):

Feature Selection ◽

Component Analysis ◽

Internet Traffic ◽

Classification Algorithm ◽

Combination Method ◽

Traffic Classification ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Initial Dataset ◽

Internet Traffic Classification

K-Nearest Neighbour (K-NN) is one of the popular classification algorithm, in this research K-NN use to classify internet traffic, the K-NN is appropriate for huge amounts of data and have more accurate classification, K-NN algorithm has a disadvantages in computation process because K-NN algorithm calculate the distance of all existing data in dataset. Clustering is one of the solution to conquer the K-NN weaknesses, clustering process should be done before the K-NN classification process, the clustering process does not need high computing time to conqest the data which have same characteristic, Fuzzy C-Mean is the clustering algorithm used in this research. The Fuzzy C-Mean algorithm no need to determine the first number of clusters to be formed, clusters that form on this algorithm will be formed naturally based datasets be entered. The Fuzzy C-Mean has weakness in clustering results obtained are frequently not same even though the input of dataset was same because the initial dataset that of the Fuzzy C-Mean is less optimal, to optimize the initial datasets needs feature selection algorithm. Feature selection is a method to produce an optimum initial dataset Fuzzy C-Means. Feature selection algorithm in this research is Principal Component Analysis (PCA). PCA can reduce non significant attribute or feature to create optimal dataset and can improve performance for clustering and classification algorithm. The resultsof this research is the combination method of classification, clustering and feature selection of internet traffic dataset was successfully modeled internet traffic classification method that higher accuracy and faster performance.

Download Full-text

Effect of Feature Selection on Performance of Internet Traffic Classification on NIMS Multi-Class dataset

Journal of Physics Conference Series ◽

10.1088/1742-6596/1299/1/012035 ◽

2019 ◽

Vol 1299 ◽

pp. 012035

Author(s):

Jonathan Oluranti ◽

Nicholas Omoregbe ◽

Sanjay Misra

Keyword(s):

Feature Selection ◽

Internet Traffic ◽

Traffic Classification ◽

Internet Traffic Classification

Download Full-text

Effective Feature Selection for 5G IM Applications Traffic Classification

Mobile Information Systems ◽

10.1155/2017/6805056 ◽

2017 ◽

Vol 2017 ◽

pp. 1-12 ◽

Cited By ~ 3

Author(s):

Muhammad Shafiq ◽

Xiangzhan Yu ◽

Asif Ali Laghari ◽

Dawei Wang

Keyword(s):

Feature Selection ◽

Classification Accuracy ◽

Statistical Test ◽

Traffic Classification ◽

Features Selection ◽

Traffic Flows ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Wrapper Method ◽

Selection For

Recently, machine learning (ML) algorithms have widely been applied in Internet traffic classification. However, due to the inappropriate features selection, ML-based classifiers are prone to misclassify Internet flows as that traffic occupies majority of traffic flows. To address this problem, a novel feature selection metric named weighted mutual information (WMI) is proposed. We develop a hybrid feature selection algorithm named WMI_ACC, which filters most of the features with WMI metric. It further uses a wrapper method to select features for ML classifiers with accuracy (ACC) metric. We evaluate our approach using five ML classifiers on the two different network environment traces captured. Furthermore, we also apply Wilcoxon pairwise statistical test on the results of our proposed algorithm to find out the robust features from the selected set of features. Experimental results show that our algorithm gives promising results in terms of classification accuracy, recall, and precision. Our proposed algorithm can achieve 99% flow accuracy results, which is very promising.

Download Full-text

Clustering and Principal Feature Selection Impact for Internet Traffic Classification Using K-NN

Proceedings of Second International Conference on Electrical Systems, Technology and Information 2015 (ICESTI 2015) - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-287-988-2_7 ◽

2016 ◽

pp. 75-81 ◽

Cited By ~ 1

Author(s):

Trianggoro Wiradinata ◽

P. Adi Suryaputra

Keyword(s):

Feature Selection ◽

Internet Traffic ◽

Traffic Classification ◽

Principal Feature ◽

Internet Traffic Classification

Download Full-text

Online hybrid internet traffic classification algorithm based on signature statistical and port methods to identify internet applications

2013 IEEE International Conference on Control System, Computing and Engineering ◽

10.1109/iccsce.2013.6719956 ◽

2013 ◽

Cited By ~ 1

Author(s):

Hamza Awad Hamza Ibrahim ◽

Sulaiman Mohd Nor ◽

Haitham A. Jamil

Keyword(s):

Internet Traffic ◽

Classification Algorithm ◽

Traffic Classification ◽

Internet Applications ◽

Internet Traffic Classification

Download Full-text

A novel feature selection technique based on Roach Infestation Optimization for Internet Traffic Classification

2020 2nd International Conference on Computer and Information Sciences (ICCIS) ◽

10.1109/iccis49240.2020.9257694 ◽

2020 ◽

Author(s):

Dalila Boughaci ◽

Fatma Belaidi ◽

Imene Kerkouche

Keyword(s):

Feature Selection ◽

Internet Traffic ◽

Traffic Classification ◽

Feature Selection Technique ◽

Selection Technique ◽

Internet Traffic Classification

Download Full-text

A New Feature Selection Method for Internet Traffic Classification Using ML

Physics Procedia ◽

10.1016/j.phpro.2012.05.220 ◽

2012 ◽

Vol 33 ◽

pp. 1338-1345 ◽

Cited By ~ 14

Author(s):

Liu Zhen ◽

Liu Qiong

Keyword(s):

Feature Selection ◽

Feature Selection Method ◽

Internet Traffic ◽

Selection Method ◽

Traffic Classification ◽

Internet Traffic Classification ◽

New Feature

Download Full-text

A reliable feature selection algorithm for determining heartbeat case using weighted principal component analysis

2016 International Conference on System Science and Engineering (ICSSE) ◽

10.1109/icsse.2016.7551594 ◽

2016 ◽

Cited By ~ 2

Author(s):

Yun-Chi Yeh ◽

Chun-Wei Chen ◽

Che Wun Chiou ◽

Tsui-Yao Chu

Keyword(s):

Principal Component Analysis ◽

Feature Selection ◽

Principal Component ◽

Component Analysis ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Weighted Principal Component

Download Full-text

Clustering and Feature Selection Technique for Improving Internet Traffic Classification Using K-NN

Journal of Advances in Computer Networks ◽

10.18178/jacn.2016.4.1.198 ◽

2016 ◽

Vol 4 (1) ◽

pp. 24-27 ◽

Cited By ~ 2

Author(s):

Trianggoro Wiradinata ◽

◽

Adi Suryaputra Paramita

Keyword(s):

Feature Selection ◽

Internet Traffic ◽

Traffic Classification ◽

Feature Selection Technique ◽

Selection Technique ◽

Internet Traffic Classification

Download Full-text

Spark-Based Feature Selection Algorithm of Network Traffic Classification

2017 13th International Conference on Computational Intelligence and Security (CIS) ◽

10.1109/cis.2017.00038 ◽

2017 ◽

Author(s):

Wenlong Ke ◽

Yong Wang ◽

Xiaochun Lei ◽

Bizhong Wei

Keyword(s):

Feature Selection ◽

Network Traffic ◽

Traffic Classification ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Network Traffic Classification

Download Full-text

Principal Feature Selection Impact for Internet Traffic Classification Using Naïve Bayes

Proceedings of Second International Conference on Electrical Systems, Technology and Information 2015 (ICESTI 2015) - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-287-988-2_52 ◽

2016 ◽

pp. 475-480

Author(s):

Adi Suryaputra Paramita

Keyword(s):

Feature Selection ◽

Naive Bayes ◽

Internet Traffic ◽

Naïve Bayes ◽

Traffic Classification ◽

Principal Feature ◽

Internet Traffic Classification

Download Full-text