An Ensemble approach for feature selection and classification in intrusion detection using Extra-Tree algorithm

doi:10.4018/ijisp.2022010113

An Ensemble approach for feature selection and classification in intrusion detection using Extra-Tree algorithm

International Journal of Information Security and Privacy ◽

10.4018/ijisp.2022010113 ◽

2022 ◽

Vol 16 (1) ◽

pp. 0-0

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Final Decision ◽

Feature Subset ◽

Tree Classifier ◽

Multiple Thresholds ◽

Feature Importance ◽

Web Communication

The number of attacks increased with speedy development in web communication in the last couple of years. The Anomaly Detection method for IDS has become substantial in detecting novel attacks in Intrusion Detection System (IDS). Achieving high accuracy are the significant challenges in designing an intrusion detection system. It also emphasizes applying different feature selection techniques to identify the most suitable feature subset. The author uses Extremely randomized trees (Extra-Tree) for feature importance. The author tries multiple thresholds on the feature importance parameters to find the best features. If single classifiers use, then the classifier's output is wrong, so that the final decision may be wrong. So The author uses an Extra-Tree classifier applied to the best-selected features. The proposed method is estimated on standard datasets KDD CUP'99, NSL-KDD, and UNSW-NB15. The experimental results show that the proposed approach performs better than existing methods in detection rate, false alarm rate, and accuracy.

Download Full-text

An ensemble feature selection approach using hybrid kernel based SVM for network intrusion detection system

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v23.i1.pp558-565 ◽

2021 ◽

Vol 23 (1) ◽

pp. 558

Author(s):

Gaddam Venu Gopal ◽

Gatram Rama Mohan Babu

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Network Intrusion Detection ◽

Support Vector ◽

Feature Subset ◽

Network Intrusion ◽

Feature Selection Approach ◽

Hybrid Kernel

Feature selection is a process of identifying relevant feature subset that leads to the machine learning algorithm in a well-defined manner. In this paper, anovel ensemble feature selection approach that comprises of Relief Attribute Evaluation and hybrid kernel-based support vector machine (HK-SVM) approach is proposed as a feature selection method for network intrusion detection system (NIDS). A Hybrid approach along with the combination of Gaussian and Polynomial methods is used as a kernel for support vector machine (SVM). The key issue is to select a feature subset that yields good accuracy at a minimal computational cost. The proposed approach is implemented and compared with classical SVM and simple kernel. Kyoto2006+, a bench mark intrusion detection dataset,is used for experimental evaluation and then observations are drawn.

Download Full-text

Intrusion Detection System for Malicious Traffic Using Evolutionary Search Algorithm

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200821162547 ◽

2020 ◽

Vol 13 ◽

Author(s):

Samar Al-Saqqa ◽

Mustafa Al-Fayoumi ◽

Malik Qasaimeh

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Search Algorithm ◽

Detection System ◽

Feature Subset Selection ◽

Intrusion Detection Systems ◽

Feature Subset ◽

Evolutionary Search ◽

Detection Systems

Introduction: Intrusion detection systems play a key role in system security by identifying potential attacks and giving appropriate responses. As new attacks are always emerging, intrusion detection systems must adapt to these attacks, and more work is continuously needed to develop and propose new methods and techniques that can improve efficient and effective adaptive intrusion systems. Feature selection is one of the challenging areas that need more work because of its importance and impact on the performance of intrusion detection systems. This paper applies evolutionary search algorithm in feature subset selection for intrusion detection systems. Methods: The evolutionary search algorithm for the feature subset selection is applied and two classifiers are used, Naïve Bayes and decision tree J48, to evaluate system performance before and after features selection. NSL-KDD dataset and its subsets are used in all evaluation experiments. Results: The results show that feature selection using the evolutionary search algorithm enhances the intrusion detection system with respect to detection accuracy and detection of unknown attacks. Furthermore, time performance is achieved by reducing training time, which is reflected positively in overall system performance. Discussion: The evolutionary search applied to select IDS algorithm features can be developed by modifying and enhancing mutation and crossover operators and applying new enhanced techniques in the selection process, which can give better results and enhance the performance of intrusion detection for rare and complicated attacks. Conclusion: The evolutionary search algorithm is applied to find the best subset of features for the intrusion detection system. In conclusion, it is a promising approach to be used as a feature selection method for intrusion detection. The results showed better performance for the intrusion detection system in terms of accuracy and detection rate.

Download Full-text

An intelligent intrusion detection system using genetic based feature selection and Modified J48 decision tree classifier

2013 Fifth International Conference on Advanced Computing (ICoAC) ◽

10.1109/icoac.2013.6921918 ◽

2013 ◽

Cited By ~ 6

Author(s):

B. Senthilnayaki ◽

K. Venkatalakshmi ◽

A. Kannan

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Decision Tree ◽

Intrusion Detection System ◽

Detection System ◽

Decision Tree Classifier ◽

Tree Classifier ◽

J48 Decision Tree

Download Full-text

Improved TLBO-JAYA Algorithm for Subset Feature Selection and Parameter Optimisation in Intrusion Detection System

Complexity ◽

10.1155/2020/5287684 ◽

2020 ◽

Vol 2020 ◽

pp. 1-18 ◽

Cited By ~ 1

Author(s):

Mohammad Aljanabi ◽

Mohd Arfian Ismail ◽

Vitaly Mezhuyev

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Parameter Tuning ◽

Feature Subset Selection ◽

Supervised Machine Learning ◽

Support Vector ◽

Feature Subset

Many optimisation-based intrusion detection algorithms have been developed and are widely used for intrusion identification. This condition is attributed to the increasing number of audit data features and the decreasing performance of human-based smart intrusion detection systems regarding classification accuracy, false alarm rate, and classification time. Feature selection and classifier parameter tuning are important factors that affect the performance of any intrusion detection system. In this paper, an improved intrusion detection algorithm for multiclass classification was presented and discussed in detail. The proposed method combined the improved teaching-learning-based optimisation (ITLBO) algorithm, improved parallel JAYA (IPJAYA) algorithm, and support vector machine. ITLBO with supervised machine learning (ML) technique was used for feature subset selection (FSS). The selection of the least number of features without causing an effect on the result accuracy in FSS is a multiobjective optimisation problem. This work proposes ITLBO as an FSS mechanism, and its algorithm-specific, parameterless concept (no parameter tuning is required during optimisation) was explored. IPJAYA in this study was used to update the C and gamma parameters of the support vector machine (SVM). Several experiments were performed on the prominent intrusion ML dataset, where significant enhancements were observed with the suggested ITLBO-IPJAYA-SVM algorithm compared with the classical TLBO and JAYA algorithms.

Download Full-text

Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms

Security and Communication Networks ◽

10.1155/2019/7130868 ◽

2019 ◽

Vol 2019 ◽

pp. 1-11 ◽

Cited By ~ 12

Author(s):

Jiadong Ren ◽

Jiawei Guo ◽

Wang Qian ◽

Huang Yuan ◽

Xiaobing Hao ◽

...

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Machine Learning Algorithms ◽

Training Dataset ◽

Feature Subset ◽

Data Sampling ◽

Hybrid Data ◽

Data Optimization

Intrusion detection system (IDS) can effectively identify anomaly behaviors in the network; however, it still has low detection rate and high false alarm rate especially for anomalies with fewer records. In this paper, we propose an effective IDS by using hybrid data optimization which consists of two parts: data sampling and feature selection, called DO_IDS. In data sampling, the Isolation Forest (iForest) is used to eliminate outliers, genetic algorithm (GA) to optimize the sampling ratio, and the Random Forest (RF) classifier as the evaluation criteria to obtain the optimal training dataset. In feature selection, GA and RF are used again to obtain the optimal feature subset. Finally, an intrusion detection system based on RF is built using the optimal training dataset obtained by data sampling and the features selected by feature selection. The experiment will be carried out on the UNSW-NB15 dataset. Compared with other algorithms, the model has obvious advantages in detecting rare anomaly behaviors.

Download Full-text

Performance Evaluation of Intrusion Detection System using Selected Features and Machine Learning Classifiers

Baghdad Science Journal ◽

10.21123/bsj.2021.18.2(suppl.).0884 ◽

2021 ◽

Vol 18 (2(Suppl.)) ◽

pp. 0884

Author(s):

Raja Azlina Raja Mahmood ◽

AmirHossien Abdi ◽

Masnida Hussin

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Intrusion Detection ◽

Decision Tree ◽

Network Traffic ◽

Intrusion Detection System ◽

Detection System ◽

Support Vector ◽

Large Network ◽

Tree Classifier

Some of the main challenges in developing an effective network-based intrusion detection system (IDS) include analyzing large network traffic volumes and realizing the decision boundaries between normal and abnormal behaviors. Deploying feature selection together with efficient classifiers in the detection system can overcome these problems. Feature selection finds the most relevant features, thus reduces the dimensionality and complexity to analyze the network traffic. Moreover, using the most relevant features to build the predictive model, reduces the complexity of the developed model, thus reducing the building classifier model time and consequently improves the detection performance. In this study, two different sets of selected features have been adopted to train four machine-learning based classifiers. The two sets of selected features are based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) approach respectively. These evolutionary-based algorithms are known to be effective in solving optimization problems. The classifiers used in this study are Naïve Bayes, k-Nearest Neighbor, Decision Tree and Support Vector Machine that have been trained and tested using the NSL-KDD dataset. The performance of the abovementioned classifiers using different features values was evaluated. The experimental results indicate that the detection accuracy improves by approximately 1.55% when implemented using the PSO-based selected features than that of using GA-based selected features. The Decision Tree classifier that was trained with PSO-based selected features outperformed other classifiers with accuracy, precision, recall, and f-score result of 99.38%, 99.36%, 99.32%, and 99.34% respectively. The results show that using optimal features coupling with a good classifier in a detection system able to reduce the classifier model building time, reduce the computational burden to analyze data, and consequently attain high detection rate.

Download Full-text

Performance Enhancement of Intrusion detection System Using Bagging Ensemble Technique with Feature Selection

2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE) ◽

10.1109/csde50874.2020.9411608 ◽

2020 ◽

Author(s):

Md. Mamunur Rashid ◽

Joarder Kamruzzaman ◽

Mohiuddin Ahmed ◽

Nahina Islam ◽

Santoso Wibowo ◽

...

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Performance Enhancement ◽

Detection System ◽

Ensemble Technique ◽

Bagging Ensemble

Download Full-text

Optimal feature selection for machine learning based intrusion detection system by exploiting attribute dependence

Materials Today Proceedings ◽

10.1016/j.matpr.2021.04.643 ◽

2021 ◽

Author(s):

Ghanshyam Prasad Dubey ◽

Dr. Rakesh Kumar Bhujade

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Optimal Feature Selection ◽

Selection For ◽

Optimal Feature

Download Full-text

An intelligent flow-based and signature-based IDS for SDNs using ensemble feature selection and a multi-layer machine learning-based classifier

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200850 ◽

2020 ◽

pp. 1-20

Author(s):

K. Muthamil Sudar ◽

P. Deepalakshmi

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Network Architecture ◽

Detection System ◽

Software Defined Networking ◽

Control Plane ◽

Control Logic ◽

Data Plane

Software-defined networking is a new paradigm that overcomes problems associated with traditional network architecture by separating the control logic from data plane devices. It also enhances performance by providing a highly-programmable interface that adapts to dynamic changes in network policies. As software-defined networking controllers are prone to single-point failures, providing security is one of the biggest challenges in this framework. This paper intends to provide an intrusion detection mechanism in both the control plane and data plane to secure the controller and forwarding devices respectively. In the control plane, we imposed a flow-based intrusion detection system that inspects every new incoming flow towards the controller. In the data plane, we assigned a signature-based intrusion detection system to inspect traffic between Open Flow switches using port mirroring to analyse and detect malicious activity. Our flow-based system works with the help of trained, multi-layer machine learning-based classifier, while our signature-based system works with rule-based classifiers using the Snort intrusion detection system. The ensemble feature selection technique we adopted in the flow-based system helps to identify the prominent features and hasten the classification process. Our proposed work ensures a high level of security in the Software-defined networking environment by working simultaneously in both control plane and data plane.

Download Full-text