Comparison of Machine Learning and Deep Learning Models for Network Intrusion Detection Systems

Niraj Thapa; Zhipeng Liu; Dukka B. KC; Balakrishna Gokaraju; Kaushik Roy

doi:10.3390/fi12100167

Comparison of Machine Learning and Deep Learning Models for Network Intrusion Detection Systems

Future Internet ◽

10.3390/fi12100167 ◽

2020 ◽

Vol 12 (10) ◽

pp. 167

Author(s):

Niraj Thapa ◽

Zhipeng Liu ◽

Dukka B. KC ◽

Balakrishna Gokaraju ◽

Kaushik Roy

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Intrusion Detection ◽

Detection Rate ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

High Detection Rate ◽

Detection Systems ◽

Network Intrusion ◽

High Detection

The development of robust anomaly-based network detection systems, which are preferred over static signal-based network intrusion, is vital for cybersecurity. The development of a flexible and dynamic security system is required to tackle the new attacks. Current intrusion detection systems (IDSs) suffer to attain both the high detection rate and low false alarm rate. To address this issue, in this paper, we propose an IDS using different machine learning (ML) and deep learning (DL) models. This paper presents a comparative analysis of different ML models and DL models on Coburg intrusion detection datasets (CIDDSs). First, we compare different ML- and DL-based models on the CIDDS dataset. Second, we propose an ensemble model that combines the best ML and DL models to achieve high-performance metrics. Finally, we benchmarked our best models with the CIC-IDS2017 dataset and compared them with state-of-the-art models. While the popular IDS datasets like KDD99 and NSL-KDD fail to represent the recent attacks and suffer from network biases, CIDDS, used in this research, encompasses labeled flow-based data in a simulated office environment with both updated attacks and normal usage. Furthermore, both accuracy and interpretability must be considered while implementing AI models. Both ML and DL models achieved an accuracy of 99% on the CIDDS dataset with a high detection rate, low false alarm rate, and relatively low training costs. Feature importance was also studied using the Classification and regression tree (CART) model. Our models performed well in 10-fold cross-validation and independent testing. CART and convolutional neural network (CNN) with embedding achieved slightly better performance on the CIC-IDS2017 dataset compared to previous models. Together, these results suggest that both ML and DL methods are robust and complementary techniques as an effective network intrusion detection system.

Download Full-text

Effect of Activation Functions on the Performance of Deep Learning Algorithms for Network Intrusion Detection Systems

Proceedings of ICETIT 2019 - Lecture Notes in Electrical Engineering ◽

10.1007/978-3-030-30577-2_84 ◽

2019 ◽

pp. 949-960

Author(s):

Neha Gupta ◽

Punam Bedi ◽

Vinita Jindal

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Activation Functions ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Download Full-text

Poisoning Attacks and Data Sanitization Mitigations for Machine Learning Models in Network Intrusion Detection Systems

10.1109/milcom52596.2021.9652916 ◽

2021 ◽

Author(s):

Sridhar Venkatesan ◽

Harshvardhan Sikka ◽

Rauf Izmailov ◽

Ritu Chadha ◽

Alina Oprea ◽

...

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Learning Models ◽

Detection Systems ◽

Data Sanitization ◽

Network Intrusion ◽

Network Intrusion Detection Systems ◽

Machine Learning Models

Download Full-text

A Deep Learning Approach for Anomaly-Based Network Intrusion Detection Systems: A Survey and an Objective Comparison

10.1007/978-3-030-82469-3_20 ◽

2021 ◽

pp. 227-235

Author(s):

Shailender Kumar ◽

Namrata Jha ◽

Nikhil Sachdeva

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Learning Approach ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Download Full-text

Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Information ◽

10.3390/info11060315 ◽

2020 ◽

Vol 11 (6) ◽

pp. 315

Author(s):

Nathan Martindale ◽

Muhammad Ismail ◽

Douglas A. Talbert

Keyword(s):

Machine Learning ◽

Random Forest ◽

Intrusion Detection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

As new cyberattacks are launched against systems and networks on a daily basis, the ability for network intrusion detection systems to operate efficiently in the big data era has become critically important, particularly as more low-power Internet-of-Things (IoT) devices enter the market. This has motivated research in applying machine learning algorithms that can operate on streams of data, trained online or “live” on only a small amount of data kept in memory at a time, as opposed to the more classical approaches that are trained solely offline on all of the data at once. In this context, one important concept from machine learning for improving detection performance is the idea of “ensembles”, where a collection of machine learning algorithms are combined to compensate for their individual limitations and produce an overall superior algorithm. Unfortunately, existing research lacks proper performance comparison between homogeneous and heterogeneous online ensembles. Hence, this paper investigates several homogeneous and heterogeneous ensembles, proposes three novel online heterogeneous ensembles for intrusion detection, and compares their performance accuracy, run-time complexity, and response to concept drifts. Out of the proposed novel online ensembles, the heterogeneous ensemble consisting of an adaptive random forest of Hoeffding Trees combined with a Hoeffding Adaptive Tree performed the best, by dealing with concept drift in the most effective way. While this scheme is less accurate than a larger size adaptive random forest, it offered a marginally better run-time, which is beneficial for online training.

Download Full-text

CNN-Based Network Intrusion Detection against Denial-of-Service Attacks

Electronics ◽

10.3390/electronics9060916 ◽

2020 ◽

Vol 9 (6) ◽

pp. 916 ◽

Cited By ~ 7

Author(s):

Jiyeon Kim ◽

Jiwon Kim ◽

Hyunjung Kim ◽

Minsun Shim ◽

Eunjung Choi

Keyword(s):

Neural Network ◽

Deep Learning ◽

Intrusion Detection ◽

Denial Of Service ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

National Defense ◽

Dos Attacks ◽

Detection Systems ◽

Network Intrusion

As cyberattacks become more intelligent, it is challenging to detect advanced attacks in a variety of fields including industry, national defense, and healthcare. Traditional intrusion detection systems are no longer enough to detect these advanced attacks with unexpected patterns. Attackers bypass known signatures and pretend to be normal users. Deep learning is an alternative to solving these issues. Deep Learning (DL)-based intrusion detection does not require a lot of attack signatures or the list of normal behaviors to generate detection rules. DL defines intrusion features by itself through training empirical data. We develop a DL-based intrusion model especially focusing on denial of service (DoS) attacks. For the intrusion dataset, we use KDD CUP 1999 dataset (KDD), the most widely used dataset for the evaluation of intrusion detection systems (IDS). KDD consists of four types of attack categories, such as DoS, user to root (U2R), remote to local (R2L), and probing. Numerous KDD studies have been employing machine learning and classifying the dataset into the four categories or into two categories such as attack and benign. Rather than focusing on the broad categories, we focus on various attacks belonging to same category. Unlike other categories of KDD, the DoS category has enough samples for training each attack. In addition to KDD, we use CSE-CIC-IDS2018 which is the most up-to-date IDS dataset. CSE-CIC-IDS2018 consists of more advanced DoS attacks than that of KDD. In this work, we focus on the DoS category of both datasets and develop a DL model for DoS detection. We develop our model based on a Convolutional Neural Network (CNN) and evaluate its performance through comparison with an Recurrent Neural Network (RNN). Furthermore, we suggest the optimal CNN design for the better performance through numerous experiments.

Download Full-text

Enhanced Network Intrusion Detection System

Sensors ◽

10.3390/s21237835 ◽

2021 ◽

Vol 21 (23) ◽

pp. 7835

Author(s):

Ketan Kotecha ◽

Raghav Verma ◽

Prahalad V. Rao ◽

Priyanshu Prasad ◽

Vipul Kumar Mishra ◽

...

Keyword(s):

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Network Intrusion Detection ◽

High Detection Rate ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection System ◽

Network Intrusion Detection Systems ◽

High Detection

A reasonably good network intrusion detection system generally requires a high detection rate and a low false alarm rate in order to predict anomalies more accurately. Older datasets cannot capture the schema of a set of modern attacks; therefore, modelling based on these datasets lacked sufficient generalizability. This paper operates on the UNSW-NB15 Dataset, which is currently one of the best representatives of modern attacks and suggests various models. We discuss various models and conclude our discussion with the model that performs the best using various kinds of evaluation metrics. Alongside modelling, a comprehensive data analysis on the features of the dataset itself using our understanding of correlation, variance, and similar factors for a wider picture is done for better modelling. Furthermore, hypothetical ponderings are discussed for potential network intrusion detection systems, including suggestions on prospective modelling and dataset generation as well.

Download Full-text

On evaluation of Network Intrusion Detection Systems: Statistical analysis of CIDDS-001 dataset using Machine Learning Techniques

10.36227/techrxiv.11454276.v1 ◽

2019 ◽

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Machine Learning Techniques ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Learning Techniques ◽

Network Intrusion Detection Systems ◽

Secure Networks

In the era of digital revolution, a huge amount of data is being generated from different networks on a daily basis. Security of this data is of utmost importance. Intrusion Detection Systems are found to be one the best solutions towards detecting intrusions. Network Intrusion Detection Systems are employed as a defence system to secure networks. Various techniques for the effective development of these defence systems have been proposed in the literature. However, the research on the development of datasets used for training and testing purpose of such defence systems is equally concerned. Better datasets improve the online and offline intrusion detection capability of detection model. Benchmark datasets like KDD 99 and NSL-KDD cup 99 obsolete and do not contain network traces of modern attacks like Denial of Service, hence are unsuitable for the evaluation purpose. In this work, a detailed analysis of CIDDS-001 dataset has been done and presented. We have used different well-known machine learning techniques for analysing the complexity of the dataset. Eminent evaluation metrics including Detection Rate, Accuracy, False Positive Rate, Kappa statistics, Root mean squared error have been used to show the performance of employed machine learning techniques.

Download Full-text

Statistical analysis of CIDDS-001 dataset for Network Intrusion Detection Systems using Distance-based Machine Learning

Procedia Computer Science ◽

10.1016/j.procs.2017.12.091 ◽

2018 ◽

Vol 125 ◽

pp. 709-716 ◽

Cited By ~ 27

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Statistical Analysis ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Download Full-text

On evaluation of Network Intrusion Detection Systems: Statistical analysis of CIDDS-001 dataset using Machine Learning Techniques

10.36227/techrxiv.11454276 ◽

2019 ◽

Author(s):

Abhishek Verma ◽

Virender Ranga

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Machine Learning Techniques ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Learning Techniques ◽

Network Intrusion Detection Systems ◽

Secure Networks

Download Full-text

Evading Machine Learning Based Network Intrusion Detection Systems with GANs

10.1002/9781119723950.ch17 ◽

2021 ◽

pp. 335-356

Author(s):

Bolor‐Erdene Zolbayar ◽

Ryan Sheatsley ◽

Patrick McDaniel ◽

Mike Weisman

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Download Full-text