Models versus Datasets: Reducing Bias through Building a Comprehensive IDS Benchmark

Rasheed Ahmad; Izzat Alsmadi; Wasim Alhamdani; Lo’ai Tawalbeh

doi:10.3390/fi13120318

Models versus Datasets: Reducing Bias through Building a Comprehensive IDS Benchmark

Future Internet ◽

10.3390/fi13120318 ◽

2021 ◽

Vol 13 (12) ◽

pp. 318

Author(s):

Rasheed Ahmad ◽

Izzat Alsmadi ◽

Wasim Alhamdani ◽

Lo’ai Tawalbeh

Keyword(s):

Deep Learning ◽

Classification Model ◽

Intrusion Detection Systems ◽

Complex Nature ◽

Learning Approaches ◽

Learning Models ◽

Detection Systems ◽

Benchmark Datasets ◽

And Performance ◽

Performance Results

Today, deep learning approaches are widely used to build Intrusion Detection Systems for securing IoT environments. However, the models’ hidden and complex nature raises various concerns, such as trusting the model output and understanding why the model made certain decisions. Researchers generally publish their proposed model’s settings and performance results based on a specific dataset and a classification model but do not report the proposed model’s output and findings. Similarly, many researchers suggest an IDS solution by focusing only on a single benchmark dataset and classifier. Such solutions are prone to generating inaccurate and biased results. This paper overcomes these limitations in previous work by analyzing various benchmark datasets and various individual and hybrid deep learning classifiers towards finding the best IDS solution for IoT that is efficient, lightweight, and comprehensive in detecting network anomalies. We also showed the model’s localized predictions and analyzed the top contributing features impacting the global performance of deep learning models. This paper aims to extract the aggregate knowledge from various datasets and classifiers and analyze the commonalities to avoid any possible bias in results and increase the trust and transparency of deep learning models. We believe this paper’s findings will help future researchers build a comprehensive IDS based on well-performing classifiers and utilize the aggregated knowledge and the minimum set of significantly contributing features.

Download Full-text

An empirical model in intrusion detection systems using principal component analysis and deep learning models

Computational Intelligence ◽

10.1111/coin.12342 ◽

2020 ◽

Author(s):

Hariharan Rajadurai ◽

Usha Devi Gandhi

Keyword(s):

Principal Component Analysis ◽

Deep Learning ◽

Intrusion Detection ◽

Empirical Model ◽

Principal Component ◽

Component Analysis ◽

Intrusion Detection Systems ◽

Learning Models ◽

Detection Systems

Download Full-text

Review on Generative Deep Learning Models and Datasets for Intrusion Detection Systems

Revue d intelligence artificielle ◽

10.18280/ria.340213 ◽

2020 ◽

Vol 34 (2) ◽

pp. 215-226

Author(s):

Gayatri Ketepalli ◽

Premamayudu Bulla

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Learning Models ◽

Detection Systems

Download Full-text

Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues

Knowledge-Based Systems ◽

10.1016/j.knosys.2019.105124 ◽

2020 ◽

Vol 189 ◽

pp. 105124 ◽

Cited By ~ 23

Author(s):

Arwa Aldweesh ◽

Abdelouahid Derhab ◽

Ahmed Z. Emam

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Learning Approaches ◽

Detection Systems ◽

Open Issues

Download Full-text

Deep learning systems detect dysplasia with human-like accuracy using histopathology and probe-based confocal laser endomicroscopy

Scientific Reports ◽

10.1038/s41598-021-84510-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Shan Guleria ◽

Tilak U. Shah ◽

J. Vincent Pulido ◽

Matthew Fasullo ◽

Lubaina Ehsan ◽

...

Keyword(s):

Deep Learning ◽

Diagnostic Accuracy ◽

High Sensitivity ◽

Confocal Laser Endomicroscopy ◽

Confocal Laser ◽

Learning Approaches ◽

Learning Models ◽

Whole Slide Image ◽

Slide Image ◽

Level Model

AbstractProbe-based confocal laser endomicroscopy (pCLE) allows for real-time diagnosis of dysplasia and cancer in Barrett’s esophagus (BE) but is limited by low sensitivity. Even the gold standard of histopathology is hindered by poor agreement between pathologists. We deployed deep-learning-based image and video analysis in order to improve diagnostic accuracy of pCLE videos and biopsy images. Blinded experts categorized biopsies and pCLE videos as squamous, non-dysplastic BE, or dysplasia/cancer, and deep learning models were trained to classify the data into these three categories. Biopsy classification was conducted using two distinct approaches—a patch-level model and a whole-slide-image-level model. Gradient-weighted class activation maps (Grad-CAMs) were extracted from pCLE and biopsy models in order to determine tissue structures deemed relevant by the models. 1970 pCLE videos, 897,931 biopsy patches, and 387 whole-slide images were used to train, test, and validate the models. In pCLE analysis, models achieved a high sensitivity for dysplasia (71%) and an overall accuracy of 90% for all classes. For biopsies at the patch level, the model achieved a sensitivity of 72% for dysplasia and an overall accuracy of 90%. The whole-slide-image-level model achieved a sensitivity of 90% for dysplasia and 94% overall accuracy. Grad-CAMs for all models showed activation in medically relevant tissue regions. Our deep learning models achieved high diagnostic accuracy for both pCLE-based and histopathologic diagnosis of esophageal dysplasia and its precursors, similar to human accuracy in prior studies. These machine learning approaches may improve accuracy and efficiency of current screening protocols.

Download Full-text

Understanding Natural Disaster Scenes from Mobile Images Using Deep Learning

Applied Sciences ◽

10.3390/app11093952 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3952

Author(s):

Shimin Tang ◽

Zhiqiang Chen

Keyword(s):

Deep Learning ◽

Natural Disaster ◽

Scene Understanding ◽

Computing Methods ◽

Classification Model ◽

Learning Approach ◽

Learning Models ◽

Damage Level ◽

Feature Extractor ◽

Mobile Imaging

With the ubiquitous use of mobile imaging devices, the collection of perishable disaster-scene data has become unprecedentedly easy. However, computing methods are unable to understand these images with significant complexity and uncertainties. In this paper, the authors investigate the problem of disaster-scene understanding through a deep-learning approach. Two attributes of images are concerned, including hazard types and damage levels. Three deep-learning models are trained, and their performance is assessed. Specifically, the best model for hazard-type prediction has an overall accuracy (OA) of 90.1%, and the best damage-level classification model has an explainable OA of 62.6%, upon which both models adopt the Faster R-CNN architecture with a ResNet50 network as a feature extractor. It is concluded that hazard types are more identifiable than damage levels in disaster-scene images. Insights are revealed, including that damage-level recognition suffers more from inter- and intra-class variations, and the treatment of hazard-agnostic damage leveling further contributes to the underlying uncertainties.

Download Full-text

Deep Learning with Neuroimaging and Genomics in Alzheimer’s Disease

International Journal of Molecular Sciences ◽

10.3390/ijms22157911 ◽

2021 ◽

Vol 22 (15) ◽

pp. 7911

Author(s):

Eugene Lin ◽

Chieh-Hsin Lin ◽

Hsien-Yuan Lane

Keyword(s):

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Deep Learning ◽

Future Research ◽

Learning Approaches ◽

Learning Models ◽

Learning Techniques ◽

Neuroimaging Data ◽

Similarities And Differences ◽

Normal Controls

A growing body of evidence currently proposes that deep learning approaches can serve as an essential cornerstone for the diagnosis and prediction of Alzheimer’s disease (AD). In light of the latest advancements in neuroimaging and genomics, numerous deep learning models are being exploited to distinguish AD from normal controls and/or to distinguish AD from mild cognitive impairment in recent research studies. In this review, we focus on the latest developments for AD prediction using deep learning techniques in cooperation with the principles of neuroimaging and genomics. First, we narrate various investigations that make use of deep learning algorithms to establish AD prediction using genomics or neuroimaging data. Particularly, we delineate relevant integrative neuroimaging genomics investigations that leverage deep learning methods to forecast AD on the basis of incorporating both neuroimaging and genomics data. Moreover, we outline the limitations as regards to the recent AD investigations of deep learning with neuroimaging and genomics. Finally, we depict a discussion of challenges and directions for future research. The main novelty of this work is that we summarize the major points of these investigations and scrutinize the similarities and differences among these investigations.

Download Full-text

Launching Adversarial Attacks against Network Intrusion Detection Systems for IoT

Journal of Cybersecurity and Privacy ◽

10.3390/jcp1020014 ◽

2021 ◽

Vol 1 (2) ◽

pp. 252-273

Author(s):

Pavlos Papadopoulos ◽

Oliver Thornewill von Essen ◽

Nikolaos Pitropakis ◽

Christos Chrysoulas ◽

Alexios Mylonas ◽

...

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Learning Models ◽

Detection Systems ◽

Network Intrusion ◽

Robust Model ◽

Significant Probability ◽

Adversarial Examples ◽

Attack Surface

As the internet continues to be populated with new devices and emerging technologies, the attack surface grows exponentially. Technology is shifting towards a profit-driven Internet of Things market where security is an afterthought. Traditional defending approaches are no longer sufficient to detect both known and unknown attacks to high accuracy. Machine learning intrusion detection systems have proven their success in identifying unknown attacks with high precision. Nevertheless, machine learning models are also vulnerable to attacks. Adversarial examples can be used to evaluate the robustness of a designed model before it is deployed. Further, using adversarial examples is critical to creating a robust model designed for an adversarial environment. Our work evaluates both traditional machine learning and deep learning models’ robustness using the Bot-IoT dataset. Our methodology included two main approaches. First, label poisoning, used to cause incorrect classification by the model. Second, the fast gradient sign method, used to evade detection measures. The experiments demonstrated that an attacker could manipulate or circumvent detection with significant probability.

Download Full-text

A survey on Deep Learning based Intrusion Detection Systems on Internet of Things

10.1109/i-smac52330.2021.9641050 ◽

2021 ◽

Author(s):

S. Tamil Slevi ◽

P. Visalakshi

Keyword(s):

Deep Learning ◽

Internet Of Things ◽

Intrusion Detection ◽

Intrusion Detection Systems ◽

Detection Systems

Download Full-text

Effect of Activation Functions on the Performance of Deep Learning Algorithms for Network Intrusion Detection Systems

Proceedings of ICETIT 2019 - Lecture Notes in Electrical Engineering ◽

10.1007/978-3-030-30577-2_84 ◽

2019 ◽

pp. 949-960

Author(s):

Neha Gupta ◽

Punam Bedi ◽

Vinita Jindal

Keyword(s):

Deep Learning ◽

Intrusion Detection ◽

Learning Algorithms ◽

Intrusion Detection Systems ◽

Network Intrusion Detection ◽

Activation Functions ◽

Detection Systems ◽

Network Intrusion ◽

Network Intrusion Detection Systems

Download Full-text

Product Review Ranking in e-Commerce using Urgency Level Classification Approach

Jurnal Online Informatika ◽

10.15575/join.v5i2.612 ◽

2020 ◽

Vol 5 (2) ◽

pp. 212

Author(s):

Hamdi Ahmad Zuhri ◽

Nur Ulfa Maulidevi

Keyword(s):

Deep Learning ◽

Classification Model ◽

Support Vector ◽

Learning Models ◽

Classification Approach ◽

Value Range ◽

High Bias ◽

Product Domains ◽

Urgency Level ◽

Bayesian Support

Review ranking is useful to give users a better experience. Review ranking studies commonly use upvote value, which does not represent urgency, and it causes problems in prediction. In contrast, manual labeling as wide as the upvote value range provides a high bias and inconsistency. The proposed solution is to use a classification approach to rank the review where the labels are ordinal urgency class. The experiment involved shallow learning models (Logistic Regression, Naïve Bayesian, Support Vector Machine, and Random Forest), and deep learning models (LSTM and CNN). In constructing a classification model, the problem is broken down into several binary classifications that predict tendencies of urgency depending on the separation of classes. The result shows that deep learning models outperform other models in classification dan ranking evaluation. In addition, the review data used tend to contain vocabulary of certain product domains, so further research is needed on data with more diverse vocabulary.

Download Full-text