A Family of Joint Sparse PCA Algorithms for Anomaly Localization in Network Data Streams

A great deal of research attention has been paid to data mining on data streams in recent years. In this chapter, the authors carry out a case study of anomaly detection in large, high-dimensional network connection data streams using the Stream Projected Outlier deTector (SPOT), proposed in Zhang et al. (2009) to detect anomalies in data streams through subspace analysis. SPOT is deployed on the 1999 KDD Cup anomaly detection application. The chapter also proposes innovative approaches for training data generation, anomaly classification, false positive reduction, and adaptive detection subspace generation. Experimental results demonstrate that SPOT is effective and efficient in detecting anomalies in network data streams and outperforms existing anomaly detection methods.


Adaptive Anomaly Detection on Network Data Streams

2018 IEEE International Conference on Intelligence and Security Informatics (ISI) ◽

10.1109/isi.2018.8587401 ◽

2018 ◽

Cited By ~ 1

Author(s):

Elizabeth Riddle-Workman ◽

Marina Evangelou ◽

Niall M. Adams

Keyword(s):

Anomaly Detection ◽

Data Streams ◽

Network Data


THE OPTIMIZED METHOD OF PROCESSING OF NETWORK DATA STREAMS IN RECONFIGURABLE COMPUTING SYSTEMS

Vestnik komp'iuternykh i informatsionnykh tekhnologii ◽

10.14489/vkit.2019.06.pp.039-046 ◽

2019 ◽

pp. 39-46

Keyword(s):

Data Streams ◽

Reconfigurable Computing ◽

Network Data ◽

Computing Systems


Query-Aware Partitioning for Monitoring Massive Network Data Streams

2008 IEEE 24th International Conference on Data Engineering ◽

10.1109/icde.2008.4497612 ◽

2008 ◽

Cited By ~ 6

Author(s):

Theodore Johnson ◽

S. Muthukrishnan ◽

Vladislav Shkapenyuk ◽

Oliver Spatscheck

Keyword(s):

Data Streams ◽

Network Data


What's new: Finding significant differences in network data streams

IEEE INFOCOM 2004 ◽

10.1109/infcom.2004.1354567 ◽

2004 ◽

Cited By ~ 33

Author(s):

G. Cormode ◽

S. Muthukrishnan

Keyword(s):

Data Streams ◽

Network Data ◽

New Finding


What's new: finding significant differences in network data streams

IEEE/ACM Transactions on Networking ◽

10.1109/tnet.2005.860096 ◽

2005 ◽

Vol 13 (6) ◽

pp. 1219-1232 ◽

Cited By ~ 63

Author(s):

G. Cormode ◽

S. Muthukrishnan

Keyword(s):

Data Streams ◽

Network Data ◽

New Finding


SALAD: An Exploration of Split Active Learning based Unsupervised Network Data Stream Anomaly Detection using Autoencoders

10.36227/techrxiv.14896773 ◽

2021 ◽

Author(s):

Christopher Nixon ◽

Mohamed Sedky ◽

Mohamed Hassan

Keyword(s):

Machine Learning ◽

Active Learning ◽

Intrusion Detection ◽

Anomaly Detection ◽

Data Streams ◽

Data Stream ◽

Learning Strategy ◽

Network Data ◽

Anomaly Detector ◽

Active Learning Strategy

Machine learning based intrusion detection systems monitor network data streams for cyber attacks. Challenges in this space include detection of unknown attacks, adaptation to changes in the data stream such as changes in underlying behaviour, the human cost of labeling data to retrain the machine learning model, and the processing and memory constraints of a real-time data stream. Failure to manage these factors could result in missed attacks, degraded detection performance, unnecessary expense, or delayed detection times. This research evaluated autoencoders, a type of feed-forward neural network, as online anomaly detectors for network data streams. The autoencoder method was combined with an active learning strategy to further reduce labeling cost and speed up training and adaptation times, resulting in a proposed Split Active Learning Anomaly Detector (SALAD) method. The proposed method was evaluated with the NSL-KDD, KDD Cup 1999, and UNSW-NB15 data sets, using the scikit-multiflow framework. Results demonstrated that a novel Adaptive Anomaly Threshold method, combined with a split active learning strategy, offered superior anomaly detection performance with a labeling budget of just 20%, significantly reducing the human expertise required to annotate the network data. Processing times of the autoencoder anomaly detector method were demonstrated to be significantly lower than those of traditional online learning methods, allowing for greatly improved responsiveness to attacks occurring in real time. Future research areas include applying unsupervised threshold methods, multi-label classification, sample annotation, and hybrid intrusion detection.
