Traffic classification at the radio spectrum level using deep learning models trained with synthetic data

Tom De Schepper; Miguel Camelo; Jeroen Famaey; Steven Latré

doi:10.1002/nem.2100

Ransomware Traffic Classification Using Deep Learning Models

International Journal of Web Portals ◽

10.4018/ijwp.2020010101 ◽

2020 ◽

Vol 12 (1) ◽

pp. 1-11

Author(s):

Arivudainambi D. ◽

Varun Kumar K.A. ◽

Vinoth Kumar R. ◽

Visu P.

Keyword(s):

Deep Learning ◽

Real Time ◽

Network Traffic ◽

Classification Model ◽

Traffic Classification ◽

Learning Models ◽

Learning Methods ◽

Novel Method

Ransomware is malware that encrypts a system's data with modern encryption techniques, and the data is recovered once a ransom is paid. In this research, the authors show how ransomware propagates and infects devices. Live traffic classifications of ransomware have been meticulously analyzed. Further, a novel method for classifying ransomware traffic using deep learning methods is presented. Based on this classification, ransomware detection is approached through the characteristics of the network traffic and its communications. In more detail, the behavior of a popular ransomware, CryptoWall, is analyzed and, based on this knowledge, a real-time ransomware live traffic classification model is proposed.

Training Deep Learning Models via Synthetic Data: Application in Unmanned Aerial Vehicles

Computer Analysis of Images and Patterns - Communications in Computer and Information Science ◽

10.1007/978-3-030-29930-9_8 ◽

2019 ◽

pp. 81-90 ◽

Cited By ~ 2

Author(s):

Andreas Kamilaris ◽

Corjan van den Brink ◽

Savvas Karatsiolis

Keyword(s):

Deep Learning ◽

Unmanned Aerial Vehicles ◽

Synthetic Data ◽

Learning Models ◽

Aerial Vehicles ◽

Data Application

Auditing and Debugging Deep Learning Models via Flip Points: Individual-Level and Group-Level Analysis

La Matematica ◽

10.1007/s44007-021-00003-w ◽

2021 ◽

Author(s):

Roozbeh Yousefzadeh ◽

Dianne P. O’Leary

Keyword(s):

Deep Learning ◽

Synthetic Data ◽

Numerical Linear Algebra ◽

Optimization Techniques ◽

Mathematical Function ◽

Learning Models ◽

Individual Level ◽

Deep Model ◽

Social Applications ◽

Decision Boundaries

Deep learning models have been criticized for their lack of easy interpretation, which undermines confidence in their use for important applications. Nevertheless, they are consistently utilized in many applications consequential to human lives, usually because of their better performance. Therefore, there is a great need for computational methods that can explain, audit, and debug such models. Here, we use flip points to accomplish these goals for deep learning classifiers used in social applications. A trained deep learning classifier is a mathematical function that maps inputs to classes. By way of training, the function partitions its domain and assigns a class to each of the partitions. Partitions are defined by the decision boundaries, which are expected to be geometrically complex. This complexity is usually what makes deep learning models powerful classifiers. Flip points are points on those boundaries and, therefore, the key to understanding and changing the functional behavior of models. We use advanced numerical optimization techniques and state-of-the-art methods in numerical linear algebra, such as rank determination and reduced-order models, to compute and analyze them. The resulting insight into the decision boundaries of a deep model can clearly explain the model’s output on the individual level, via an explanation report that is understandable by non-experts. We also develop a procedure to understand and audit model behavior towards groups of people. We show that examining decision boundaries of models in certain subspaces can reveal hidden biases that are not easily detectable. Flip points can also be used as synthetic data to alter the decision boundaries of a model and improve its functional behavior. We demonstrate our methods by investigating several models trained on standard datasets used in social applications of machine learning. We also identify the features that are most responsible for particular classifications and misclassifications. Finally, we discuss the implications of our auditing procedure in the public policy domain.
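
To make the notion of a flip point concrete, the following is a minimal sketch for the simplest possible case, not the authors' method: a binary linear classifier with score w·x + b, whose decision boundary is the hyperplane w·x + b = 0. There the closest flip point to an input has a closed form (the orthogonal projection onto the hyperplane). The function name, weights, bias, and input are hypothetical; for deep models the abstract describes numerical optimization instead.

```python
import math

def closest_flip_point(x, w, b):
    """Closest point to x on the boundary w.x + b = 0 of a linear classifier.

    Illustrative only: a linear model is an assumption made here so that
    the flip point can be written in closed form.
    """
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    norm_sq = sum(wi * wi for wi in w)
    if norm_sq == 0:
        raise ValueError("w must be non-zero")
    step = score / norm_sq  # signed step along w that reaches the boundary
    return [xi - step * wi for xi, wi in zip(x, w)]

# Hypothetical model and input.
w, b = [2.0, -1.0], 0.5
x = [1.0, 1.0]
flip = closest_flip_point(x, w, b)

# Score at the flip point (zero up to rounding) and distance from x to it.
flip_score = sum(wi * fi for wi, fi in zip(w, flip)) + b
distance = math.dist(x, flip)
```

The distance from an input to its closest flip point indicates how small a change would flip the predicted class, which is the kind of quantity an individual-level explanation can report.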

PhD Forum: Data traffic classification using deep learning models

2021 IEEE 22nd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM) ◽

10.1109/wowmom51794.2021.00035 ◽

2021 ◽

Author(s):

Meenaxi M Raikar

Keyword(s):

Deep Learning ◽

Traffic Classification ◽

Data Traffic ◽

Learning Models

Payload-Based Traffic Classification Using Multi-Layer LSTM in Software Defined Networks

Applied Sciences ◽

10.3390/app9122550 ◽

2019 ◽

Vol 9 (12) ◽

pp. 2550 ◽

Cited By ~ 1

Author(s):

Lim ◽

Kim ◽

Hong ◽

Han

Keyword(s):

Deep Learning ◽

Network Traffic ◽

Short Term Memory ◽

Single Layer ◽

Classification Performance ◽

Traffic Classification ◽

Software Defined Networks ◽

Learning Models ◽

Network Operator ◽

Network Traffic Classification

Recently, with the advent of various Internet of Things (IoT) applications, a massive amount of network traffic is being generated. A network operator must provide different quality of service according to the service provided by each application. Toward this end, many studies have investigated how to classify various types of application network traffic accurately. In particular, since many applications use temporary or dynamic IP addresses or port numbers in the IoT environment, payload-only network traffic classification is more suitable than classification that uses packet header information as well as the payload. Furthermore, to respond automatically to various applications, it is necessary to classify traffic using deep learning without network operator intervention. In this study, we propose a traffic classification scheme using a deep learning model in software defined networks. We generate flow-based payload datasets through our own network traffic pre-processing, and train two deep learning models to perform network traffic classification: 1) a multi-layer long short-term memory (LSTM) model and 2) a combination of a convolutional neural network and a single-layer LSTM model. We also execute a model tuning procedure to find the optimal hyper-parameters of the two deep learning models. Lastly, we analyze the network traffic classification performance of the two deep learning models on the basis of the F1-score, and show the superiority of the multi-layer LSTM model for network packet classification.
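
The abstract does not describe the pre-processing itself, so the following is only a hedged sketch of one plausible way to build flow-based payload inputs: group packets by a flow key, concatenate their payloads, truncate or zero-pad to a fixed length, and scale bytes to [0, 1]. The packet format, the flow key, the function name, and the vector length are all assumptions made for illustration.

```python
from collections import defaultdict

def flow_payload_vectors(packets, length=16):
    """Return one fixed-length payload vector per flow.

    packets: iterable of (flow_key, payload_bytes) pairs. This input format
    is hypothetical; the paper's own pre-processing is not specified here.
    """
    flows = defaultdict(bytearray)
    for flow_key, payload in packets:
        flows[flow_key].extend(payload)  # concatenate payloads in arrival order
    vectors = {}
    for flow_key, data in flows.items():
        clipped = bytes(data[:length]).ljust(length, b"\x00")  # truncate, then zero-pad
        vectors[flow_key] = [byte / 255.0 for byte in clipped]  # scale bytes to [0, 1]
    return vectors

# Hypothetical packets; the header 5-tuple is used only to group packets.
tcp_flow = ("10.0.0.1", 5000, "10.0.0.2", 80, "tcp")
udp_flow = ("10.0.0.3", 5001, "10.0.0.2", 53, "udp")
packets = [
    (tcp_flow, b"GET /"),
    (udp_flow, b"\xff\x00"),
    (tcp_flow, b" HTTP/1.1"),
]
vectors = flow_payload_vectors(packets, length=8)
```

In this sketch the header fields serve only as a grouping key and never enter the feature vector, which matches the payload-only idea in the abstract; a sequence model such as an LSTM would then consume each vector.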

Synthetic Data Generation for Deep Learning Models

10.35199/dfx2021.11 ◽

2021 ◽

Author(s):

Martin Denk ◽

Christoph Petroll

Keyword(s):

Deep Learning ◽

Synthetic Data ◽

Data Generation ◽

Learning Models ◽

Synthetic Data Generation

Application-Based Online Traffic Classification with Deep Learning Models on SDN Networks

Advances in Technology Innovation ◽

10.46604/aiti.2020.4286 ◽

2020 ◽

Author(s):

Lin-Huang Chang ◽

Tsung-Han Lee ◽

Hung-Chi Chu ◽

Cheng-Wei Su

Keyword(s):

Deep Learning ◽

Traffic Classification ◽

Software Defined Network ◽

Learning Models ◽

Learning Mechanisms ◽

Network Applications ◽

Performance Analyses ◽

Online Traffic ◽

Deep Learning Model ◽

Sdn Controller

Traffic classification based on network applications is an important issue for network management. In this paper, we propose application-based online and offline traffic classification, based on deep learning mechanisms, over a software-defined network (SDN) testbed. The designed deep learning model, residing in the SDN controller, consists of a multilayer perceptron (MLP), a convolutional neural network (CNN), and a Stacked Auto-Encoder (SAE). We employ an open network traffic dataset with the seven most popular applications as the deep learning training and testing datasets. Using the TCPreplay tool, the dataset traffic samples are reproduced and analyzed in our SDN testbed to emulate the online traffic service. The performance analyses, in terms of accuracy, precision, recall, and F1 indicators, are conducted and compared across the three deep learning models.
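
The four indicators named above have standard definitions, and the sketch below computes them for a multi-class classifier from true and predicted labels. The function name and the application labels are hypothetical, and averaging per-class F1 with equal weights (macro averaging) is an assumption, since the abstract does not say how per-class scores were combined.

```python
def evaluate(y_true, y_pred):
    """Return (accuracy, macro_f1, per_class) for multi-class predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    per_class = {}
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)  # true positives
        fp = sum(t != label and p == label for t, p in pairs)  # false positives
        fn = sum(t == label and p != label for t, p in pairs)  # false negatives
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        per_class[label] = {"precision": precision, "recall": recall, "f1": f1}
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    macro_f1 = sum(m["f1"] for m in per_class.values()) / len(per_class)
    return accuracy, macro_f1, per_class

# Hypothetical application labels, not the seven applications of the dataset.
y_true = ["video", "video", "voip", "web", "web", "web"]
y_pred = ["video", "voip", "voip", "web", "web", "video"]
accuracy, macro_f1, per_class = evaluate(y_true, y_pred)
```

Reporting precision and recall per application alongside accuracy shows whether one dominant application hides poor performance on the others.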

Levenshtein Augmentation Improves Performance of SMILES Based Deep-Learning Synthesis Prediction

10.26434/chemrxiv.12562121 ◽

2020 ◽

Author(s):

Dean Sumner ◽

Jiazhen He ◽

Amol Thakkar ◽

Ola Engkvist ◽

Esben Jannik Bjerrum

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Deep Learning ◽

Recurrent Neural Networks ◽

Data Augmentation ◽

State Of The Art ◽

Sequence Similarity ◽

Learning Models ◽

Underlying Network

SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models compared to non-augmented baselines. Here, we propose a novel data augmentation method we call “Levenshtein augmentation”, which considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state-of-the-art models: a transformer and a sequence-to-sequence recurrent neural network with attention. Levenshtein augmentation demonstrated increased performance over non-augmented and conventionally SMILES-randomization-augmented data when used for training of baseline models. Furthermore, Levenshtein augmentation seemingly results in what we define as attentional gain: an enhancement in the pattern recognition capabilities of the underlying network to molecular motifs.
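
The Levenshtein (edit) distance underlying the method's name can be sketched as below, together with one hypothetical way to use it: among several equivalent SMILES strings for a reactant, keep the one closest to the product string. The abstract does not give the authors' pairing procedure, so `closest_variant` is an assumption for illustration, and the candidate strings are supplied by hand rather than generated by a cheminformatics toolkit.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, and
    substitutions needed to turn string a into string b."""
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]

def closest_variant(candidates, target):
    """Hypothetical helper: the candidate string with the smallest distance."""
    return min(candidates, key=lambda s: levenshtein(s, target))

# Toy example: three SMILES strings for ethanol, paired with acetaldehyde.
reactant_variants = ["OCC", "C(O)C", "CCO"]
product = "CC=O"
best = closest_variant(reactant_variants, product)
```

Here `best` is `"CCO"`, which differs from the product by a single inserted character, so the reactant and product strings share most of their local sub-sequence.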

Improving the Accuracy of Protein-Ligand Binding Affinity Prediction by Deep Learning Models: Benchmark and Model

10.26434/chemrxiv.9866912 ◽

2019 ◽

Author(s):

Mohammad Rezaei ◽

Yanjun Li ◽

Xiaolin Li ◽

Chenglong Li

Keyword(s):

Deep Learning ◽

Drug Design ◽

Binding Affinity ◽

Benchmark Dataset ◽

Rational Drug Design ◽

Learning Models ◽

Structure Based Drug Design ◽

Binding Affinity Prediction ◽

Affinity Prediction ◽

Rational Drug

Introduction: The ability to discriminate among ligands binding to the same protein target in terms of their relative binding affinity lies at the heart of structure-based drug design. Any improvement in the accuracy and reliability of binding affinity prediction methods decreases the discrepancy between experimental and computational results. Objectives: The primary objectives were to find the most relevant features affecting binding affinity prediction, to minimize manual feature engineering, and to improve the reliability of binding affinity prediction using efficient deep learning models by tuning the model hyperparameters. Methods: The binding site of each target protein was represented as a grid box around its bound ligand. Both binary and distance-dependent occupancies were examined for how an atom affects its neighbor voxels in this grid. A combination of different features, including ANOLEA, ligand elements, and Arpeggio atom types, was used to represent the input. An efficient convolutional neural network (CNN) architecture, DeepAtom, was developed, trained, and tested on the PDBbind v2016 dataset. Additionally, an extended benchmark dataset was compiled to train and evaluate the models. Results: The best DeepAtom model showed improved accuracy in binding affinity prediction on the PDBbind core subset (Pearson’s R=0.83) and is better than the recent state-of-the-art models in this field. In addition, when the DeepAtom model was trained on our proposed benchmark dataset, it yielded a higher correlation than the baseline, which confirms the value of our model. Conclusions: The promising results for the predicted binding affinities are expected to pave the way for embedding deep learning models in virtual screening and rational drug design fields.
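
The difference between binary and distance-dependent occupancy in the Methods can be illustrated with a small voxel grid. This is a sketch under stated assumptions, not the DeepAtom featurization: the grid size, the spacing, the Gaussian falloff exp(-d²/(2σ²)), and the function name are all hypothetical, and the per-atom feature channels (ANOLEA, elements, Arpeggio types) are omitted.

```python
import math
from itertools import product

def occupancy_grid(atoms, center, size=4, spacing=1.0, sigma=None):
    """Map atom coordinates onto a size x size x size voxel grid.

    sigma=None : binary occupancy, 1.0 for a voxel that contains an atom.
    sigma > 0  : distance-dependent occupancy, using an assumed Gaussian
                 falloff from each atom to the voxel center (max over atoms).
    Returns a dict keyed by voxel index (i, j, k).
    """
    half = (size - 1) / 2.0
    grid = {}
    for index in product(range(size), repeat=3):
        voxel = [c + (i - half) * spacing for c, i in zip(center, index)]
        value = 0.0
        for atom in atoms:
            if sigma is None:
                inside = all(abs(a - v) <= spacing / 2 for a, v in zip(atom, voxel))
                value = max(value, 1.0 if inside else 0.0)
            else:
                d2 = sum((a - v) ** 2 for a, v in zip(atom, voxel))
                value = max(value, math.exp(-d2 / (2 * sigma ** 2)))
        grid[index] = value
    return grid

# One hypothetical atom, placed exactly at the center of voxel (2, 2, 2).
atoms = [(0.5, 0.5, 0.5)]
binary = occupancy_grid(atoms, center=(0.0, 0.0, 0.0))
smooth = occupancy_grid(atoms, center=(0.0, 0.0, 0.0), sigma=1.0)
```

With binary occupancy the atom marks only the voxel that contains it, whereas the distance-dependent variant also assigns decaying values to neighboring voxels, which is the contrast the Methods sentence refers to.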

Data science in economics: comprehensive review of advanced machine learning and deep learning methods

10.31232/osf.io/4pxq2 ◽

2020 ◽

Author(s):

Saeed Nosratabadi ◽

Amir Mosavi ◽

Puhong Duan ◽

Pedram Ghamisi ◽

Ferdinand Filip ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Data Science ◽

State Of The Art ◽

Science Methods ◽

Learning Models ◽

Diverse Range ◽

Hybrid Machine ◽

Economics Research

This paper provides a state-of-the-art investigation of advances in data science in emerging economic applications. The analysis was performed on novel data science methods in four individual classes: deep learning models, hybrid deep learning models, hybrid machine learning, and ensemble models. Application domains include a wide and diverse range of economics research, from the stock market, marketing, and e-commerce to corporate banking and cryptocurrency. The PRISMA method, a systematic literature review methodology, was used to ensure the quality of the survey. The findings reveal that the trends follow the advancement of hybrid models, which, based on the accuracy metric, outperform other learning algorithms. It is further expected that the trends will converge toward the advancement of sophisticated hybrid deep learning models.
