ConvProtoNet: Deep Prototype Induction towards Better Class Representation for Few-Shot Malware Classification

Zhijie Tang; Peng Wang; Junfeng Wang

doi:10.3390/app10082847

Applied Sciences ◽

10.3390/app10082847 ◽

2020 ◽

Vol 10 (8) ◽

pp. 2847 ◽

Cited By ~ 2

Author(s):

Zhijie Tang ◽

Peng Wang ◽

Junfeng Wang

Keyword(s):

Large Datasets ◽

Fine Tuning ◽

General Knowledge ◽

Classification Methods ◽

Malware Classification ◽

Class Level ◽

Meta Learning ◽

Prototype Reduction ◽

Neural Network Structure ◽

Class Representation

Traditional malware classification relies on known malware types and on large, manually labeled datasets, which limits its ability to recognize new malware classes. For unknown malware types or new variants of existing malware with only a few samples per class, common classification methods often fail to work well due to severe overfitting. In this paper, we propose a new neural network structure called ConvProtoNet, which employs few-shot learning to address the problem of scarce malware samples while preventing overfitting. We design a convolutional induction module to replace the insufficient prototype reduction used in most few-shot models and to generate more appropriate class-level malware prototypes for classification. We also adopt a meta-learning scheme to make the classifier robust enough to adapt to unseen malware classes without fine-tuning. Even in extreme conditions where only 5 samples per class are provided, ConvProtoNet still achieves more than 70% accuracy on average and outperforms other traditional malware classification methods and existing few-shot models in experiments conducted on several datasets. Additional cross-dataset experiments illustrate that ConvProtoNet learns general, dataset-invariant knowledge of malware, and careful model analysis proves the effectiveness of ConvProtoNet in few-shot malware classification.
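
For orientation, the kind of prototype reduction the abstract refers to can be sketched as follows. This is a minimal illustration of a mean-prototype baseline in the style of prototypical networks, not the ConvProtoNet induction module itself; the embeddings, class labels, and function names are hypothetical.

```python
import math

def mean_prototype(embeddings):
    """Average the support embeddings of one class into a single prototype."""
    dim = len(embeddings[0])
    return [sum(e[i] for e in embeddings) / len(embeddings) for i in range(dim)]

def classify(query, prototypes):
    """Return the label whose prototype is nearest to the query (Euclidean)."""
    return min(prototypes, key=lambda label: math.dist(query, prototypes[label]))

# Hypothetical 2-D embeddings for a 2-way, 3-shot episode.
support = {
    "family_a": [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2]],
    "family_b": [[1.0, 0.9], [0.8, 1.1], [0.9, 1.0]],
}
prototypes = {label: mean_prototype(vecs) for label, vecs in support.items()}
prediction = classify([0.7, 0.8], prototypes)
print(prediction)
```

A learned induction module would replace `mean_prototype` with a trainable mapping from the support set to a class-level prototype; the nearest-prototype step stays the same.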

Train Fast While Reducing False Positives: Improving Animal Classification Performance Using Convolutional Neural Networks

Geomatics ◽

10.3390/geomatics1010004 ◽

2021 ◽

Vol 1 (1) ◽

pp. 34-49

Author(s):

Mael Moreni ◽

Jerome Theau ◽

Samuel Foucher

Keyword(s):

False Positive ◽

Classification Performance ◽

Large Datasets ◽

False Positives ◽

Fine Tuning ◽

Training Time ◽

In The Wild ◽

Test Sets ◽

High Level ◽

Time Decrease

The combination of unmanned aerial vehicles (UAVs) with deep learning models has the capacity to replace manned aircraft for wildlife surveys. However, the scarcity of animals in the wild often leads to highly unbalanced, large datasets for which even a good detection method can return a large number of false detections. Our objectives in this paper were to design a training method that would reduce training time, decrease the number of false positives and alleviate the fine-tuning effort of an image classifier in the context of animal surveys. We acquired two highly unbalanced datasets of deer images with a UAV and trained a ResNet-18 classifier using hard-negative mining and a series of recent techniques. Our method achieved sub-decimal false positive rates on two test sets (1 false positive per 19,162 and 213,312 negatives respectively), while training on small but relevant fractions of the data. The resulting training times were therefore significantly shorter than they would have been using the whole datasets. This high level of efficiency was achieved with little tuning effort and using simple techniques. We believe this parsimonious approach to dealing with highly unbalanced, large datasets could be particularly useful to projects with either limited resources or extremely large datasets.
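
To put the reported figures on a common scale, the short sketch below converts "1 false positive per N negatives" into a rate. It only restates the two counts given in the abstract; the function and dictionary names are illustrative.

```python
def false_positive_rate(false_positives, negatives):
    """False positives divided by the total number of negative samples."""
    return false_positives / negatives

# Counts taken from the abstract: 1 false positive per 19,162 and 213,312 negatives.
rates = {
    "test_set_1": false_positive_rate(1, 19_162),
    "test_set_2": false_positive_rate(1, 213_312),
}
for name, rate in rates.items():
    print(f"{name}: {rate:.6%}")
```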

Cost-sensitive meta-learning framework

Journal of Modelling in Management ◽

10.1108/jm2-03-2021-0065 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Samar Ali Shilbayeh ◽

Sunil Vadera

Keyword(s):

Machine Learning ◽

Learning System ◽

Classification Algorithms ◽

Data Sets ◽

Classification Methods ◽

Content Type ◽

Learning Framework ◽

Cost Sensitive Classification ◽

Meta Learning ◽

Training Examples

Purpose: This paper describes the use of a meta-learning framework for recommending cost-sensitive classification methods, with the aim of answering an important question that arises in machine learning: “Among all the available classification algorithms, and considering a specific type of data and cost, which is the best algorithm for my problem?”

Design/methodology/approach: The framework is based on the idea of applying machine learning techniques to discover knowledge about the performance of different machine learning algorithms. It includes components that repeatedly apply different classification methods to data sets and measure their performance. The characteristics of the data sets, combined with the algorithms and their performance, provide the training examples. A decision tree algorithm is applied to the training examples to induce the knowledge, which can then be used to recommend algorithms for new data sets. The paper contributes to both meta-learning and cost-sensitive machine learning. Neither field is new; the contribution is a recommender that recommends the optimal cost-sensitive approach for a given data problem.

Findings: The proposed solution is implemented in WEKA and evaluated by applying it to different data sets and comparing the results with existing studies available in the literature. The results show that the developed meta-learning solution produces better results than METAL, a well-known meta-learning system. The developed solution takes the misclassification cost into consideration during the learning process, which is not available in the compared project.

Originality/value: The paper presents a major piece of new information in writing for the first time. Meta-learning work has been done before, but this paper presents a new meta-learning framework that is cost-sensitive.
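
One way to picture how a single meta-level training example could be produced is sketched below: each candidate classifier is scored by total misclassification cost, and the cheapest one becomes the label attached to the data set's characteristics. The cost matrix, confusion matrices, classifier names, and meta-feature names are all hypothetical, and this is plain Python rather than the WEKA implementation the abstract mentions.

```python
def total_cost(confusion, cost):
    """Sum cost[i][j] * confusion[i][j] over actual class i and predicted class j."""
    return sum(
        cost[i][j] * confusion[i][j]
        for i in range(len(confusion))
        for j in range(len(confusion[i]))
    )

# Hypothetical cost matrix: a missed positive (row 1, column 0) costs 5x a false alarm.
cost = [[0, 1],
        [5, 0]]

# Hypothetical confusion matrices for two classifiers on the same 150-instance data set.
results = {
    "classifier_a": [[90, 10], [2, 48]],  # 138 correct, cost 10*1 + 2*5 = 20
    "classifier_b": [[98, 2], [8, 42]],   # 140 correct, cost 2*1 + 8*5 = 42
}
best = min(results, key=lambda name: total_cost(results[name], cost))

# One meta-level training example: data-set characteristics plus the winning label.
meta_example = {"n_instances": 150, "n_classes": 2, "cost_ratio": 5, "best": best}
print(meta_example)
```

In this toy case the more accurate classifier is not the cheaper one, which is the situation a cost-sensitive recommender is meant to capture. A decision tree would then be trained on many such examples.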

Malware Classification Using Simhash Encoding and PCA (MCSP)

Symmetry ◽

10.3390/sym12050830 ◽

2020 ◽

Vol 12 (5) ◽

pp. 830

Author(s):

Young-Man Kwon ◽

Jae-Ju An ◽

Myung-Jae Lim ◽

Seongsoo Cho ◽

Won-Mo Gal

Keyword(s):

Covariance Matrix ◽

Computer Systems ◽

Classification Methods ◽

Imbalanced Dataset ◽

Malware Classification ◽

Maximum Accuracy ◽

Average Accuracy ◽

Executable File ◽

Linear Transform ◽

Malicious Program

Malware is any malicious program that can attack the security of other computer systems for various purposes. The threat of malware has significantly increased in recent years. To protect our computer systems, we need to analyze an executable file to decide whether it is malicious or not. In this paper, we propose two malware classification methods: malware classification using Simhash and PCA (MCSP), and malware classification using Simhash and linear transform (MCSLT). The former method combines Simhash encoding and PCA, which uses the symmetrical covariance matrix, and the latter combines Simhash encoding and a linear transform layer. To verify the performance of our methods, we compared them with basic malware classification using Simhash and CNN (MCSC) with tanh and ReLU activations. We used a highly imbalanced dataset with 10,736 samples. As a result, our MCSP method showed the best performance, with a maximum accuracy of 98.74%, an average accuracy of 98.59%, and an average F1 score of 99.2%. In addition, the MCSLT method showed better performance than MCSC in accuracy and F1 score.
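
As background on the encoding step, a generic Simhash fingerprint can be sketched as below. This is the standard per-bit majority vote over token hashes, not necessarily the exact variant used in the paper; the choice of SHA-256 as the underlying hash, the 64-bit width, and the opcode-like tokens are assumptions made for illustration.

```python
import hashlib

def simhash(tokens, bits=64):
    """Simhash fingerprint: a per-bit majority vote over the hashes of all tokens."""
    votes = [0] * bits
    for token in tokens:
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[: bits // 8], "big")  # assumes bits is a multiple of 8
        for i in range(bits):
            votes[i] += 1 if (h >> i) & 1 else -1
    return sum(1 << i for i in range(bits) if votes[i] > 0)

def hamming(a, b):
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")

# Hypothetical opcode-like token sequences from two executables.
sample_a = ["push", "mov", "call", "pop", "ret"]
sample_b = ["push", "mov", "call", "pop", "jmp"]
distance = hamming(simhash(sample_a), simhash(sample_b))
print(distance)
```

The fixed-width fingerprint can then be unpacked into a bit vector and passed to PCA or to a learned layer.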

A Review of Malware Classification Methods using Machine Learning

SSRN Electronic Journal ◽

10.2139/ssrn.3769906 ◽

2021 ◽

Author(s):

Nishidh Singh Shekhawat ◽

Rejo Mathew

Keyword(s):

Machine Learning ◽

Classification Methods ◽

Malware Classification

MANNWARE: A Malware Classification Approach with a Few Samples Using a Memory Augmented Neural Network

Information ◽

10.3390/info11010051 ◽

2020 ◽

Vol 11 (1) ◽

pp. 51 ◽

Cited By ~ 1

Author(s):

Kien Tran ◽

Hiroshi Sato ◽

Masao Kubo

Keyword(s):

Neural Network ◽

Language Processing ◽

Fine Tuning ◽

Sources Of Information ◽

Classification Problems ◽

Malware Classification ◽

Feature Spaces ◽

N Gram ◽

The One ◽

Processing Techniques

The ability to stop malware as soon as it starts spreading will always play an important role in defending computer systems. It would be a huge benefit for organizations as well as society if intelligent defense systems could themselves detect and prevent new types of malware when only a tiny number of samples have been revealed. The approach introduced in this paper takes advantage of one-shot/few-shot learning algorithms to solve malware classification problems using a Memory Augmented Neural Network in combination with Natural Language Processing techniques such as word2vec and n-grams. We embed the malware’s API calls, which are very valuable sources of information for identifying malware’s behaviors, in different feature spaces, and then feed them to the one-shot/few-shot learning models. Evaluating the model on two datasets (FFRI 2017 and APIMDS) shows that the models with different parameters could yield high accuracy on malware classification with only a few samples. For example, on the APIMDS dataset, it was able to classify 78.85% of samples correctly after seeing only nine malware samples and 89.59% after fine-tuning with a few other samples. The results confirmed very good accuracy compared to other traditional methods, and point to a new area of malware research.
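
The n-gram step mentioned above can be illustrated with a small sketch that slides a window over an API call trace and counts the resulting tuples. The trace is hypothetical and the helper name is illustrative; how the paper turns these counts into embeddings is not shown here.

```python
from collections import Counter

def ngrams(calls, n):
    """Return all contiguous n-length windows of an API call sequence."""
    return [tuple(calls[i:i + n]) for i in range(len(calls) - n + 1)]

# Hypothetical API call trace from one sample.
trace = ["CreateFileW", "WriteFile", "CloseHandle", "CreateFileW", "WriteFile"]
bigram_counts = Counter(ngrams(trace, 2))
print(bigram_counts.most_common(1))
```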

Meta-Learning Initializations for Low-Resource Drug Discovery

10.26434/chemrxiv.11981622.v1 ◽

2020 ◽

Author(s):

Cuong Q. Nguyen ◽

Constantine Kreatsoulas ◽

Kim M. Branson

Keyword(s):

Deep Learning ◽

Drug Discovery ◽

State Of The Art ◽

Chemical Properties ◽

Fine Tuning ◽

Learning Performance ◽

Low Resource ◽

Average Improvement ◽

Meta Learning ◽

In Silico Models

Building in silico models to predict chemical properties and activities is a crucial step in drug discovery. However, drug discovery projects are often characterized by limited labeled data, hindering the application of deep learning in this setting. Meanwhile, advances in meta-learning have enabled state-of-the-art performance in few-shot learning benchmarks, naturally prompting the question: Can meta-learning improve deep learning performance in low-resource drug discovery projects? In this work, we assess the efficiency of the Model-Agnostic Meta-Learning (MAML) algorithm, along with its variants FO-MAML and ANIL, at learning to predict chemical properties and activities. Using the ChEMBL20 dataset to emulate low-resource settings, our benchmark shows that meta-initializations perform comparably to or outperform multi-task pre-training baselines on 16 out of 20 in-distribution tasks and on all out-of-distribution tasks, providing an average improvement in AUPRC of 7.2% and 14.9% respectively. Finally, we observe that meta-initializations consistently result in the best performing models across fine-tuning sets with k ∈ {16, 32, 64, 128, 256} instances.
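
For readers unfamiliar with MAML, its standard update can be written as below. The symbols are the usual ones from the MAML literature rather than notation taken from this abstract: θ is the meta-initialization, α and β are the inner and outer learning rates, and the T_i are the sampled tasks.

```latex
% Inner loop: adapt the shared initialization to task T_i on its support data
\theta_i' = \theta - \alpha \, \nabla_{\theta} \mathcal{L}_{\mathcal{T}_i}(\theta)

% Outer loop: update the initialization using the post-adaptation loss on query data
\theta \leftarrow \theta - \beta \, \nabla_{\theta} \sum_{i} \mathcal{L}_{\mathcal{T}_i}(\theta_i')
```

FO-MAML approximates the outer gradient by dropping the second-order terms, and ANIL restricts the inner-loop adaptation to the final layer of the network.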

Classification of Microscopic Laser Engraving Surface Defect Images Based on Transfer Learning Method

Electronics ◽

10.3390/electronics10161993 ◽

2021 ◽

Vol 10 (16) ◽

pp. 1993

Author(s):

Jing Zhang ◽

Zhenhao Li ◽

Ruqian Hao ◽

Xiangzhou Wang ◽

Xiaohui Du ◽

...

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Transfer Learning ◽

Surface Defect ◽

Feature Space ◽

Fine Tuning ◽

Defect Classification ◽

Classification Methods ◽

Laser Engraving ◽

Traditional Classification

Microscopic laser engraving surface defect classification plays an important role in the industrial quality inspection field. The key challenges of accurate surface defect classification are the complete description of the defect and the correct separation of categories in the feature space. Traditional classification methods treat feature extraction and classification as independent steps; therefore, feeding handcrafted features to the classifier may lose useful features. In recent years, convolutional neural networks (CNNs) have achieved excellent results in image classification tasks with the development of deep learning. Deep convolutional networks integrate feature extraction and classification into self-learning, but require large datasets. The training datasets for microscopic laser engraving image classification are small; therefore, we used pre-trained CNN models and applied two fine-tuning strategies. Transfer learning proved to perform well even on small future datasets. The proposed method was evaluated on a dataset consisting of 1986 laser engraving images captured by a metallographic microscope and annotated by experienced staff. Because handcrafted features were not used, our method is more robust and achieves better results than traditional classification methods. Under five-fold cross-validation, the average accuracy of the best model, based on DenseNet121, is 96.72%.

Improvement of EMG Pattern Recognition Model Performance in Repeated Uses by Combining Feature Selection and Incremental Transfer Learning

Frontiers in Neurorobotics ◽

10.3389/fnbot.2021.699174 ◽

2021 ◽

Vol 15 ◽

Author(s):

Qi Li ◽

Anyuan Zhang ◽

Zhenlan Li ◽

Yan Wu

Keyword(s):

Pattern Recognition ◽

Feature Selection ◽

Classification Accuracy ◽

Fine Tuning ◽

Support Vector ◽

Classification Methods ◽

Data Set ◽

Feature Vectors ◽

Emg Pattern Recognition ◽

Sequential Forward Search

Electromyography (EMG) pattern recognition is one of the most widely used methods to control rehabilitation robots and prostheses. However, changes in the distribution of EMG data due to electrode shift cause classification accuracy to decline, which hinders clinical application in repeated uses. Adaptive learning can solve this problem but takes additional time. To address this, an efficient scheme is developed by comparing the performance of 12 combinations of three feature selection methods [no feature selection (NFS), sequential forward search (SFS), and particle swarm optimization (PSO)] and four classification methods [non-adaptive support vector machine (N-SVM), incremental SVM (I-SVM), SVM based on TrAdaBoost (T-SVM), and I-SVM based on TrAdaBoost (TI-SVM)] in the classification of EMG data of 12 subjects for 5 consecutive days. Our results showed that TI-SVM achieved the highest classification accuracy among the classification methods (p < 0.05). The SFS method achieved the same classification accuracy as the scheme trained with the feature vectors selected by the NFS method (p = 0.999) while achieving a lower training time than TI-SVM combined with the NFS method (p = 0.043). Although the PSO method outperformed the NFS and SFS methods by achieving reduced training and response times (p < 0.05), it achieved considerably lower classification accuracy than the scheme trained with the feature vectors selected by the NFS (p = 0.001) or SFS (p = 0.001) method. Furthermore, TI-SVM combined with the SFS method outperformed the CNN method with fine-tuning in classification accuracy on a small data set (p = 0.001). The results indicate that TI-SVM combined with the SFS method is suitable for improving the performance of EMG pattern recognition in repeated uses.
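
Sequential forward search, one of the feature selection methods compared above, can be sketched generically as a greedy loop that adds the feature giving the largest score gain and stops when nothing improves. The feature names and the toy scoring function are hypothetical stand-ins; in practice the score would be something like cross-validated classifier accuracy.

```python
def sequential_forward_search(features, score, max_features=None):
    """Greedily add the feature that most improves score(subset) until no gain."""
    selected, best_score = [], float("-inf")
    remaining = list(features)
    limit = max_features if max_features is not None else len(remaining)
    while remaining and len(selected) < limit:
        candidate = max(remaining, key=lambda f: score(selected + [f]))
        candidate_score = score(selected + [candidate])
        if candidate_score <= best_score:
            break  # no remaining feature improves the score
        selected.append(candidate)
        remaining.remove(candidate)
        best_score = candidate_score
    return selected, best_score

# Hypothetical per-feature usefulness, with a small penalty per selected feature.
useful = {"rms": 0.30, "mav": 0.25, "zc": 0.10}

def toy_score(subset):
    return sum(useful.get(f, 0.0) for f in subset) - 0.02 * len(subset)

chosen, value = sequential_forward_search(["rms", "wl", "mav", "zc", "ssc"], toy_score)
print(chosen, round(value, 2))
```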

Meta-Learning Applied to the Selection of the Classification Methods in Industrial Images

10.17648/sbai-2019-111375 ◽

2019 ◽

Author(s):

Luis Fernando Marin Sepulveda ◽

Aristofanes Silva ◽

João Otávio Diniz

Keyword(s):

Classification Methods ◽

Meta Learning ◽

Selection Of

Research on image classification method based on improved multi-scale relational network

PeerJ Computer Science ◽

10.7717/peerj-cs.613 ◽

2021 ◽

Vol 7 ◽

pp. e613

Author(s):

Wenfeng Zheng ◽

Xiangjun Liu ◽

Lirong Yin

Keyword(s):

Deep Learning ◽

Image Classification ◽

Learning Style ◽

Learning Algorithm ◽

Small Sample ◽

Fine Tuning ◽

Multi Scale ◽

Meta Learning ◽

Model Independent ◽

Relational Network

Small sample learning aims to learn information about object categories from a single or a few training samples. This learning style is crucial for deep learning methods based on large amounts of data. Deep learning methods can address small sample learning through the idea of meta-learning: “how to learn by using previous experience.” Therefore, this paper takes image classification as the research object to study how meta-learning quickly learns from a small number of sample images. The main contents are as follows: after considering the effect of distribution differences between data sets on the generalization performance of metric learning and the advantages of optimizing the initial representation, this paper adds the model-independent meta-learning algorithm and designs a multi-scale meta-relational network. First, the idea of Meta-SGD is adopted, and the inner learning rate is treated as a learnable vector that is learned together with the model parameters. Second, in the meta-training process, the model-independent meta-learning algorithm is used to find the optimal parameters of the model. The inner gradient iteration is omitted during meta-validation and meta-testing. The experimental results show that the multi-scale meta-relational network gives the learned metric stronger generalization ability, which further improves the classification accuracy on the benchmark set and avoids the need for fine-tuning of the model-independent meta-learning algorithm.
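
The Meta-SGD idea referred to above, a per-parameter inner learning rate that is itself learned, can be illustrated with a single inner update. This is a minimal sketch of that one step with made-up numbers; it does not reproduce the paper's multi-scale relational network or its meta-training loop.

```python
def meta_sgd_inner_step(theta, grads, alpha):
    """One inner update where every parameter has its own learning rate."""
    return [t - a * g for t, a, g in zip(theta, alpha, grads)]

theta = [0.5, -1.0]   # hypothetical model parameters
grads = [0.2, -0.4]   # hypothetical task-loss gradients at theta
alpha = [0.1, 0.01]   # per-parameter rates; in Meta-SGD these are meta-learned with theta
adapted = meta_sgd_inner_step(theta, grads, alpha)
print(adapted)
```

With a single shared scalar in place of `alpha`, the same step reduces to an ordinary gradient-descent inner update.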
