DNS Tunneling Detection Method Based on Multilabel Support Vector Machine

Security and Communication Networks ◽

10.1155/2018/6137098 ◽

2018 ◽

Vol 2018 ◽

pp. 1-9 ◽

Cited By ~ 8

Author(s):

Ahmed Almusawi ◽

Haleh Amintoosi

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Detection Method ◽

Binary Classification ◽

Experimental Results ◽

Classification Method ◽

Support Vector ◽

Class Label ◽

Svm Classification ◽

Different Types

DNS tunneling is a method used by malicious users who intend to bypass the firewall to send or receive commands and data. This has a significant impact on revealing or releasing classified information. Several researchers have examined the use of machine learning in terms of detecting DNS tunneling. However, these studies have treated the problem of DNS tunneling as a binary classification where the class label is either legitimate or tunnel. In fact, there are different types of DNS tunneling such as FTP-DNS tunneling, HTTP-DNS tunneling, HTTPS-DNS tunneling, and POP3-DNS tunneling. Therefore, there is a vital demand to not only detect the DNS tunneling but rather classify such tunnel. This study aims to propose a multilabel support vector machine in order to detect and classify the DNS tunneling. The proposed method has been evaluated using a benchmark dataset that contains numerous DNS queries and is compared with a multilabel Bayesian classifier based on the number of corrected classified DNS tunneling instances. Experimental results demonstrate the efficacy of the proposed SVM classification method by obtaining an f-measure of 0.80.

Download Full-text

Label Self-Advised Support Vector Machine (LSA-SVM)—Automated Classification of Foot Drop Rehabilitation Case Study

Biosensors ◽

10.3390/bios9040114 ◽

2019 ◽

Vol 9 (4) ◽

pp. 114 ◽

Cited By ~ 1

Author(s):

Sahar Adil Abboud ◽

Saba Al-Wais ◽

Salma Hameedi Abdullah ◽

Fady Alnajjar ◽

Adel Al-Jumaily

Keyword(s):

Support Vector Machine ◽

Motor Neurons ◽

Experimental Results ◽

Foot Drop ◽

Classification Method ◽

Support Vector ◽

Major Health Problem ◽

Rehabilitation Devices ◽

Predictive Data Mining ◽

Ankle And Foot

Stroke represents a major health problem in our society. One of the effects of stroke is foot drop. Foot drop (FD) is a weakness that occurs in specific muscles in the ankle and foot such as the anterior tibialis, gastrocnemius, plantaris and soleus muscles. Foot flexion and extension are normally generated by lower motor neurons (LMN). The affected muscles impact the ankle and foot in both downward and upward motions. One possible solution for FD is to investigate the movement based on the bio signal (myoelectric signal) of the muscles. Bio signal control systems like electromyography (EMG) are used for rehabilitation devices that include foot drop. One of these systems is function electrical stimulation (FES). This paper proposes new methods and algorithms to develop the performance of myoelectric pattern recognition (M-PR), to improve automated rehabilitation devices, to test these methodologies in offline and real-time experimental datasets. Label classifying is a predictive data mining application with multiple applications in the world, including automatic labeling of resources such as videos, music, images and texts. We combine the label classification method with the self-advised support vector machine (SA-SVM) to create an adapted and altered label classification method, named the label self-advised support vector machine (LSA-SVM). For the experimental data, we collected data from foot drop patients using the sEMG device, in the Metro Rehabilitation Hospital in Sydney, Australia using Ethical Approval (UTS HREC NO. ETH15-0152). The experimental results for the EMG dataset and benchmark datasets exhibit its benefits. Furthermore, the experimental results on UCI datasets indicate that LSA-SVM achieves the best performance when working together with SA-SVM and SVM. This paper describes the state-of-the-art procedures for M-PR and studies all the conceivable structures.

Download Full-text

An New Fuzzy Support Vector Machine for Binary Classification

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.433-440.2856 ◽

2012 ◽

Vol 433-440 ◽

pp. 2856-2861 ◽

Cited By ~ 1

Author(s):

Rui Zhang ◽

Tong Bo Liu ◽

Ming Wen Zheng

Keyword(s):

Support Vector Machine ◽

Binary Classification ◽

Experimental Results ◽

New Method ◽

Support Vector ◽

Fuzzy Support Vector Machine

In this paper, we proposed a new fuzzy support vector machine(called L2–FSVM here), which error part of object is L2–norm.Meanwhile we introduce a new method of generating fuzzy memberships so as to reduce to effects of outliers. The experimental results demonstrate that the L2-FSVM method provides improved ability to reduce to effects of outliers in comparison with traditional SVMs and FSVMs, and claim that L2–FSVM is the best way to solve the binary classiﬁcation in the three methods stated above.

Download Full-text

Rescale-Invariant SVM for Binary Classification

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/348 ◽

2017 ◽

Cited By ~ 1

Author(s):

Mojtaba Montazery ◽

Nic Wilson

Keyword(s):

Machine Learning ◽

Decision Making ◽

Support Vector Machines ◽

Binary Classification ◽

Experimental Results ◽

Support Vector ◽

Computation Method ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

Support Vector Machines (SVM) are among the most well-known machine learning methods, with broad use in different scientific areas. However, one necessary pre-processing phase for SVM is normalization (scaling) of features, since SVM is not invariant to the scales of the features’ spaces, i.e., different ways of scaling may lead to different results. We define a more robust decision-making approach for binary classification, in which one sample strongly belongs to a class if it belongs to that class for all possible rescalings of features. We derive a way of characterising the approach for binary SVM that allows determining when an instance strongly belongs to a class and when the classification is invariant to rescaling. The characterisation leads to a computation method to determine whether one sample is strongly positive, strongly negative or neither. Our experimental results back up the intuition that being strongly positive suggests stronger confidence that an instance really is positive.

Download Full-text

Machine Learning Methods for Fear Classification Based on Physiological Features

Sensors ◽

10.3390/s21134519 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4519

Author(s):

Livia Petrescu ◽

Cătălin Petrescu ◽

Ana Oprea ◽

Oana Mitruț ◽

Gabriela Moise ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Dimensionality Reduction ◽

Data Augmentation ◽

Binary Classification ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Physiological Data ◽

Gradient Boosting ◽

Support Vector

This paper focuses on the binary classification of the emotion of fear, based on the physiological data and subjective responses stored in the DEAP dataset. We performed a mapping between the discrete and dimensional emotional information considering the participants’ ratings and extracted a substantial set of 40 types of features from the physiological data, which represented the input to various machine learning algorithms—Decision Trees, k-Nearest Neighbors, Support Vector Machine and artificial networks—accompanied by dimensionality reduction, feature selection and the tuning of the most relevant hyperparameters, boosting classification accuracy. The methodology we approached included tackling different situations, such as resolving the problem of having an imbalanced dataset through data augmentation, reducing overfitting, computing various metrics in order to obtain the most reliable classification scores and applying the Local Interpretable Model-Agnostic Explanations method for interpretation and for explaining predictions in a human-understandable manner. The results show that fear can be predicted very well (accuracies ranging from 91.7% using Gradient Boosting Trees to 93.5% using dimensionality reduction and Support Vector Machine) by extracting the most relevant features from the physiological data and by searching for the best parameters which maximize the machine learning algorithms’ classification scores.

Download Full-text

BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm

PeerJ Computer Science ◽

10.7717/peerj-cs.390 ◽

2021 ◽

Vol 7 ◽

pp. e390

Author(s):

Shafaq Abbas ◽

Zunera Jalil ◽

Abdul Rehman Javed ◽

Iqra Batool ◽

Mohammad Zubair Khan ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Support Vector Machine ◽

Feature Selection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Experimental Results ◽

Support Vector ◽

Novel Approach ◽

Whale Optimization

Breast cancer is one of the leading causes of death in the current age. It often results in subpar living conditions for a patient as they have to go through expensive and painful treatments to fight this cancer. One in eight women all over the world is affected by this disease. Almost half a million women annually do not survive this fight and die from this disease. Machine learning algorithms have proven to outperform all existing solutions for the prediction of breast cancer using models built on the previously available data. In this paper, a novel approach named BCD-WERT is proposed that utilizes the Extremely Randomized Tree and Whale Optimization Algorithm (WOA) for efficient feature selection and classification. WOA reduces the dimensionality of the dataset and extracts the relevant features for accurate classification. Experimental results on state-of-the-art comprehensive dataset demonstrated improved performance in comparison with eight other machine learning algorithms: Support Vector Machine (SVM), Random Forest, Kernel Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Gaussian Naive Bayes and k-Nearest Neighbor. BCD-WERT outperformed all with the highest accuracy rate of 99.30% followed by SVM achieving 98.60% accuracy. Experimental results also reveal the effectiveness of feature selection techniques in improving prediction accuracy.

Download Full-text

A Multi-Classification Method of Improved SVM-based Information Fusion for Traffic Parameters Forecasting

PROMET - Traffic&Transportation ◽

10.7307/ptt.v28i2.1643 ◽

2016 ◽

Vol 28 (2) ◽

pp. 117-124 ◽

Cited By ~ 2

Author(s):

Hongzhuan Zhao ◽

Dihua Sun ◽

Min Zhao ◽

Senlin Cheng

Keyword(s):

Support Vector Machine ◽

Information Fusion ◽

Binary Tree ◽

Classification Method ◽

Traffic Information ◽

Support Vector ◽

Cyber Physical System ◽

Svm Classification ◽

Forecasting Performance ◽

Multi Classification

With the enrichment of perception methods, modern transportation system has many physical objects whose states are influenced by many information factors so that it is a typical Cyber-Physical System (CPS). Thus, the traffic information is generally multi-sourced, heterogeneous and hierarchical. Existing research results show that the multisourced traffic information through accurate classification in the process of information fusion can achieve better parameters forecasting performance. For solving the problem of traffic information accurate classification, via analysing the characteristics of the multi-sourced traffic information and using redefined binary tree to overcome the shortcomings of the original Support Vector Machine (SVM) classification in information fusion, a multi-classification method using improved SVM in information fusion for traffic parameters forecasting is proposed. The experiment was conducted to examine the performance of the proposed scheme, and the results reveal that the method can get more accurate and practical outcomes.

Download Full-text

An Improvement Of Least Square - Twin Support Vector Machine

Research and Development on Information and Communication Technology ◽

10.32913/mic-ict-research.v2021.n1.956 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Thanh Vi Nguyen ◽

Thế Cường Nguyễn

Keyword(s):

Support Vector Machine ◽

Binary Classification ◽

Least Square ◽

Experimental Results ◽

Twin Support Vector Machine ◽

Support Vector ◽

Classification Problems ◽

Training Time ◽

Data Points ◽

Better Than

n binary classification problems, two classes of data seem tobe different from each other. It is expected to be more complicated dueto the number of data points of clusters in each class also be different.Traditional algorithms as Support Vector Machine (SVM), Twin Support Vector Machine (TSVM), or Least Square Twin Support VectorMachine (LSTSVM) cannot sufficiently exploit information about thenumber of data points in each cluster of the data. Which may be effectto the accuracy of classification problems. In this paper, we proposes anew Improved Least Square - Support Vector Machine (called ILS-SVM)for binary classification problems with a class-vs-clusters strategy. Experimental results show that the ILS-SVM training time is faster thanthat of TSVM, and the ILS-SVM accuracy is better than LSTSVM andTSVM in most cases.

Download Full-text

Perbandingan Algoritme Machine Learning untuk Memprediksi Pengambil Matakuliah

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.2019651755 ◽

2019 ◽

Vol 6 (5) ◽

pp. 543 ◽

Cited By ~ 1

Author(s):

Fitra A. Bachtiar ◽

Indra K. Syahputra ◽

Satrio A. Wicaksono

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Classification Method ◽

Support Vector ◽

Test Results ◽

K Nearest Neighbor ◽

Student Data ◽

Prediction Problems ◽

Imbalance Dataset

Pada setiap awal semester bagian akademik melakukan penjadwalan dan penentuan matakuliah yang akan dibuka untuk semester berikutnya. Akan tetapi proses tersebut memiliki permasalahan antara lain kelas yang dibuka terlalu banyak dibanding jumlah siswa yang berminat atau sebaliknya. Selain itu, dalam permasalahan prediksi data yang terkumpul memiliki kecenderungan tidak seimbang pada setiap kelas (imbalance class). Hal ini akan berdampak pada proses penjadwalan yang kurang tepat. Sehingga dibutuhkan sistem yang dapat memprediksi mahasiswa pengambil mata kuliah. Akan tetapi ada banyak algoritme yang dapat digunakan untuk proses prediksi. Penelitian ini membandingkan performa algoritma untuk klasifikasi mahasiswa pengambil matakuliah. Pada penelitian ini prediksi dilakukan berdasarkan atribut dari data mahasiswa. Atribut-atribut tersebut yaitu Nilai, IP, IPK, SKS, SKSK dan Semester. Pada setiap observasi pada atribut-atribut tersebut prediksi akan dilakukan apakah mahasiswa tersebut mengambil mata kuliah tertentu. Prediksi dibagi menjadi 2 kelas yaitu ‘Ya’ untuk mahasiswa yang diprediksi mengambil matakuliah dan ‘Tidak’ untuk mahasiswa yang diprediksi tidak mengambil matakuliah. Teknik Synthetic Minority Oversampling Technique (SMOTE) digunakan untuk menangani data yang tidak seimbang. Pada penelitian ini klasifikasi dilakukan dengan membandingkan algoritme k-Nearest Neighbor (k-NN) dan Support Vector Machine (SVM) untuk kasus prediksi pengambil matakuliah. Hasil pengujian menggunakan 3 mata kuliah sebagai sampel. Dari hasil rerata, diperoleh hasil prediksi k-NN memiliki kinerja yang lebih baik daripada SVM. Selain itu, penggunaan teknik SMOTE dapat mempengaruhi hasil klasifikasi berupa peningkatan nilai AUC, CA, F1, precision dan recall. AbstractAt the beginning of each semester, the academic section conducts scheduling and determining the courses offered for the next semester. However, the process has problems such as too many classes offered to the student compared to the number of students who take the class or vice versa. Besides that, in the prediction problems, the collected data has an imbalance tendency in each class. As a result, these problems could cause in ineffective scheduling. Thus, there is a need to build a system that can predict students taking courses. However, there are many algorithms that can be used for the prediction. This study compares the performance of algorithms for classifications of students taking courses. In this study, predictions are modeled based on the attributes of student data, namely Grades, GPA, Cumulative GPA, Semester Credits, Cumulative Semester Credits and Semester. The classification process will be carried out to produce a prediction of whether the student takes a particular subject or not. Classification results are divided into 2 classes, namely 'Yes' for students who are predicted to take and 'No' for students who are predicted not to take the class. To handle imbalance dataset will use Synthetic Minority Oversampling Technique (SMOTE) techniques. Classification method used in this study are k-Nearest Neighbor (k-NN) and Support Vector Machine (SVM) algorithms to compare their performance for prediction cases. The test results used 3 courses as a sample. In average k-NN prediction results have a better performance than SVM. In addition, the use of SMOTE techniques can influence the classification results in the form of an increase in AUC, CA, F1, precision and recall values.

Download Full-text

Analysis of rainfall classification over Tanah Laut disrict based on global climate indicators using support vector machine method

Journal of Physics Conference Series ◽

10.1088/1742-6596/2106/1/012009 ◽

2021 ◽

Vol 2106 (1) ◽

pp. 012009

Author(s):

N Hayah ◽

O Soesanto ◽

M A Rahman

Keyword(s):

Support Vector Machine ◽

Global Climate ◽

Climatic Conditions ◽

Training Data ◽

Classification Method ◽

Support Vector ◽

Rainfall Forecasting ◽

Machine Method ◽

Svm Classification ◽

The Relationship

Abstract The Support Vector Machine (SVM) classification method can be applied in various fields, one of which is meteorology and climatology in rainfall forecasting. Thus, a study was conducted by classifying rainfall to recognize the relationship between global phenomena and rainfall and the results of applying the classification using the SVM method to rainfall in the Tanah Laut Regency. The analysis is carried out using the SVM Multiclass concept with 4 categories of rainfall classification: low, medium, high, and Extreme. The kernel used in SVM is the RBF kernel with optimization parameters used, namely Cost (C) 1,5,10,15 and Gamma (γ) 1,5,10,15. The dataset formed is based on the annual period, climatic conditions, and seasonality. The Spearman Rank correlation test describes the relationship between global phenomena and rainfall with a correlation range of (−0.1456 ) − (0.43144) for the entire dataset. The implementation of the SVM classification method shows that the Cost (C) 10 and Gamma (γ) ≥ 5 parameters obtained the highest accuracy of 100% on the training data. In contrast, in testing the data testing, the accuracy was good, namely the accuracy of 78.00% in La Nina and 81.38% in seasonal periods.

Download Full-text

Twin Support Vector Machine for Multiple Instance Learning Based on Bag Dissimilarities

Advances in Artificial Intelligence ◽

10.1155/2016/1269708 ◽

2016 ◽

Vol 2016 ◽

pp. 1-18 ◽

Cited By ~ 1

Author(s):

Divya Tomar ◽

Sonal Agarwal

Keyword(s):

Support Vector Machine ◽

Multiple Instance Learning ◽

Experimental Results ◽

Training Dataset ◽

Twin Support Vector Machine ◽

Support Vector ◽

Learning Approaches ◽

Class Label ◽

Positive Class ◽

Post Hoc

In multiple instance learning (MIL) framework, an object is represented by a set of instances referred to as bag. A positive class label is assigned to a bag if it contains at least one positive instance; otherwise a bag is labeled with negative class label. Therefore, the task of MIL is to learn a classifier at bag level rather than at instance level. Traditional supervised learning approaches cannot be applied directly in such kind of situation. In this study, we represent each bag by a vector of its dissimilarities to the other existing bags in the training dataset and propose a multiple instance learning based Twin Support Vector Machine (MIL-TWSVM) classifier. We have used different ways to represent the dissimilarity between two bags and performed a comparative analysis of them. The experimental results on ten benchmark MIL datasets demonstrate that the proposed MIL-TWSVM classifier is computationally inexpensive and competitive with state-of-the-art approaches. The significance of the experimental results has been tested by using Friedman statistic and Nemenyi post hoc tests.

Download Full-text