scholarly journals DNS Tunneling Detection Method Based on Multilabel Support Vector Machine

2018 ◽  
Vol 2018 ◽  
pp. 1-9 ◽  
Author(s):  
Ahmed Almusawi ◽  
Haleh Amintoosi

DNS tunneling is a method used by malicious users who intend to bypass the firewall to send or receive commands and data. This has a significant impact on revealing or releasing classified information. Several researchers have examined the use of machine learning in terms of detecting DNS tunneling. However, these studies have treated the problem of DNS tunneling as a binary classification where the class label is either legitimate or tunnel. In fact, there are different types of DNS tunneling such as FTP-DNS tunneling, HTTP-DNS tunneling, HTTPS-DNS tunneling, and POP3-DNS tunneling. Therefore, there is a vital demand to not only detect the DNS tunneling but rather classify such tunnel. This study aims to propose a multilabel support vector machine in order to detect and classify the DNS tunneling. The proposed method has been evaluated using a benchmark dataset that contains numerous DNS queries and is compared with a multilabel Bayesian classifier based on the number of corrected classified DNS tunneling instances. Experimental results demonstrate the efficacy of the proposed SVM classification method by obtaining an f-measure of 0.80.

Biosensors ◽  
2019 ◽  
Vol 9 (4) ◽  
pp. 114 ◽  
Author(s):  
Sahar Adil Abboud ◽  
Saba Al-Wais ◽  
Salma Hameedi Abdullah ◽  
Fady Alnajjar ◽  
Adel Al-Jumaily

Stroke represents a major health problem in our society. One of the effects of stroke is foot drop. Foot drop (FD) is a weakness that occurs in specific muscles in the ankle and foot such as the anterior tibialis, gastrocnemius, plantaris and soleus muscles. Foot flexion and extension are normally generated by lower motor neurons (LMN). The affected muscles impact the ankle and foot in both downward and upward motions. One possible solution for FD is to investigate the movement based on the bio signal (myoelectric signal) of the muscles. Bio signal control systems like electromyography (EMG) are used for rehabilitation devices that include foot drop. One of these systems is function electrical stimulation (FES). This paper proposes new methods and algorithms to develop the performance of myoelectric pattern recognition (M-PR), to improve automated rehabilitation devices, to test these methodologies in offline and real-time experimental datasets. Label classifying is a predictive data mining application with multiple applications in the world, including automatic labeling of resources such as videos, music, images and texts. We combine the label classification method with the self-advised support vector machine (SA-SVM) to create an adapted and altered label classification method, named the label self-advised support vector machine (LSA-SVM). For the experimental data, we collected data from foot drop patients using the sEMG device, in the Metro Rehabilitation Hospital in Sydney, Australia using Ethical Approval (UTS HREC NO. ETH15-0152). The experimental results for the EMG dataset and benchmark datasets exhibit its benefits. Furthermore, the experimental results on UCI datasets indicate that LSA-SVM achieves the best performance when working together with SA-SVM and SVM. This paper describes the state-of-the-art procedures for M-PR and studies all the conceivable structures.


2012 ◽  
Vol 433-440 ◽  
pp. 2856-2861 ◽  
Author(s):  
Rui Zhang ◽  
Tong Bo Liu ◽  
Ming Wen Zheng

In this paper, we proposed a new fuzzy support vector machine(called L2–FSVM here), which error part of object is L2–norm.Meanwhile we introduce a new method of generating fuzzy memberships so as to reduce to effects of outliers. The experimental results demonstrate that the L2-FSVM method provides improved ability to reduce to effects of outliers in comparison with traditional SVMs and FSVMs, and claim that L2–FSVM is the best way to solve the binary classification in the three methods stated above.


Author(s):  
Mojtaba Montazery ◽  
Nic Wilson

Support Vector Machines (SVM) are among the most well-known machine learning methods, with broad use in different scientific areas. However, one necessary pre-processing phase for SVM is normalization (scaling) of features, since SVM is not invariant to the scales of the features’ spaces, i.e., different ways of scaling may lead to different results. We define a more robust decision-making approach for binary classification, in which one sample strongly belongs to a class if it belongs to that class for all possible rescalings of features. We derive a way of characterising the approach for binary SVM that allows determining when an instance strongly belongs to a class and when the classification is invariant to rescaling. The characterisation leads to a computation method to determine whether one sample is strongly positive, strongly negative or neither. Our experimental results back up the intuition that being strongly positive suggests stronger confidence that an instance really is positive.


Sensors ◽  
2021 ◽  
Vol 21 (13) ◽  
pp. 4519
Author(s):  
Livia Petrescu ◽  
Cătălin Petrescu ◽  
Ana Oprea ◽  
Oana Mitruț ◽  
Gabriela Moise ◽  
...  

This paper focuses on the binary classification of the emotion of fear, based on the physiological data and subjective responses stored in the DEAP dataset. We performed a mapping between the discrete and dimensional emotional information considering the participants’ ratings and extracted a substantial set of 40 types of features from the physiological data, which represented the input to various machine learning algorithms—Decision Trees, k-Nearest Neighbors, Support Vector Machine and artificial networks—accompanied by dimensionality reduction, feature selection and the tuning of the most relevant hyperparameters, boosting classification accuracy. The methodology we approached included tackling different situations, such as resolving the problem of having an imbalanced dataset through data augmentation, reducing overfitting, computing various metrics in order to obtain the most reliable classification scores and applying the Local Interpretable Model-Agnostic Explanations method for interpretation and for explaining predictions in a human-understandable manner. The results show that fear can be predicted very well (accuracies ranging from 91.7% using Gradient Boosting Trees to 93.5% using dimensionality reduction and Support Vector Machine) by extracting the most relevant features from the physiological data and by searching for the best parameters which maximize the machine learning algorithms’ classification scores.


2021 ◽  
Vol 7 ◽  
pp. e390
Author(s):  
Shafaq Abbas ◽  
Zunera Jalil ◽  
Abdul Rehman Javed ◽  
Iqra Batool ◽  
Mohammad Zubair Khan ◽  
...  

Breast cancer is one of the leading causes of death in the current age. It often results in subpar living conditions for a patient as they have to go through expensive and painful treatments to fight this cancer. One in eight women all over the world is affected by this disease. Almost half a million women annually do not survive this fight and die from this disease. Machine learning algorithms have proven to outperform all existing solutions for the prediction of breast cancer using models built on the previously available data. In this paper, a novel approach named BCD-WERT is proposed that utilizes the Extremely Randomized Tree and Whale Optimization Algorithm (WOA) for efficient feature selection and classification. WOA reduces the dimensionality of the dataset and extracts the relevant features for accurate classification. Experimental results on state-of-the-art comprehensive dataset demonstrated improved performance in comparison with eight other machine learning algorithms: Support Vector Machine (SVM), Random Forest, Kernel Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Gaussian Naive Bayes and k-Nearest Neighbor. BCD-WERT outperformed all with the highest accuracy rate of 99.30% followed by SVM achieving 98.60% accuracy. Experimental results also reveal the effectiveness of feature selection techniques in improving prediction accuracy.


2016 ◽  
Vol 28 (2) ◽  
pp. 117-124 ◽  
Author(s):  
Hongzhuan Zhao ◽  
Dihua Sun ◽  
Min Zhao ◽  
Senlin Cheng

With the enrichment of perception methods, modern transportation system has many physical objects whose states are influenced by many information factors so that it is a typical Cyber-Physical System (CPS). Thus, the traffic information is generally multi-sourced, heterogeneous and hierarchical. Existing research results show that the multisourced traffic information through accurate classification in the process of information fusion can achieve better parameters forecasting performance. For solving the problem of traffic information accurate classification, via analysing the characteristics of the multi-sourced traffic information and using redefined binary tree to overcome the shortcomings of the original Support Vector Machine (SVM) classification in information fusion, a multi-classification method using improved SVM in information fusion for traffic parameters forecasting is proposed. The experiment was conducted to examine the performance of the proposed scheme, and the results reveal that the method can get more accurate and practical outcomes.


Author(s):  
Thanh Vi Nguyen ◽  
Thế Cường Nguyễn

n binary classification problems, two classes of data seem tobe different from each other. It is expected to be more complicated dueto the number of data points of clusters in each class also be different.Traditional algorithms as Support Vector Machine (SVM), Twin Support Vector Machine (TSVM), or Least Square Twin Support VectorMachine (LSTSVM) cannot sufficiently exploit information about thenumber of data points in each cluster of the data. Which may be effectto the accuracy of classification problems. In this paper, we proposes anew Improved Least Square - Support Vector Machine (called ILS-SVM)for binary classification problems with a class-vs-clusters strategy. Experimental results show that the ILS-SVM training time is faster thanthat of TSVM, and the ILS-SVM accuracy is better than LSTSVM andTSVM in most cases.


2019 ◽  
Vol 6 (5) ◽  
pp. 543 ◽  
Author(s):  
Fitra A. Bachtiar ◽  
Indra K. Syahputra ◽  
Satrio A. Wicaksono

<p class="Abstrak">Pada setiap awal semester bagian akademik melakukan penjadwalan dan penentuan matakuliah yang akan dibuka untuk semester berikutnya. Akan tetapi proses tersebut memiliki permasalahan antara lain kelas yang dibuka terlalu banyak dibanding jumlah siswa yang berminat atau sebaliknya. Selain itu, dalam permasalahan prediksi data yang terkumpul memiliki kecenderungan tidak seimbang pada setiap kelas (<em>imbalance class</em>). Hal ini akan berdampak pada proses penjadwalan yang kurang tepat. Sehingga dibutuhkan sistem yang dapat memprediksi mahasiswa pengambil mata kuliah. Akan tetapi ada banyak algoritme yang dapat digunakan untuk proses prediksi. Penelitian ini membandingkan performa algoritma untuk klasifikasi mahasiswa pengambil matakuliah. Pada penelitian ini prediksi dilakukan berdasarkan atribut dari data mahasiswa. Atribut-atribut tersebut yaitu Nilai, IP, IPK, SKS, SKSK dan Semester. Pada setiap observasi pada atribut-atribut tersebut prediksi akan dilakukan apakah mahasiswa tersebut mengambil mata kuliah tertentu. Prediksi dibagi menjadi 2 kelas yaitu ‘Ya’ untuk mahasiswa yang diprediksi mengambil matakuliah dan ‘Tidak’ untuk mahasiswa yang diprediksi tidak mengambil matakuliah. Teknik <em>Synthetic Minority Oversampling Technique</em> (SMOTE) digunakan untuk menangani data yang tidak seimbang. Pada penelitian ini klasifikasi dilakukan dengan membandingkan algoritme <em>k</em><em>-Nearest Neighbor </em>(<em>k</em>-NN) dan <em>Support Vector Machine </em>(SVM) untuk kasus prediksi pengambil matakuliah. Hasil pengujian menggunakan 3 mata kuliah sebagai sampel. Dari hasil rerata, diperoleh hasil prediksi <em>k</em>-NN memiliki kinerja yang lebih baik daripada SVM. Selain itu, penggunaan teknik SMOTE dapat mempengaruhi hasil klasifikasi berupa peningkatan nilai AUC, CA, F1, <em>precision</em> dan <em>recall</em>.</p><p class="Abstrak"><strong><br /></strong></p><p class="Abstrak"><strong>Abstract</strong></p><p class="Abstract"><em>At the beginning of each semester, the academic section conducts scheduling and determining the courses offered for the next semester. However, the process has problems such as too many classes offered to the student compared to the number of students who take the class or vice versa. Besides that, in the prediction problems, the collected data has an imbalance tendency in each class. As a result, these problems could cause in ineffective scheduling. Thus, there is a need to build a system that can predict students taking courses. However, there are many algorithms that can be used for the prediction. This study compares the performance of algorithms for classifications of students taking courses. In this study, predictions are modeled based on the attributes of student data, namely Grades, GPA, Cumulative GPA, Semester Credits, Cumulative Semester Credits and Semester. The classification process will be carried out to produce a prediction of whether the student takes a particular subject or not. Classification results are divided into 2 classes, namely 'Yes' for students who are predicted to take and 'No' for students who are predicted not to take the class. To handle imbalance dataset will use Synthetic Minority Oversampling Technique (SMOTE) techniques. Classification method used in this study are k-Nearest Neighbor (k-NN) and Support Vector Machine (SVM) algorithms to compare their performance for prediction cases. The test results used 3 courses as a sample. In average k-NN prediction results have a better performance than SVM. In addition, the use of SMOTE techniques can influence the classification results in the form of an increase in AUC, CA, F1, precision and recall values.<strong></strong></em></p><p class="Abstrak"><strong><br /></strong></p>


2021 ◽  
Vol 2106 (1) ◽  
pp. 012009
Author(s):  
N Hayah ◽  
O Soesanto ◽  
M A Rahman

Abstract The Support Vector Machine (SVM) classification method can be applied in various fields, one of which is meteorology and climatology in rainfall forecasting. Thus, a study was conducted by classifying rainfall to recognize the relationship between global phenomena and rainfall and the results of applying the classification using the SVM method to rainfall in the Tanah Laut Regency. The analysis is carried out using the SVM Multiclass concept with 4 categories of rainfall classification: low, medium, high, and Extreme. The kernel used in SVM is the RBF kernel with optimization parameters used, namely Cost (C) 1,5,10,15 and Gamma (γ) 1,5,10,15. The dataset formed is based on the annual period, climatic conditions, and seasonality. The Spearman Rank correlation test describes the relationship between global phenomena and rainfall with a correlation range of (−0.1456 ) − (0.43144) for the entire dataset. The implementation of the SVM classification method shows that the Cost (C) 10 and Gamma (γ) ≥ 5 parameters obtained the highest accuracy of 100% on the training data. In contrast, in testing the data testing, the accuracy was good, namely the accuracy of 78.00% in La Nina and 81.38% in seasonal periods.


2016 ◽  
Vol 2016 ◽  
pp. 1-18 ◽  
Author(s):  
Divya Tomar ◽  
Sonal Agarwal

In multiple instance learning (MIL) framework, an object is represented by a set of instances referred to as bag. A positive class label is assigned to a bag if it contains at least one positive instance; otherwise a bag is labeled with negative class label. Therefore, the task of MIL is to learn a classifier at bag level rather than at instance level. Traditional supervised learning approaches cannot be applied directly in such kind of situation. In this study, we represent each bag by a vector of its dissimilarities to the other existing bags in the training dataset and propose a multiple instance learning based Twin Support Vector Machine (MIL-TWSVM) classifier. We have used different ways to represent the dissimilarity between two bags and performed a comparative analysis of them. The experimental results on ten benchmark MIL datasets demonstrate that the proposed MIL-TWSVM classifier is computationally inexpensive and competitive with state-of-the-art approaches. The significance of the experimental results has been tested by using Friedman statistic and Nemenyi post hoc tests.


Sign in / Sign up

Export Citation Format

Share Document