Evaluating Grayware Characteristics and Risks

Journal of Computer Networks and Communications ◽

10.1155/2011/569829 ◽

2011 ◽

Vol 2011 ◽

pp. 1-28 ◽

Cited By ~ 1

Author(s):

Zhongqiang Chen ◽

Zhanyan Liang ◽

Yuan Zhang ◽

Zhongrong Chen

Keyword(s):

Information Gain ◽

Feature Space ◽

Training Data ◽

Support Vector ◽

Learning Models ◽

Generalization Capability ◽

Self Organizing Maps ◽

Defense Strategies ◽

Security Applications ◽

Vector Machines

Grayware encyclopedias collect known species to provide information for incident analysis, however, the lack of categorization and generalization capability renders them ineffective in the development of defense strategies against clustered strains. A grayware categorization framework is therefore proposed here to not only classify grayware according to diverse taxonomic features but also facilitate evaluations on grayware risk to cyberspace. Armed with Support Vector Machines, the framework builds learning models based on training data extracted automatically from grayware encyclopedias and visualizes categorization results with Self-Organizing Maps. The features used in learning models are selected with information gain and the high dimensionality of feature space is reduced by word stemming and stopword removal process. The grayware categorizations on diversified features reveal that grayware typically attempts to improve its penetration rate by resorting to multiple installation mechanisms and reduced code footprints. The framework also shows that grayware evades detection by attacking victims' security applications and resists being removed by enhancing its clotting capability with infected hosts. Our analysis further points out that species in categoriesSpywareandAdwarecontinue to dominate the grayware landscape and impose extremely critical threats to the Internet ecosystem.

Download Full-text

On the Sparseness and Generalization Capability of Least Squares Support Vector Machines

Journal of Systems Science and Information ◽

10.1515/jssi-2015-0279 ◽

2015 ◽

Vol 3 (3) ◽

pp. 279-288 ◽

Cited By ~ 1

Author(s):

Aijun Yan ◽

Xiaoqian Huang ◽

Hongshan Shao

Keyword(s):

Support Vector Machines ◽

Least Squares ◽

Principal Component ◽

Training Data ◽

Kernel Principal Component Analysis ◽

Function Parameter ◽

Support Vector ◽

Generalization Capability ◽

Penalty Factor ◽

Vector Machines

AbstractCompared with standard support vector machines (SVM), sparseness is lost in the modeling process of least squares support vector machines (LS-SVM), causing limited generalization capability. An improved method using quadratic renyi-entropy pruning is presented to deal with the above problems. First, a kernel principal component analysis (KPCA) is used to denoise the training data. Next, the authors use the genetic algorithm to estimate and optimize the kernel function parameter and penalty factor. Then, pick the subset that has the largest quadratic entropy to train and prune, and repeat this process until the cumulative error rate reaches the condition requirement. Finally, comparing experiments on the data classification and regression indicates that the proposed method is effective and may improve the sparseness and the generalization capability of LS-SVM model.

Download Full-text

An Expert System Based on Fisher Score and LS-SVM for Cardiac Arrhythmia Diagnosis

Computational and Mathematical Methods in Medicine ◽

10.1155/2013/849674 ◽

2013 ◽

Vol 2013 ◽

pp. 1-6 ◽

Cited By ~ 19

Author(s):

Ersen Yılmaz

Keyword(s):

Expert System ◽

Cardiac Arrhythmia ◽

Feature Space ◽

Support Vector ◽

Feature Subset ◽

Fisher Score ◽

Data Set ◽

Second Stage ◽

Vector Machines ◽

Two Stages

An expert system having two stages is proposed for cardiac arrhythmia diagnosis. In the first stage, Fisher score is used for feature selection to reduce the feature space dimension of a data set. The second stage is classification stage in which least squares support vector machines classifier is performed by using the feature subset selected in the first stage to diagnose cardiac arrhythmia. Performance of the proposed expert system is evaluated by using an arrhythmia data set which is taken from UCI machine learning repository.

Download Full-text

Optimizing Training Data and Hyperparameters of Support Vector Machines Using a Memetic Algorithm

Advances in Intelligent Systems and Computing - Man-Machine Interactions 6 ◽

10.1007/978-3-030-31964-9_22 ◽

2019 ◽

pp. 229-238

Author(s):

Wojciech Dudzik ◽

Michal Kawulok ◽

Jakub Nalepa

Keyword(s):

Support Vector Machines ◽

Memetic Algorithm ◽

Training Data ◽

Support Vector ◽

Vector Machines

Download Full-text

DETECTION OF DISEASE SYMPTOMS ON HYPERSPECTRAL 3D PLANT MODELS

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iii-7-89-2016 ◽

2016 ◽

Vol III-7 ◽

pp. 89-96 ◽

Cited By ~ 2

Author(s):

Ribana Roscher ◽

Jan Behmann ◽

Anne-Katrin Mahlein ◽

Jan Dupuis ◽

Heiner Kuhlmann ◽

...

Keyword(s):

Support Vector Machines ◽

Sparse Representation ◽

Test Data ◽

Training Data ◽

Hyperspectral Images ◽

Spectral Information ◽

Support Vector ◽

Leaf Spot Disease ◽

Disease Symptoms ◽

Vector Machines

We analyze the benefit of combining hyperspectral images information with 3D geometry information for the detection of <i>Cercospora</i> leaf spot disease symptoms on sugar beet plants. Besides commonly used one-class Support Vector Machines, we utilize an unsupervised sparse representation-based approach with group sparsity prior. Geometry information is incorporated by representing each sample of interest with an inclination-sorted dictionary, which can be seen as an 1D topographic dictionary. We compare this approach with a sparse representation based approach without geometry information and One-Class Support Vector Machines. One-Class Support Vector Machines are applied to hyperspectral data without geometry information as well as to hyperspectral images with additional pixelwise inclination information. Our results show a gain in accuracy when using geometry information beside spectral information regardless of the used approach. However, both methods have different demands on the data when applied to new test data sets. One-Class Support Vector Machines require full inclination information on test and training data whereas the topographic dictionary approach only need spectral information for reconstruction of test data once the dictionary is build by spectra with inclination.

Download Full-text

Information Gain, Correlation and Support Vector Machines

Feature Extraction - Studies in Fuzziness and Soft Computing ◽

10.1007/978-3-540-35488-8_23 ◽

2008 ◽

pp. 463-470 ◽

Cited By ~ 32

Author(s):

Danny Roobaert ◽

Grigoris Karakoulas ◽

Nitesh V. Chawla

Keyword(s):

Support Vector Machines ◽

Information Gain ◽

Support Vector ◽

Vector Machines

Download Full-text

Using Remote Monitoring and Machine Learning to Classify Slam Events of Wave Piercing Catamarans

The International Journal of Maritime Engineering ◽

10.5750/ijme.v163ia3.797 ◽

2021 ◽

Vol 163 (A3) ◽

Author(s):

B Shabani ◽

J Ali-Lavroff ◽

D S Holloway ◽

S Penev ◽

D Dessi ◽

...

Keyword(s):

Machine Learning ◽

Monitoring System ◽

Probability Distributions ◽

Feature Space ◽

Support Vector ◽

Acceleration Threshold ◽

Vector Machines ◽

Structural Responses ◽

As Stress

An onboard monitoring system can measure features such as stress cycles counts and provide warnings due to slamming. Considering current technology trends there is the opportunity of incorporating machine learning methods into monitoring systems. A hull monitoring system has been developed and installed on a 111 m wave piercing catamaran (Hull 091) to remotely monitor the ship kinematics and hull structural responses. Parallel to that, an existing dataset of a similar vessel (Hull 061) was analysed using unsupervised and supervised learning models; these were found to be beneficial for the classification of bow entry events according to key kinematic parameters. A comparison of different algorithms including linear support vector machines, naïve Bayes and decision tree for the bow entry classification were conducted. In addition, using empirical probability distributions, the likelihood of wet-deck slamming was estimated given a vertical bow acceleration threshold of 1 in head seas, clustering the feature space with the approximate probabilities of 0.001, 0.030 and 0.25.

Download Full-text

Learning to Identify At-Risk Students in Distance Education Using Interaction Counts

Revista de Informática Teórica e Aplicada ◽

10.22456/2175-2745.62211 ◽

2016 ◽

Vol 23 (2) ◽

pp. 124 ◽

Cited By ~ 2

Author(s):

Douglas Detoni ◽

Cristian Cechinel ◽

Ricardo Araujo Matsumura ◽

Daniela Francisco Brauner

Keyword(s):

Machine Learning ◽

At Risk ◽

At Risk Students ◽

Drop Out ◽

Support Vector ◽

Learning Models ◽

Data Set ◽

Student Dropout ◽

Vector Machines ◽

Machine Learning Models

Student dropout is one of the main problems faced by distance learning courses. One of the major challenges for researchers is to develop methods to predict the behavior of students so that teachers and tutors are able to identify at-risk students as early as possible and provide assistance before they drop out or fail in their courses. Machine Learning models have been used to predict or classify students in these settings. However, while these models have shown promising results in several settings, they usually attain these results using attributes that are not immediately transferable to other courses or platforms. In this paper, we provide a methodology to classify students using only interaction counts from each student. We evaluate this methodology on a data set from two majors based on the Moodle platform. We run experiments consisting of training and evaluating three machine learning models (Support Vector Machines, Naive Bayes and Adaboost decision trees) under different scenarios. We provide evidences that patterns from interaction counts can provide useful information for classifying at-risk students. This classification allows the customization of the activities presented to at-risk students (automatically or through tutors) as an attempt to avoid students drop out.

Download Full-text

Convex Hull in Feature Space for Support Vector Machines

Advances in Artificial Intelligence — IBERAMIA 2002 - Lecture Notes in Computer Science ◽

10.1007/3-540-36131-6_42 ◽

2002 ◽

pp. 411-419 ◽

Cited By ~ 12

Author(s):

Edgar Osuna ◽

Osberth De Castro

Keyword(s):

Support Vector Machines ◽

Convex Hull ◽

Feature Space ◽

Support Vector ◽

Vector Machines

Download Full-text

Support Vector Machines Training Data Selection Using a Genetic Algorithm

Lecture Notes in Computer Science - Structural, Syntactic, and Statistical Pattern Recognition ◽

10.1007/978-3-642-34166-3_61 ◽

2012 ◽

pp. 557-565 ◽

Cited By ~ 29

Author(s):

Michal Kawulok ◽

Jakub Nalepa

Keyword(s):

Genetic Algorithm ◽

Support Vector Machines ◽

Training Data ◽

Data Selection ◽

Support Vector ◽

Vector Machines ◽

Training Data Selection

Download Full-text

A Study on Machine Vision Techniques for the Inspection of Health Personnels’ Protective Suits for the Treatment of Patients in Extreme Isolation

Electronics ◽

10.3390/electronics8070743 ◽

2019 ◽

Vol 8 (7) ◽

pp. 743 ◽

Cited By ~ 1

Author(s):

Alice Stazio ◽

Juan G. Victores ◽

David Estevez ◽

Carlos Balaguer

Keyword(s):

Logistic Regression ◽

Machine Vision ◽

Training Data ◽

Gradient Boosting ◽

Support Vector ◽

Classification Algorithms ◽

Adaptive Boosting ◽

Blood Stains ◽

Extreme Gradient Boosting ◽

Vector Machines

The examination of Personal Protective Equipment (PPE) to assure the complete integrity of health personnel in contact with infected patients is one of the most necessary tasks when treating patients affected by infectious diseases, such as Ebola. This work focuses on the study of machine vision techniques for the detection of possible defects on the PPE that could arise after contact with the aforementioned pathological patients. A preliminary study on the use of image classification algorithms to identify blood stains on PPE subsequent to the treatment of the infected patient is presented. To produce training data for these algorithms, a synthetic dataset was generated from a simulated model of a PPE suit with blood stains. Furthermore, the study proceeded with the utilization of images of the PPE with a physical emulation of blood stains, taken by a real prototype. The dataset reveals a great imbalance between positive and negative samples; therefore, all the selected classification algorithms are able to manage this kind of data. Classifiers range from Logistic Regression and Support Vector Machines, to bagging and boosting techniques such as Random Forest, Adaptive Boosting, Gradient Boosting and eXtreme Gradient Boosting. All these algorithms were evaluated on accuracy, precision, recall and F 1 score; and additionally, execution times were considered. The obtained results report promising outcomes of all the classifiers, and, in particular Logistic Regression resulted to be the most suitable classification algorithm in terms of F 1 score and execution time, considering both datasets.

Download Full-text