Ranking Support Vector Machine with Kernel Approximation

Computational Intelligence and Neuroscience ◽

10.1155/2017/4629534 ◽

2017 ◽

Vol 2017 ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Kai Chen ◽

Rongchun Li ◽

Yong Dou ◽

Zhengfa Liang ◽

Qi Lv

Keyword(s):

Support Vector Machine ◽

Learning To Rank ◽

Support Vector ◽

Kernel Matrix ◽

Ranking Problem ◽

Ranking Algorithms ◽

Hinge Loss ◽

Kernel Approximation ◽

Truncated Newton ◽

Nonlinear Kernels

Learning to rank algorithm has become important in recent years due to its successful application in information retrieval, recommender system, and computational biology, and so forth. Ranking support vector machine (RankSVM) is one of the state-of-art ranking models and has been favorably used. Nonlinear RankSVM (RankSVM with nonlinear kernels) can give higher accuracy than linear RankSVM (RankSVM with a linear kernel) for complex nonlinear ranking problem. However, the learning methods for nonlinear RankSVM are still time-consuming because of the calculation of kernel matrix. In this paper, we propose a fast ranking algorithm based on kernel approximation to avoid computing the kernel matrix. We explore two types of kernel approximation methods, namely, the Nyström method and random Fourier features. Primal truncated Newton method is used to optimize the pairwise L2-loss (squared Hinge-loss) objective function of the ranking model after the nonlinear kernel approximation. Experimental results demonstrate that our proposed method gets a much faster training speed than kernel RankSVM and achieves comparable or better performance over state-of-the-art ranking algorithms.

Download Full-text

Robust one-class support vector machine with rescaled hinge loss function

Pattern Recognition ◽

10.1016/j.patcog.2018.07.015 ◽

2018 ◽

Vol 84 ◽

pp. 152-164 ◽

Cited By ~ 11

Author(s):

Hong-Jie Xing ◽

Man Ji

Keyword(s):

Support Vector Machine ◽

Loss Function ◽

Support Vector ◽

Hinge Loss

Download Full-text

Anomaly Detection for Industrial Control Networks Based on Improved One-Class Support Vector Machine

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421500129 ◽

2020 ◽

pp. 2150012

Author(s):

Haicheng Qu ◽

Jianzhong Zhou ◽

Jitao Qin ◽

Xiaorong Tian

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Anomaly Detection ◽

Support Vector ◽

Industrial Control ◽

Control Networks ◽

Detection Algorithms ◽

Hinge Loss ◽

Feature Fitting ◽

Industrial Control Networks

In traditional network anomaly detection algorithms, the anomaly threshold needs to be defined manually. Keeping this as background, this study proposes an anomaly detection algorithm (VAEOCSVM), which combines the variable auto-encoder (VAE) and one-class support vector machine (OCSVM) to realize anomaly detection in industrial control networks. First, the VAE model is used to obtain the distribution of the original normal sample data represented by the low-dimensional code; the reconstruction error of the VAE model is merged into the new input. Then, using OCSVM’s hinge-loss objective function and the random Fourier feature fitting radial basis function (RBF) kernel method, the OCSVM model is represented and solved using the deep neural network and gradient descent method. Finally, the decision function of the OCSVM model is constructed by using the solved parameter information to realize the detection of abnormal data. The proposed algorithm is compared with other machine-learning-based anomaly detection algorithms in terms of multiple indicators such as precision, recall, and [Formula: see text] score. The experimental results using various datasets show that the proposed algorithm has a better outlier recognition ability than the machine-learning-based anomaly detection algorithms.

Download Full-text

Stochastic Subgradient for Large-Scale Support Vector Machine Using the Generalized Pinball Loss Function

Symmetry ◽

10.3390/sym13091652 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1652

Author(s):

Wanida Panup ◽

Rabian Wangkeeree

Keyword(s):

Support Vector Machine ◽

Loss Function ◽

Gradient Descent ◽

Large Scale ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Support Vector ◽

Hinge Loss ◽

Gradient Descent Algorithm ◽

Pinball Loss

In this paper, we propose a stochastic gradient descent algorithm, called stochastic gradient descent method-based generalized pinball support vector machine (SG-GPSVM), to solve data classification problems. This approach was developed by replacing the hinge loss function in the conventional support vector machine (SVM) with a generalized pinball loss function. We show that SG-GPSVM is convergent and that it approximates the conventional generalized pinball support vector machine (GPSVM). Further, the symmetric kernel method was adopted to evaluate the performance of SG-GPSVM as a nonlinear classifier. Our suggested algorithm surpasses existing methods in terms of noise insensitivity, resampling stability, and accuracy for large-scale data scenarios, according to the experimental results.

Download Full-text

Bearing fault diagnosis via kernel matrix construction based support vector machine

Journal of Vibroengineering ◽

10.21595/jve.2017.18482 ◽

2017 ◽

Vol 19 (5) ◽

pp. 3445-3461 ◽

Cited By ~ 5

Author(s):

Tefang Chen ◽

Chenxi Wu ◽

Rong Jiang

Keyword(s):

Support Vector Machine ◽

Fault Diagnosis ◽

Support Vector ◽

Kernel Matrix ◽

Bearing Fault ◽

Bearing Fault Diagnosis ◽

Matrix Construction

Download Full-text

Improved Sparsity of Support Vector Machine with Robustness Towards Label Noise Based on Rescaled $$\alpha $$-Hinge Loss with Non-smooth Regularizer

Neural Processing Letters ◽

10.1007/s11063-020-10346-0 ◽

2020 ◽

Vol 52 (3) ◽

pp. 2211-2239

Author(s):

Manisha Singla ◽

Debdas Ghosh ◽

K. K. Shukla

Keyword(s):

Support Vector Machine ◽

Support Vector ◽

Label Noise ◽

Hinge Loss

Download Full-text

In-sample Model Selection for Trimmed Hinge Loss Support Vector Machine

Neural Processing Letters ◽

10.1007/s11063-012-9235-z ◽

2012 ◽

Vol 36 (3) ◽

pp. 275-283 ◽

Cited By ~ 16

Author(s):

Davide Anguita ◽

Alessandro Ghio ◽

Luca Oneto ◽

Sandro Ridella

Keyword(s):

Support Vector Machine ◽

Model Selection ◽

Support Vector ◽

Hinge Loss ◽

Selection For

Download Full-text

Chunk incremental learning for cost-sensitive hinge loss support vector machine

Pattern Recognition ◽

10.1016/j.patcog.2018.05.023 ◽

2018 ◽

Vol 83 ◽

pp. 196-208 ◽

Cited By ~ 20

Author(s):

Bin Gu ◽

Xin Quan ◽

Yunhua Gu ◽

Victor S. Sheng ◽

Guansheng Zheng

Keyword(s):

Support Vector Machine ◽

Incremental Learning ◽

Support Vector ◽

Hinge Loss

Download Full-text

Parameter Tuning via Kernel Matrix Approximation for Support Vector Machine

Journal of Computers ◽

10.4304/jcp.7.8.2047-2054 ◽

2012 ◽

Vol 7 (8) ◽

Author(s):

Chenhao Yang ◽

Lizhong Ding ◽

Shizhong Liao

Keyword(s):

Support Vector Machine ◽

Parameter Tuning ◽

Support Vector ◽

Kernel Matrix ◽

Matrix Approximation

Download Full-text

Extended Robust Support Vector Machine Based on Financial Risk Minimization

Neural Computation ◽

10.1162/neco_a_00647 ◽

2014 ◽

Vol 26 (11) ◽

pp. 2541-2569 ◽

Cited By ~ 5

Author(s):

Akiko Takeda ◽

Shuhei Fujiwara ◽

Takafumi Kanamori

Keyword(s):

At Risk ◽

Support Vector Machine ◽

Value At Risk ◽

Financial Risk ◽

Risk Measure ◽

Conditional Value At Risk ◽

Support Vector ◽

Risk Minimization ◽

Hinge Loss ◽

Margin Distribution

Financial risk measures have been used recently in machine learning. For example, [Formula: see text]-support vector machine ([Formula: see text]-SVM) minimizes the conditional value at risk (CVaR) of margin distribution. The measure is popular in finance because of the subadditivity property, but it is very sensitive to a few outliers in the tail of the distribution. We propose a new classification method, extended robust SVM (ER-SVM), which minimizes an intermediate risk measure between the CVaR and value at risk (VaR) by expecting that the resulting model becomes less sensitive than [Formula: see text]-SVM to outliers. We can regard ER-SVM as an extension of robust SVM, which uses a truncated hinge loss. Numerical experiments imply the ER-SVM’s possibility of achieving a better prediction performance with proper parameter setting.

Download Full-text

Clustering using similarity and kernel matrices

10.12681/eadd/37387 ◽

2014 ◽

Author(s):

Γρηγόριος Τζώρτζης

Keyword(s):

Support Vector Machine ◽

Objective Function ◽

Mixture Models ◽

Multiple Kernel Learning ◽

Kernel Learning ◽

Support Vector ◽

Kernel Matrix ◽

Multiple Kernel

Η παρούσα διατριβή μελετά το πρόβλημα της ομαδοποίησης (clustering), που έχει ως στόχο τον διαχωρισμό ενός συνόλου δεδομένων σε ομάδες (clusters), χωρίς τη χρήση επίβλεψης, ώστε τα δεδομένα που ανήκουν στη ίδια ομάδα να είναι όμοια μεταξύ τους και ανόμοια με αυτά των άλλων ομάδων, βάσει ενός μέτρου ομοιότητας/ανομοιότητας. Συγκεκριμένα, η διατριβή επικεντρώνεται στην παρουσίαση μεθόδων ομαδοποίησης που αφορούν τρεις βασικούς θεματικούς άξονες: α) την ομαδοποίηση δεδομένων για τα οποία έχουμε διαθέσιμο μόνο τον πίνακα εγγύτητας και όχι τα ίδια τα δεδομένα (proximity-based clustering), β) την μάθηση με πολλαπλές όψεις (multi-view learning), όπου για τα ίδια δεδομένα έχουμε στη διάθεσή μας πολλαπλές αναπαραστάσεις (όψεις) που προέρχονται από διαφορετικές πηγές ή/και διαφορετικούς χώρους χαρακτηριστικών και γ) την μάθηση με πολλαπλούς πυρήνες (multiple kernel learning), όπου ταυτόχρονα με την ομαδοποίηση θέλουμε να μάθουμε και τον κατάλληλο πυρήνα (kernel) για τα δεδομένα. Συνήθως ο πυρήνας παραμετροποιείται ως ένας συνδυασμός από δοθέντες πυρήνες (basis kernels) και στοχεύουμε στην μάθηση κατάλληλων τιμών για τις παραμέτρους του συνδυασμού.Αρχικά προτείνεται μια μέθοδος για την αντιμετώπιση του γνωστού προβλήματος της αρχικοποίησης (initialization problem), από το οποίο πάσχει ο αλγόριθμος k-means. Συγκεκριμένα, τροποποιούμε το κριτήριο (objective function) του k-means έτσι ώστε να δίδεται μεγαλύτερη έμφαση στην ελαχιστοποίηση των ομάδων που στην τρέχουσα επανάληψη εμφανίζουν μεγάλη διακύμανση (intra-cluster variance). Κατ’ αυτόν τον τρόπο ο χώρος λύσεων σταδιακά περιορίζεται σε ομάδες που εμφανίζουν παρεμφερή διακύμανση, το οποίο επιτρέπει στη μέθοδό μας να εντοπίζει σε συστηματική βάση λύσεις καλύτερης ποιότητας σε σχέση με τον k-means, καθώς επανεκκινείται από τυχαία αρχικά κέντρα. Επιπλέον, παρουσιάζεται η προσαρμογή της μεθόδου ώστε να μπορεί να εφαρμοστεί για ομαδοποίηση με πίνακα ομοιότητας (kernel matrix), τροποποιώντας το κριτήριο του αλγορίθμου kernel k -means.Στη συνέχεια, η διατριβή εστιάζεται στο πρόβλημα της ομαδοποίησης με πολλαπλές όψεις. Η βασική συνεισφορά στο αντικείμενο αυτό σχετίζεται με την ανάθεση βαρών στις όψεις, τα οποία μαθαίνονται αυτόματα και τα οποία αντικατοπτρίζουν την ποιότητα των όψεων. Οι υπάρχουσες προσεγγίσεις θεωρούν όλες τις όψεις εξίσου σημαντικές, κάτι που μπορεί να οδηγήσει σε σημαντική μείωση της απόδοσης εάν υπάρχουν εκφυλισμένες όψεις (π.χ. όψεις με θόρυβο) στο σύνολο δεδομένων. Ειδικότερα, παρουσιάζονται για το ανωτέρω πρόβλημα δύο διαφορετικές μεθοδολογίες. Στην πρώτη περίπτωση αναπαριστούμε τις όψεις μέσω κυρτών μικτών μοντέλων (convex mixture models) λαμβάνοντας υπόψη τις διαφορετικές στατιστικές ιδιότητές τους και παρουσιάζουμε έναν αλγόριθμο με βάρη στις όψεις και έναν χωρίς βάρη. Στην δεύτερη περίπτωση αναπαριστούμε την κάθε όψη μέσω ενός πίνακα ομοιότητας (kernel matrix) και μαθαίνουμε ένα συνδυασμό με βάρη από τους πίνακες αυτούς. Το προτεινόμενο μοντέλο διαθέτει μία παράμετρο που ελέγχει την αραιότητα των βαρών, επιτρέποντας την καλύτερη προσαρμογή του συνδυασμού στα δεδομένα.Η τελευταία ενότητα της διατριβής αφορά στην ομαδοποίηση με πολλαπλούς πυρήνες, όπου συνήθως το κριτήριο που βελτιστοποιείται είναι το εύρος (margin) της λύσης, όπως είναι γνωστό από τον ταξινομητή SVM (support vector machine). Στην προσέγγιση που προτείνεται, βελτιστοποιείται ο λόγος μεταξύ του εύρους και της διακύμανσης (intra-cluster variance) των ομάδων, λαμβάνοντας έτσι υπόψη τόσο τον διαχωρισμό (separability) τους όσο και το πόσο συμπαγείς (compactness) είναι οι ομάδες, το οποίο δύναται να οδηγήσει σε καλύτερες λύσεις. Έχει δειχθεί ότι το εύρος από μόνο του δεν επαρκεί ως κριτήριο για την μάθηση του κατάλληλου πυρήνα, καθότι μπορεί να γίνει αυθαίρετα μεγάλο μέσω μίας απλής κλιμάκωσης (scaling) του πυρήνα. Αντιθέτως, το κριτήριο που προτείνουμε είναι αμετάβλητο (invariant) σε κλιμακώσεις του πυρήνα και, επιπλέον, το ολικό του βέλτιστο είναι αμετάβλητο ως προς τον τύπο της νόρμας που εφαρμόζεται στους περιορισμούς (constraints) των παραμέτρων του πυρήνα. Τα πειραματικά αποτελέσματα επιβεβαιώνουν τις ιδιότητες του κριτηρίου μας, καθώς και τις αναμενόμενες βελτιωμένες επιδόσεις ομαδοποίησης.

Download Full-text