Nearest labelset using double distances for multi-label classification

PeerJ Computer Science ◽

10.7717/peerj-cs.242 ◽

2019 ◽

Vol 5 ◽

pp. e242

Author(s):

Hyukjun Gweon ◽

Matthias Schonlau ◽

Stefan H. Steiner

Keyword(s):

Maximum Likelihood ◽

Supervised Learning ◽

Feature Space ◽

Training Data ◽

Model Parameters ◽

Data Sets ◽

Weighted Sum ◽

Novel Approach ◽

Binomial Regression ◽

F Measure

Multi-label classification is a type of supervised learning where an instance may belong to multiple labels simultaneously. Predicting each label independently has been criticized for not exploiting any correlation between labels. In this article we propose a novel approach, Nearest Labelset using Double Distances (NLDD), that predicts the labelset observed in the training data that minimizes a weighted sum of the distances in both the feature space and the label space to the new instance. The weights specify the relative tradeoff between the two distances. The weights are estimated from a binomial regression of the number of misclassified labels as a function of the two distances. Model parameters are estimated by maximum likelihood. NLDD only considers labelsets observed in the training data, thus implicitly taking into account label dependencies. Experiments on benchmark multi-label data sets show that the proposed method on average outperforms other well-known approaches in terms of 0/1 loss and multi-label accuracy, and ranks second on the F-measure (after a method called ECC) and on Hamming loss (after a method called RF-PCT).
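
The selection rule described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: both distances are Euclidean, the label-space position of the new instance is a vector of per-label probability estimates (`p_new`, hypothetical), and the weights `w_feature` and `w_label` are fixed by hand rather than estimated by binomial regression.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_labelset(x_new, p_new, train_X, train_Y, w_feature=1.0, w_label=1.0):
    """Return the training labelset with the smallest weighted double distance.

    x_new   : feature vector of the new instance
    p_new   : assumed per-label probability estimates for the new instance
    train_X : feature vectors of the training instances
    train_Y : 0/1 labelsets of the training instances
    """
    best_score, best_labelset = None, None
    for x_i, y_i in zip(train_X, train_Y):
        # Weighted sum of the feature-space and label-space distances.
        score = w_feature * euclidean(x_new, x_i) + w_label * euclidean(p_new, y_i)
        if best_score is None or score < best_score:
            best_score, best_labelset = score, tuple(y_i)
    return best_labelset


# Toy data, invented for illustration only.
train_X = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
train_Y = [(1, 0, 0), (1, 1, 0), (0, 0, 1)]
x_new = (0.4, 0.4)
p_new = (0.9, 0.8, 0.1)

both = nearest_labelset(x_new, p_new, train_X, train_Y)
features_only = nearest_labelset(x_new, p_new, train_X, train_Y, w_label=0.0)
```

In this toy example the nearest neighbour in feature space alone carries the labelset `(1, 0, 0)`, while adding the label-space distance moves the prediction to `(1, 1, 0)`. Because only labelsets seen in training can be returned, label combinations that never co-occur are never predicted.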

MULFE: Multi-Label Learning via Label-Specific Feature Space Ensemble

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3451392 ◽

2021 ◽

Vol 16 (1) ◽

pp. 1-24

Author(s):

Yaojin Lin ◽

Qinghua Hu ◽

Jinghua Liu ◽

Xingquan Zhu ◽

Xindong Wu

Keyword(s):

Empirical Studies ◽

Feature Space ◽

Training Data ◽

Data Sets ◽

Learning Framework ◽

Feature Spaces ◽

Public Data ◽

Margin Distribution ◽

Label Correlations ◽

Label Correlation

In multi-label learning, label correlations commonly exist in the data. Such correlation not only provides useful information, but also imposes significant challenges for multi-label learning. Recently, label-specific feature embedding has been proposed to explore label-specific features from the training data, and uses features highly customized to the multi-label set for learning. While such feature embedding methods have demonstrated good performance, the creation of the feature embedding space is only based on a single label, without considering label correlations in the data. In this article, we propose to combine multiple label-specific feature spaces, using label correlation, for multi-label learning. The proposed algorithm, multi-label-specific feature space ensemble (MULFE), takes into consideration label-specific features, label correlation, and the weighted ensemble principle to form a learning framework. By conducting clustering analysis on each label’s negative and positive instances, MULFE first creates features customized to each label. After that, MULFE utilizes the label correlation to optimize the margin distribution of the base classifiers which are induced by the related label-specific feature spaces. By combining multiple label-specific features, label correlation based weighting, and ensemble learning, MULFE achieves the maximum margin multi-label classification goal through the underlying optimization framework. Empirical studies on 10 public data sets demonstrate the effectiveness of MULFE.

Experiments of Image Classification Using Dissimilarity Spaces Built with Siamese Networks

Sensors ◽

10.3390/s21051573 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1573

Author(s):

Loris Nanni ◽

Giovanni Minchio ◽

Sheryl Brahnam ◽

Gianluca Maguolo ◽

Alessandra Lumini

Keyword(s):

Vector Space ◽

Image Classification ◽

Ad Hoc ◽

Feature Space ◽

Medical Data ◽

Training Data ◽

Data Sets ◽

Large Set ◽

Clustering Methods ◽

Siamese Networks

Traditionally, classifiers are trained to predict patterns within a feature space. The image classification system presented here trains classifiers to predict patterns within a vector space by combining the dissimilarity spaces generated by a large set of Siamese Neural Networks (SNNs). A set of centroids from the patterns in the training data sets is calculated with supervised k-means clustering. The centroids are used to generate the dissimilarity space via the Siamese networks. The vector space descriptors are extracted by projecting patterns onto the similarity spaces, and SVMs classify an image by its dissimilarity vector. The versatility of the proposed approach in image classification is demonstrated by evaluating the system on different types of images across two domains: two medical data sets and two animal audio data sets with vocalizations represented as images (spectrograms). Results show that the proposed system’s performance is competitive with the best-performing methods in the literature, obtaining state-of-the-art performance on one of the medical data sets, and does so without ad-hoc optimization of the clustering methods on the tested data sets.
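
The projection step described in this abstract, in which a pattern is represented by its dissimilarity to each centroid, can be sketched as follows. This is only an illustration under stated assumptions: plain Euclidean distance stands in for the dissimilarity learned by a Siamese network, the centroids are hand-picked instead of being computed by supervised k-means, and the downstream SVM is omitted. The names `dissimilarity_vector` and `centroids` are hypothetical.

```python
import math


def euclidean(a, b):
    """Stand-in dissimilarity; the paper uses trained Siamese networks instead."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dissimilarity_vector(pattern, centroids, dissimilarity=euclidean):
    """Describe a pattern by its dissimilarity to every centroid.

    The result has one component per centroid and is the descriptor that a
    classifier such as an SVM would then be trained on.
    """
    return [dissimilarity(pattern, c) for c in centroids]


# Hand-picked centroids, invented for illustration only.
centroids = [(0.0, 0.0), (3.0, 4.0)]
descriptor = dissimilarity_vector((0.0, 0.0), centroids)
```

The length of the descriptor equals the number of centroids, so the choice of how many centroids to compute per class directly sets the dimensionality of the space in which the classifier operates.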

The Weibull Birnbaum-Saunders Distribution And Its Applications

Statistics Optimization & Information Computing ◽

10.19139/soic-2310-5070-887 ◽

2020 ◽

Vol 9 (1) ◽

pp. 61-81

Author(s):

Lazhar BENKHELIFA

Keyword(s):

Maximum Likelihood ◽

Estimation Method ◽

Likelihood Estimation ◽

Real Data ◽

Reliability Estimation ◽

Maximum Likelihood Estimates ◽

Model Parameters ◽

Data Sets ◽

Proposed Model ◽

Modeling Data

A new lifetime model, with four positive parameters, called the Weibull Birnbaum-Saunders distribution is proposed. The proposed model extends the Birnbaum-Saunders distribution and provides great flexibility in modeling data in practice. Some mathematical properties of the new distribution are obtained including expansions for the cumulative and density functions, moments, generating function, mean deviations, order statistics and reliability. Estimation of the model parameters is carried out by the maximum likelihood estimation method. A simulation study is presented to show the performance of the maximum likelihood estimates of the model parameters. The flexibility of the new model is examined by applying it to two real data sets.

Discovering Latent Class Labels for Multi-Label Learning

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/423 ◽

2020 ◽

Author(s):

Jun Huang ◽

Linchuan Xu ◽

Jing Wang ◽

Lei Feng ◽

Kenji Yamanishi

Keyword(s):

Large Scale ◽

Latent Class ◽

Training Data ◽

Data Sets ◽

Robust Learning ◽

Large Scale Data ◽

Novel Approach ◽

Fixed Set ◽

Class Labels ◽

Scale Data

Existing multi-label learning (MLL) approaches mainly assume all the labels are observed and construct classification models with a fixed set of target labels (known labels). However, in some real applications, multiple latent labels may exist outside this set and hide in the data, especially for large-scale data sets. Discovering and exploring the latent labels hidden in the data may not only find interesting knowledge but also help us to build a more robust learning model. In this paper, a novel approach named DLCL (i.e., Discovering Latent Class Labels for MLL) is proposed which can not only discover the latent labels in the training data but also predict new instances with the latent and known labels simultaneously. Extensive experiments show a competitive performance of DLCL against other state-of-the-art MLL approaches.

A New Flexible Three-Parameter Model: Properties, Clayton Copula, and Modeling Real Data

Symmetry ◽

10.3390/sym12030440 ◽

2020 ◽

Vol 12 (3) ◽

pp. 440 ◽

Cited By ~ 8

Author(s):

Abdulhakim A. Al-babtain ◽

I. Elbatal ◽

Haitham M. Yousof

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Real Data ◽

Likelihood Method ◽

Model Parameters ◽

Simple Type ◽

Data Sets ◽

New Model ◽

Clayton Copula ◽

Mathematical Properties

In this article, we introduced a new extension of the binomial-exponential 2 distribution. We discussed some of its structural mathematical properties. A simple type Copula-based construction is also presented to construct the bivariate- and multivariate-type distributions. We estimated the model parameters via the maximum likelihood method. Finally, we illustrated the importance of the new model by the study of two real data applications to show the flexibility and potentiality of the new model in modeling skewed and symmetric data sets.

SEMI-SUPERVISED SEQUENCE CLASSIFICATION WITH HMMs

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001405004034 ◽

2005 ◽

Vol 19 (02) ◽

pp. 165-182 ◽

Cited By ~ 7

Author(s):

SHI ZHONG

Keyword(s):

Supervised Learning ◽

Learning Strategies ◽

Test Data ◽

Unlabeled Data ◽

Training Data ◽

Model Complexity ◽

Model Parameters ◽

Training Process ◽

Transductive Learning ◽

Model Training

Using unlabeled data to help supervised learning has become an increasingly attractive methodology and has proven effective in many applications. This paper applies semi-supervised classification algorithms, based on hidden Markov models, to classify sequences. For model-based classification, semi-supervised learning amounts to using both labeled and unlabeled data to train model parameters. We examine three different strategies of using labeled and unlabeled data in the model training process. These strategies differ in how and when labeled and unlabeled data contribute to the model training process. We also compare regular semi-supervised learning, where there are separate unlabeled training data and unlabeled test data, with transductive learning, where we do not differentiate between unlabeled training data and unlabeled test data. Our experimental results on synthetic and real EEG time-series show that substantially improved classification accuracy can be achieved by these semi-supervised learning strategies. The effect of model complexity on semi-supervised learning is also studied in our experiments.

Persistent self-supervised learning: From stereo to monocular vision for obstacle avoidance

International Journal of Micro Air Vehicles ◽

10.1177/1756829318756355 ◽

2018 ◽

Vol 10 (2) ◽

pp. 186-206 ◽

Cited By ~ 3

Author(s):

Kevin van Hecke ◽

Guido de Croon ◽

Laurens van der Maaten ◽

Daniel Hennes ◽

Dario Izzo

Keyword(s):

Supervised Learning ◽

Stereo Vision ◽

Large Data ◽

Monocular Vision ◽

Training Data ◽

Data Sets ◽

Learning Approaches ◽

Robust Learning ◽

Flying Robot ◽

First Time

Self-supervised learning is a reliable learning mechanism in which a robot uses an original, trusted sensor cue for training to recognize an additional, complementary sensor cue. We study for the first time in self-supervised learning how a robot’s learning behavior should be organized, so that the robot can keep performing its task in the case that the original cue becomes unavailable. We study this persistent form of self-supervised learning in the context of a flying robot that has to avoid obstacles based on distance estimates from the visual cue of stereo vision. Over time it will learn to also estimate distances based on monocular appearance cues. A strategy is introduced that has the robot switch from flight based on stereo to flight based on monocular vision, with stereo vision purely used as “training wheels” to avoid imminent collisions. This strategy is shown to be an effective approach to the “feedback-induced data bias” problem as also experienced in learning from demonstration. Both simulations and real-world experiments with a stereo vision equipped ARDrone2 show the feasibility of this approach, with the robot successfully using monocular vision to avoid obstacles in a 5 × 5 m room. The experiments show the potential of persistent self-supervised learning as a robust learning approach to enhance the capabilities of robots. Moreover, the abundant training data coming from the robot’s own sensors make it possible to gather the large data sets necessary for deep learning approaches.

The Zubair-Inverse Lomax Distribution with Applications

Asian Journal of Probability and Statistics ◽

10.9734/ajpas/2020/v8i330206 ◽

2020 ◽

pp. 1-14

Author(s):

Jamilu Yunusa Falgore

Keyword(s):

Maximum Likelihood ◽

Moment Generating Function ◽

Likelihood Method ◽

Model Parameters ◽

Data Sets ◽

Monte Carlo Simulation Study ◽

New Model ◽

Inverse Lomax Distribution ◽

The Right ◽

Decreasing Functions

In this article, an extension of the Inverse Lomax (IL) distribution with the Zubair-G family is considered. Various statistical properties of the new model were derived, including the moment generating function, Rényi entropy, and order statistics. A Monte Carlo simulation study was presented to evaluate the performance of the maximum likelihood estimators. Depending on the parameter values, the new model can be right-skewed, constant, or decreasing. We discussed the estimation of the model parameters by the maximum likelihood method. The application of the new model to the data sets indicates that the new model is better than the existing competitors, as it attains the minimum values of the statistical criteria.

Enhancing Image Diagnosis by the Implementation of Transfer Classifiers

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c4060.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 999-1002

Keyword(s):

Supervised Learning ◽

Transfer Learning ◽

Training Data ◽

Data Sets ◽

Learning Approaches ◽

Learning Techniques ◽

Image Diagnosis ◽

Sensitivity Problem ◽

Common Application ◽

Target Data

Images generated today from a variety of sources can be difficult for a user to compare for similarity or to analyze for further use because of their segmentation policies. This unconventionality can generate many errors, which makes previously used traditional methodologies such as supervised learning techniques less resourceful, since they require a huge quantity of labelled training data that mirrors the desired target data. This paper thus puts forward an alternative technique, transfer learning, for use in image diagnosis so that efficiency and accuracy can be achieved. This mechanism deals with variation between the desired data and the actual data used for training, and with outlier sensitivity, which ultimately enhances the predictions by giving better results in various areas, thus leaving the traditional methodologies behind. The analysis further discusses three types of transfer classifiers that can be applied using only a small volume of training data, and contrasts them with the traditional method, which requires huge quantities of training data having attributes with slight changes. The three classifiers were compared with one another and with the traditional methodology on a very common application from daily life. Commonly occurring problems such as the outlier sensitivity problem were also taken into consideration, and measures were taken to recognise and address them. Further research showed that transfer learning outperforms the conventional supervised learning approaches when only a small amount of characteristic training data is provided, reducing the stratification errors to a great extent.

Improved supervised learning methods for EoR parameters reconstruction

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stz2429 ◽

2019 ◽

Vol 490 (1) ◽

pp. 371-384 ◽

Cited By ~ 3

Author(s):

Aristide Doussot ◽

Evan Eames ◽

Benoit Semelin

Keyword(s):

Neural Network ◽

Neural Networks ◽

Bayesian Inference ◽

Maximum Likelihood ◽

Supervised Learning ◽

Confidence Level ◽

Thermal Noise ◽

Model Parameters ◽

Learning Methods ◽

Parameter Values

Within the next few years, the Square Kilometre Array (SKA) or one of its pathfinders will hopefully detect the 21-cm signal fluctuations from the Epoch of Reionization (EoR). Then, the goal will be to accurately constrain the underlying astrophysical parameters. Currently, this is mainly done with Bayesian inference. Recently, neural networks have been trained to perform inverse modelling and, ideally, predict the maximum-likelihood values of the model parameters. We build on these by improving the accuracy of the predictions using several supervised learning methods: neural networks, kernel regressions, or ridge regressions. Based on a large training set of 21-cm power spectra, we compare the performances of these methods. When using a noise-free signal generated by the model itself as input, we improve on previous neural network accuracy by one order of magnitude and, using a local ridge kernel regression, we gain another factor of a few. We then reach an accuracy level on the reconstruction of the maximum-likelihood parameter values of a few per cent compared to the 1σ confidence level due to SKA thermal noise (as estimated with Bayesian inference). For an input signal affected by an SKA-like thermal noise but constrained to yield the same maximum-likelihood parameter values as the noise-free signal, our neural network exhibits an error within half of the 1σ confidence level due to the SKA thermal noise. This accuracy improves to 10 per cent of the 1σ level when using the local ridge kernel. We are thus reaching a performance level where supervised learning methods are a viable alternative to determine the maximum-likelihood parameter values.
