Retrieving Categorical Emotions Using a Probabilistic Framework to Define Preference Learning Samples

AbstractThis paper proposes a method for solving optimization problems in which the decision-maker cannot evaluate the objective function, but rather can only express a preference such as “this is better than that” between two candidate decision vectors. The algorithm described in this paper aims at reaching the global optimizer by iteratively proposing the decision maker a new comparison to make, based on actively learning a surrogate of the latent (unknown and perhaps unquantifiable) objective function from past sampled decision vectors and pairwise preferences. A radial-basis function surrogate is fit via linear or quadratic programming, satisfying if possible the preferences expressed by the decision maker on existing samples. The surrogate is used to propose a new sample of the decision vector for comparison with the current best candidate based on two possible criteria: minimize a combination of the surrogate and an inverse weighting distance function to balance between exploitation of the surrogate and exploration of the decision space, or maximize a function related to the probability that the new candidate will be preferred. Compared to active preference learning based on Bayesian optimization, we show that our approach is competitive in that, within the same number of comparisons, it usually approaches the global optimum more closely and is computationally lighter. Applications of the proposed algorithm to solve a set of benchmark global optimization problems, for multi-objective optimization, and for optimal tuning of a cost-sensitive neural network classifier for object recognition from images are described in the paper. MATLAB and a Python implementations of the algorithms described in the paper are available at http://cse.lab.imtlucca.it/~bemporad/glis.

Download Full-text

Activity recognition and user preference learning for automated configuration of IoT environments

Proceedings of the 10th International Conference on the Internet of Things ◽

10.1145/3410992.3411003 ◽

2020 ◽

Author(s):

Fahed Alkhabbas ◽

Sadi Alawadi ◽

Romina Spalazzese ◽

Paul Davidsson

Keyword(s):

Activity Recognition ◽

User Preference ◽

Preference Learning

Download Full-text

A probabilistic framework for aligning paired-end RNA-seq data

Bioinformatics ◽

10.1093/bioinformatics/btq336 ◽

2010 ◽

Vol 26 (16) ◽

pp. 1950-1957 ◽

Cited By ~ 17

Author(s):

Yin Hu ◽

Kai Wang ◽

Xiaping He ◽

Derek Y. Chiang ◽

Jan F. Prins ◽

...

Keyword(s):

Rna Seq ◽

Probabilistic Framework

Download Full-text

A Unified Formulation of k-Means, Fuzzy c-Means and Gaussian Mixture Model by the Kolmogorov–Nagumo Average

Entropy ◽

10.3390/e23050518 ◽

2021 ◽

Vol 23 (5) ◽

pp. 518

Author(s):

Osamu Komori ◽

Shinto Eguchi

Keyword(s):

Pareto Distribution ◽

Statistical Data ◽

Learning Algorithm ◽

Survival Function ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Simulation Studies ◽

Probabilistic Framework ◽

Underlying Distribution ◽

Fuzzy C Means

Clustering is a major unsupervised learning algorithm and is widely applied in data mining and statistical data analyses. Typical examples include k-means, fuzzy c-means, and Gaussian mixture models, which are categorized into hard, soft, and model-based clusterings, respectively. We propose a new clustering, called Pareto clustering, based on the Kolmogorov–Nagumo average, which is defined by a survival function of the Pareto distribution. The proposed algorithm incorporates all the aforementioned clusterings plus maximum-entropy clustering. We introduce a probabilistic framework for the proposed method, in which the underlying distribution to give consistency is discussed. We build the minorize-maximization algorithm to estimate the parameters in Pareto clustering. We compare the performance with existing methods in simulation studies and in benchmark dataset analyses to demonstrate its highly practical utilities.

Download Full-text

A Framework for Modelling Economic Regional Location Processes Under Uncertainty

Journal of Quantitative Economics ◽

10.1007/s40953-021-00248-w ◽

2021 ◽

Author(s):

Roy Cerqueti ◽

Eleonora Cutrini

Keyword(s):

Theoretical Model ◽

Theoretical Analysis ◽

Site Selection ◽

Decision Problem ◽

Stochastic Theory ◽

Spatial Location ◽

Probabilistic Framework ◽

Location Patterns ◽

Regional Location ◽

Quantitative Tool

AbstractThis paper deals with the theoretical analysis of the spatial concentration and localization of firms and employees over a set of regions. In particular, it provides a simple site-selection theoretical model to describe the probabilistic framework of the location patterns. The adopted quantitative tool is the stochastic theory of urns. The model moves from the empirical evidence of the deviation of the spatial location of companies from the uniform distribution and of employees from the distribution of firms. Factors leading to such deviations are taken into consideration. Specifically, we formalize a decision problem grounded on the economic attributes of the regions and also on the distribution of the existing firms and employees in the territory. To our purpose, the site-selection model is presented as a stepwise process.

Download Full-text

A Conceptual Probabilistic Framework for Annotation Aggregation of Citizen Science Data

Mathematics ◽

10.3390/math9080875 ◽

2021 ◽

Vol 9 (8) ◽

pp. 875

Author(s):

Jesus Cerquides ◽

Mehmet Oğuz Mülâyim ◽

Jerónimo Hernández-González ◽

Amudha Ravi Shankar ◽

Jose Luis Fernandez-Marquez

Keyword(s):

Data Quality ◽

Citizen Science ◽

Graphical Model ◽

Real Life ◽

Scientific Journals ◽

Probabilistic Framework ◽

Science Data ◽

Label Aggregation ◽

Evaluation Of Data ◽

Model Formalism

Over the last decade, hundreds of thousands of volunteers have contributed to science by collecting or analyzing data. This public participation in science, also known as citizen science, has contributed to significant discoveries and led to publications in major scientific journals. However, little attention has been paid to data quality issues. In this work we argue that being able to determine the accuracy of data obtained by crowdsourcing is a fundamental question and we point out that, for many real-life scenarios, mathematical tools and processes for the evaluation of data quality are missing. We propose a probabilistic methodology for the evaluation of the accuracy of labeling data obtained by crowdsourcing in citizen science. The methodology builds on an abstract probabilistic graphical model formalism, which is shown to generalize some already existing label aggregation models. We show how to make practical use of the methodology through a comparison of data obtained from different citizen science communities analyzing the earthquake that took place in Albania in 2019.

Download Full-text

A Probabilistic Framework for Chinese Spelling Check

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/2826234 ◽

2015 ◽

Vol 14 (4) ◽

pp. 1-17 ◽

Cited By ~ 3

Author(s):

Kuan-Yu Chen ◽

Hsin-Min Wang ◽

Hsin-Hsi Chen

Keyword(s):

Probabilistic Framework

Download Full-text