Objective Bayesian Inference in Probit Models with Intrinsic Priors Using Variational Approximations

Ang Li; Luis Pericchi; Kun Wang

doi:10.3390/e22050513

Objective Bayesian Inference in Probit Models with Intrinsic Priors Using Variational Approximations

Entropy ◽

10.3390/e22050513 ◽

2020 ◽

Vol 22 (5) ◽

pp. 513

Author(s):

Ang Li ◽

Luis Pericchi ◽

Kun Wang

Keyword(s):

Regression Model ◽

Binary Classification ◽

Mean Field ◽

Variational Approximation ◽

Classification Problems ◽

Probit Regression ◽

Variational Approximations ◽

Intrinsic Prior ◽

Inference Methods ◽

Bayesian Probit Regression

There is not much literature on objective Bayesian analysis for binary classification problems, especially for intrinsic prior related methods. On the other hand, variational inference methods have been employed to solve classification problems using probit regression and logistic regression with normal priors. In this article, we propose to apply the variational approximation on probit regression models with intrinsic prior. We review the mean-field variational method and the procedure of developing intrinsic prior for the probit regression model. We then present our work on implementing the variational Bayesian probit regression model using intrinsic prior. Publicly available data from the world’s largest peer-to-peer lending platform, LendingClub, will be used to illustrate how model output uncertainties are addressed through the framework we proposed. With LendingClub data, the target variable is the final status of a loan, either charged-off or fully paid. Investors may very well be interested in how predictive features like FICO, amount financed, income, etc. may affect the final loan status.

Download Full-text

Bayesian probit regression model for the diagnosis of pulmonary fibrosis: proof-of-principle

BMC Medical Genomics ◽

10.1186/1755-8794-4-70 ◽

2011 ◽

Vol 4 (1) ◽

Cited By ~ 45

Author(s):

Eric B Meltzer ◽

William T Barry ◽

Thomas A D'Amico ◽

Robert D Davis ◽

Shu S Lin ◽

...

Keyword(s):

Pulmonary Fibrosis ◽

Regression Model ◽

Probit Regression ◽

Bayesian Probit Regression ◽

Proof Of Principle

Download Full-text

Architecture Optimization Model for the Deep Neural Network For Binary Classification Problems

International Journal of Intelligent Computing and Information Sciences ◽

10.21608/ijicis.2020.18509.1008 ◽

2020 ◽

Vol 0 (0) ◽

pp. 0-0

Author(s):

Kingsley Ukaoha ◽

Efosa Igodan

Keyword(s):

Neural Network ◽

Optimization Model ◽

Deep Neural Network ◽

Binary Classification ◽

Classification Problems ◽

Architecture Optimization

Download Full-text

Credit Constraint and Rural Household Welfare in the Mezam Division of the North-West Region of Cameroon

Sustainability ◽

10.3390/su13115964 ◽

2021 ◽

Vol 13 (11) ◽

pp. 5964

Author(s):

Louis Atamja ◽

Sungjoon Yoo

Keyword(s):

Regression Model ◽

Rural Household ◽

Household Welfare ◽

Switching Regression ◽

Probit Regression ◽

Credit Constraint ◽

Endogenous Switching Regression ◽

North West ◽

The North ◽

West Region

The purpose of this study is to examine the effect of the rural household’s head and household characteristics on credit accessibility. This study also seeks to investigate how credit constraint affects rural household welfare in the Mezam division of the North-West region of Cameroon. Using data from a household survey questionnaire, we found that 36.88% of the households were credit-constrained, while 63.13% were unconstrained. A probit regression model was used to examine the determinants of households’ credit access, while an endogenous switching regression model was used to analyze the impact of credit constraint on household welfare. The results from the probit regression model indicate the importance of the farmer’s or trader’s organization membership, occupation, and savings to the household’s likelihood of being credit-constrained. On the other hand, a prediction from the endogenous switching regression model confirms that households with access to credit have a better standard of welfare than a constrained household. From the results, it is necessary for the government to subsidize microfinance institutions, so that they can take on the risk of offering credit to rural households.

Download Full-text

Improving Land Cover Classification Using Genetic Programming for Feature Construction

Remote Sensing ◽

10.3390/rs13091623 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1623

Author(s):

João E. Batista ◽

Ana I. R. Cabral ◽

Maria J. P. Vasconcelos ◽

Leonardo Vanneschi ◽

Sara Silva

Keyword(s):

Land Cover ◽

Genetic Programming ◽

Satellite Images ◽

State Of The Art ◽

Binary Classification ◽

Feature Construction ◽

Classification Problems ◽

Construction Methods ◽

Box Models

Genetic programming (GP) is a powerful machine learning (ML) algorithm that can produce readable white-box models. Although successfully used for solving an array of problems in different scientific areas, GP is still not well known in the field of remote sensing. The M3GP algorithm, a variant of the standard GP algorithm, performs feature construction by evolving hyperfeatures from the original ones. In this work, we use the M3GP algorithm on several sets of satellite images over different countries to create hyperfeatures from satellite bands to improve the classification of land cover types. We add the evolved hyperfeatures to the reference datasets and observe a significant improvement of the performance of three state-of-the-art ML algorithms (decision trees, random forests, and XGBoost) on multiclass classifications and no significant effect on the binary classifications. We show that adding the M3GP hyperfeatures to the reference datasets brings better results than adding the well-known spectral indices NDVI, NDWI, and NBR. We also compare the performance of the M3GP hyperfeatures in the binary classification problems with those created by other feature construction methods such as FFX and EFS.

Download Full-text

Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data

Biometrical Journal ◽

10.1002/bimj.200890000 ◽

2008 ◽

Vol 50 (1) ◽

pp. 109-109 ◽

Cited By ~ 1

Author(s):

Radu V. Craiu ◽

Thierry Duchesne ◽

Daniel Fortin

Keyword(s):

Logistic Regression ◽

Longitudinal Data ◽

Regression Model ◽

Logistic Regression Model ◽

Conditional Logistic Regression ◽

Conditional Logistic Regression Model ◽

Inference Methods

Download Full-text

Confidence interval for micro-averaged F1 and macro-averaged F1 scores

Applied Intelligence ◽

10.1007/s10489-021-02635-5 ◽

2021 ◽

Author(s):

Kanae Takahashi ◽

Kouji Yamamoto ◽

Aya Kuchiba ◽

Tatsuki Koyama

Keyword(s):

Binary Classification ◽

Classification Problem ◽

Classification Problems ◽

Summary Measure ◽

Medical Field ◽

Predictive Values ◽

Binary Classification Problem ◽

Multi Class Classification ◽

Sensitivity Specificity ◽

Measures Of Performance

AbstractA binary classification problem is common in medical field, and we often use sensitivity, specificity, accuracy, negative and positive predictive values as measures of performance of a binary predictor. In computer science, a classifier is usually evaluated with precision (positive predictive value) and recall (sensitivity). As a single summary measure of a classifier’s performance, F1 score, defined as the harmonic mean of precision and recall, is widely used in the context of information retrieval and information extraction evaluation since it possesses favorable characteristics, especially when the prevalence is low. Some statistical methods for inference have been developed for the F1 score in binary classification problems; however, they have not been extended to the problem of multi-class classification. There are three types of F1 scores, and statistical properties of these F1 scores have hardly ever been discussed. We propose methods based on the large sample multivariate central limit theorem for estimating F1 scores with confidence intervals.

Download Full-text

Comparing the performance of different neural networks for binary classification problems

2009 Eighth International Symposium on Natural Language Processing ◽

10.1109/snlp.2009.5340935 ◽

2009 ◽

Cited By ~ 25

Author(s):

P. Jeatrakul ◽

K.W. Wong

Keyword(s):

Neural Networks ◽

Binary Classification ◽

Classification Problems

Download Full-text

Gaussian Processes for Classification: Mean-Field Algorithms

Neural Computation ◽

10.1162/089976600300014881 ◽

2000 ◽

Vol 12 (11) ◽

pp. 2655-2684 ◽

Cited By ~ 91

Author(s):

Manfred Opper ◽

Ole Winther

Keyword(s):

Support Vector Machines ◽

Gaussian Processes ◽

Disordered Systems ◽

Binary Classification ◽

Computational Cost ◽

Mean Field ◽

Strong Support ◽

Support Vector ◽

Vector Machines ◽

Leave One Out

We derive a mean-field algorithm for binary classification with gaussian processes that is based on the TAP approach originally proposed in statistical physics of disordered systems. The theory also yields an approximate leave-one-out estimator for the generalization error, which is computed with no extra computational cost. We show that from the TAP approach, it is possible to derive both a simpler “naive” mean-field theory and support vector machines (SVMs) as limiting cases. For both mean-field algorithms and support vector machines, simulation results for three small benchmark data sets are presented. They show that one may get state-of-the-art performance by using the leave-one-out estimator for model selection and the built-in leave-one-out estimators are extremely precise when compared to the exact leave-one-out estimate. The second result is taken as strong support for the internal consistency of the mean-field approach.

Download Full-text

A New Cost Function for Binary Classification Problems Based on the Distributions of the Soft Output for Each Class

2007 International Joint Conference on Neural Networks ◽

10.1109/ijcnn.2007.4371174 ◽

2007 ◽

Cited By ~ 1

Author(s):

Marcelino Lazaro ◽

Jose M. Leiva-Murillo ◽

Antonio Artes-Rodriguez ◽

Anibal R. Figueiras-Vidal

Keyword(s):

Cost Function ◽

Binary Classification ◽

Classification Problems

Download Full-text

Handling binary classification problems with a priority class by using Support Vector Machines

Applied Soft Computing ◽

10.1016/j.asoc.2017.08.023 ◽

2017 ◽

Vol 61 ◽

pp. 661-669 ◽

Cited By ~ 10

Author(s):

L. Gonzalez-Abril ◽

C. Angulo ◽

H. Nuñez ◽

Y. Leal

Keyword(s):

Support Vector Machines ◽

Binary Classification ◽

Support Vector ◽

Classification Problems ◽

Priority Class ◽

Vector Machines ◽

A Priority

Download Full-text