Normalized Information Criteria and Model Selection in the Presence of Missing Data

Mathematics, 2021, Vol 9 (19), pp. 2474
Author(s): Nitzan Cohen, Yakir Berchenko

Information criteria such as the Akaike information criterion (AIC) and Bayesian information criterion (BIC) are commonly used for model selection. However, the current theory does not support unconventional data, so naive use of these criteria is not suitable for data with missing values. Imputation, at the core of most alternative methods, both distorts the analysis and is computationally demanding. We propose a new approach that enables the use of classic, well-known information criteria for model selection when there are missing data. We adapt the current theory of information criteria through normalization, accounting for the different sample sizes used for each candidate model (focusing on AIC and BIC). Interestingly, when the sample sizes differ, our theoretical analysis finds that AIC_j/n_j is the proper correction of AIC_j that we need to optimize (where n_j is the sample size available to the j-th model), while −(BIC_j − BIC_i)/(n_j − n_i) is the corresponding correction of BIC. Furthermore, we find that the computational complexity of normalized information criteria methods is exponentially better than that of imputation methods. In a series of simulation studies, we find that normalized-AIC and normalized-BIC outperform previous methods (i.e., normalized-AIC is more efficient, and normalized-BIC includes only important variables, although it tends to exclude some of them when correlations are large). We propose three additional methods aimed at increasing the statistical efficiency of normalized-AIC: post-selection imputation, Akaike sub-model averaging, and minimum-variance averaging. The latter succeeds in increasing efficiency further.
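The normalization above is straightforward to compute. As a minimal sketch (assuming Gaussian linear models fit by ordinary least squares; the helper names are illustrative, not from the paper), each candidate model is fit on its own available complete cases and compared on the per-observation scale AIC_j/n_j:

```python
import numpy as np

def gaussian_aic(y, X):
    """AIC for an OLS fit with Gaussian errors (variance profiled out)."""
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / n
    loglik = -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)
    return 2 * (k + 1) - 2 * loglik  # k coefficients plus the variance

def normalized_aic(y, X):
    """AIC_j / n_j: comparable across models fit on different sample sizes."""
    return gaussian_aic(y, X) / len(y)
```

Raw AIC values from fits on different n_j are not comparable; dividing by the sample size available to each model puts them on a common per-observation scale.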

Economies, 2020, Vol 8 (2), pp. 49
Author(s): Waqar Badshah, Mehmet Bulut

The Bounds test of cointegration uses only unstructured single-path model selection techniques, i.e., information criteria, for model selection. The aim of this paper was twofold: first, to evaluate the performance of five routinely used information criteria {Akaike Information Criterion (AIC), Akaike Information Criterion Corrected (AICC), Schwarz/Bayesian Information Criterion (SIC/BIC), Schwarz/Bayesian Information Criterion Corrected (SICC/BICC), and Hannan and Quinn Information Criterion (HQC)} and three structured approaches (Forward Selection, Backward Elimination, and Stepwise) by assessing their size and power properties at different sample sizes based on Monte Carlo simulations; and second, to make the same assessment on real economic data. The second aim was achieved by evaluating the long-run relationship between three pairs of macroeconomic variables, i.e., Energy Consumption and GDP, Oil Price and GDP, and Broad Money and GDP, for the BRICS (Brazil, Russia, India, China and South Africa) countries using the Bounds cointegration test. It was found that the information criteria and the structured procedures have the same power for sample sizes of 50 or greater; however, BICC and Stepwise are better at small sample sizes. In light of the simulation and real-data results, a modified Bounds test with the Stepwise model selection procedure may be used, as it is strongly supported theoretically and avoids noise in the model selection process.


2021, Vol 20 (3), pp. 450-461
Author(s): Stanley L. Sclove

Abstract The use of information criteria, especially AIC (Akaike's information criterion) and BIC (Bayesian information criterion), for choosing an adequate number of principal components is illustrated.


2012, Vol 18 (65), pp. 323
Author(s): جنان عباس ناصر

In this study, we compare the traditional information criteria (AIC, SIC, HQ, FPE) with the Modified Divergence Information Criterion (MDIC), which is used to determine the order of the autoregressive (AR) model for the data-generating process. We use simulation, generating data from several autoregressive models in which the error term is normally distributed with different values of its parameters (mean and variance), and for different sample sizes.
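For context, classic criterion-based AR order selection can be sketched as follows (plain AIC with conditional least squares; the MDIC itself is not implemented here, and the function names are illustrative):

```python
import numpy as np

def ar_aic(x, p):
    """AIC of an AR(p) model fit by conditional least squares."""
    n = len(x)
    Y = x[p:]
    # Design matrix: intercept plus lags 1..p, aligned with x[p:]
    X = np.column_stack([np.ones(n - p)] +
                        [x[p - k : n - k] for k in range(1, p + 1)])
    beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
    r = Y - X @ beta
    s2 = r @ r / len(Y)
    # Gaussian log-likelihood up to a constant; p coefs + intercept + variance
    return len(Y) * np.log(s2) + 2 * (p + 2)

def select_ar_order(x, max_p=6):
    """Return the order p in 1..max_p that minimizes AIC."""
    return min(range(1, max_p + 1), key=lambda p: ar_aic(x, p))
```

Each criterion in the study (AIC, SIC, HQ, FPE, MDIC) substitutes its own penalty for the `2 * (p + 2)` term; the selection loop itself is unchanged.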


2019, Vol 37 (2), pp. 549-562
Author(s): Edward Susko, Andrew J Roger

Abstract The information criteria Akaike information criterion (AIC), AICc, and Bayesian information criterion (BIC) are widely used for model selection in phylogenetics; however, their theoretical justification and performance have not been carefully examined in this setting. Here, we investigate these methods under simple and complex phylogenetic models. We show that AIC can give a biased estimate of its intended target, the expected predictive log likelihood (EPLnL) or, equivalently, the expected Kullback–Leibler divergence between the estimated model and the true distribution for the data. Reasons for bias include commonly occurring issues such as small edge lengths or, in mixture models, small weights. The use of partitioned models is another issue that can cause problems for information criteria. We show that for partitioned models, a different BIC correction is required for it to be a valid approximation to a Bayes factor. The commonly used AICc correction is not clearly defined in partitioned models and can actually create a substantial bias when the number of parameters grows large, as is the case with larger trees and partitioned models. Bias-corrected cross-validation corrections are shown to provide better approximations to EPLnL than AIC. We also illustrate how EPLnL, the estimation target of AIC, can sometimes favor an incorrect model, and we give reasons why selection of incorrectly under-partitioned models might be desirable in partitioned model settings.


2006, Vol 27 (2), pp. 169-180
Author(s): Marc Mazerolle

Abstract In ecology, researchers frequently use observational studies to explain a given pattern, such as the number of individuals in a habitat patch, with a large number of explanatory (i.e., independent) variables. To elucidate such relationships, ecologists have long relied on hypothesis testing to include or exclude variables in regression models, although the conclusions often depend on the approach used (e.g., forward, backward, or stepwise selection). Although better tools surfaced in the mid-1970s, they remain underutilized in certain fields, particularly in herpetology. This is the case for the Akaike information criterion (AIC), which is remarkably superior for model selection (i.e., variable selection) to hypothesis-based approaches. It is simple to compute and easy to understand but, more importantly, for a given data set it provides a measure of the strength of evidence for each model that represents a plausible biological hypothesis, relative to the entire set of models considered. Using this approach, one can then compute a weighted average of the estimate and standard error for any given variable of interest across all the models considered. This procedure, termed model averaging or multimodel inference, yields precise and robust estimates. In this paper, I illustrate the use of the AIC in model selection and inference, as well as the interpretation of results analysed in this framework, with two real herpetological data sets. The AIC and measures derived from it should be routinely adopted by herpetologists.
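The model-averaging step described here rests on Akaike weights, w_i ∝ exp(−Δ_i/2), where Δ_i is each model's AIC difference from the best model in the set. A minimal sketch (helper names are illustrative):

```python
import numpy as np

def akaike_weights(aic_values):
    """Model weights from AIC differences: w_i proportional to exp(-delta_i / 2)."""
    aic = np.asarray(aic_values, dtype=float)
    delta = aic - aic.min()          # Δ_i relative to the best model
    w = np.exp(-0.5 * delta)
    return w / w.sum()               # normalize so the weights sum to 1

def model_averaged_estimate(estimates, aic_values):
    """Multimodel inference: AIC-weighted average of a parameter estimate."""
    w = akaike_weights(aic_values)
    return float(np.dot(w, estimates))
```

Models within roughly 2 AIC units of the best receive substantial weight, so the averaged estimate is not dominated by a single model.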


2006, Vol 45 (01), pp. 44-50
Author(s): N. H. Augustin, W. Sauerbrei, N. Holländer

Summary. Objectives: We illustrate a recently proposed two-step bootstrap model averaging (bootstrap MA) approach to cope with model selection uncertainty. The predictive performance is investigated in an example and in a simulation study. Results are compared to those derived from other model selection methods. Methods: In the framework of the linear regression model we use the two-step bootstrap MA, which consists of a screening step to eliminate covariates thought to have no influence on the response, and a model-averaging step. We also apply the full model, variable selection using backward elimination based on Akaike’s Information Criterion (AIC), the Bayes Information Criterion (BIC) and the bagging approach. The predictive performance is measured by the mean squared error (MSE) and the coverage of confidence intervals for the true response. Results: We obtained similar results for all approaches in the example. In the simulation the MSE was reduced by all approaches in comparison to the full model. The smallest values are obtained for bootstrap MA. Only the bootstrap MA and the full model correctly estimated the nominal coverage. The backward elimination procedures led to substantial underestimation and bagging to an overestimation of the true coverage. The screening step of bootstrap MA eliminates most of the unimportant factors. Conclusion: The new bootstrap MA approach shows promising results for predictive performance. It increases practical usefulness by eliminating unimportant factors in the screening step.
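The screening step can be illustrated with a simplified variant (this marginal-screening sketch keeps a covariate if it improves AIC over the intercept-only model in a sufficient fraction of bootstrap resamples; the authors' procedure selects within each resample by backward elimination, so the names and threshold here are illustrative):

```python
import numpy as np

def ols_aic(y, X):
    """AIC of an OLS fit, up to an additive constant."""
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    r = y - X @ beta
    s2 = r @ r / n
    return n * np.log(s2) + 2 * (k + 1)

def screen_covariates(y, X, n_boot=100, threshold=0.4, seed=0):
    """Screening step (simplified): keep covariate j if adding it to the
    intercept-only model lowers AIC in at least `threshold` of resamples."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    hits = np.zeros(p)
    for _ in range(n_boot):
        idx = rng.integers(0, n, n)          # bootstrap resample indices
        yb, Xb = y[idx], X[idx]
        ones = np.ones((n, 1))
        base = ols_aic(yb, ones)
        for j in range(p):
            if ols_aic(yb, np.column_stack([ones, Xb[:, [j]]])) < base:
                hits[j] += 1
    return hits / n_boot >= threshold        # boolean mask of kept covariates
```

Covariates surviving the screen would then be carried into the model-averaging step; the rest are eliminated before any averaging takes place.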

