Small sample estimation of regression parameters in the three-variable linear model, with incomplete observations

Marcel G. Dagenais

doi:10.2307/3314692

Improving the Efficiency of Robust Estimators for the Generalized Linear Model

Stats ◽

10.3390/stats4010008 ◽

2021 ◽

Vol 4 (1) ◽

pp. 88-107

Author(s):

Alfio Marazzi

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimator ◽

Linear Model ◽

Negative Binomial ◽

Small Sample ◽

Hospital Length Of Stay ◽

Robust Estimator ◽

Negative Binomial Regression Model ◽

Likelihood Estimator ◽

Density Power Divergence

The distance constrained maximum likelihood procedure (DCML) optimally combines a robust estimator with the maximum likelihood estimator with the purpose of improving its small sample efficiency while preserving a good robustness level. It has been published for the linear model and is now extended to the GLM. Monte Carlo experiments are used to explore the performance of this extension in the Poisson regression case. Several published robust candidates for the DCML are compared; the modified conditional maximum likelihood estimator starting with a very robust minimum density power divergence estimator is selected as the best candidate. It is shown empirically that the DCML remarkably improves its small sample efficiency without loss of robustness. An example using real hospital length of stay data fitted by the negative binomial regression model is discussed.

Download Full-text

Comment on ‘Small sample GEE estimation of regression parameters for longitudinal data’

Statistics in Medicine ◽

10.1002/sim.7366 ◽

2017 ◽

Vol 36 (22) ◽

pp. 3596-3600 ◽

Cited By ~ 2

Author(s):

N. Lunardon ◽

D. Scharfstein

Keyword(s):

Longitudinal Data ◽

Small Sample ◽

Regression Parameters

Download Full-text

Small sample GEE estimation of regression parameters for longitudinal data

Statistics in Medicine ◽

10.1002/sim.6198 ◽

2014 ◽

Vol 33 (22) ◽

pp. 3869-3881 ◽

Cited By ~ 15

Author(s):

Sudhir Paul ◽

Xuemao Zhang

Keyword(s):

Longitudinal Data ◽

Small Sample ◽

Regression Parameters

Download Full-text

The small-sample properties of some preliminary test estimators in a linear model with autocorrelated errors

Journal of Econometrics ◽

10.1016/0304-4076(84)90036-8 ◽

1984 ◽

Vol 25 (1-2) ◽

pp. 49-61 ◽

Cited By ~ 21

Author(s):

W.E. Griffiths ◽

P.A.A. Beesley

Keyword(s):

Linear Model ◽

Preliminary Test ◽

Small Sample ◽

Small Sample Properties ◽

Autocorrelated Errors

Download Full-text

Small sample properties of modified Prais-Winston estimators in hypothesis testing in a linear model with ar(1) errors

Economics Letters ◽

10.1016/0165-1765(89)90265-6 ◽

1989 ◽

Vol 29 (2) ◽

pp. 147-152 ◽

Cited By ~ 4

Author(s):

Noriko Hashimoto

Keyword(s):

Hypothesis Testing ◽

Linear Model ◽

Small Sample ◽

Small Sample Properties

Download Full-text

Sequential point estimation of regression parameters in a linear model

Annals of the Institute of Statistical Mathematics ◽

10.1007/bf02491449 ◽

1987 ◽

Vol 39 (1) ◽

pp. 55-67 ◽

Cited By ~ 4

Author(s):

Ajit Chaturvedi

Keyword(s):

Linear Model ◽

Point Estimation ◽

Regression Parameters

Download Full-text

Species Distribution Modeling of American Beech (Fagus Grandifolia) Distribution in Southwest Ohio

International Journal of Applied Geospatial Research ◽

10.4018/ijagr.2017070102 ◽

2017 ◽

Vol 8 (3) ◽

pp. 16-36 ◽

Cited By ~ 1

Author(s):

Brandon Flessner ◽

Mary C. Henry ◽

Jerry Green

Keyword(s):

Linear Model ◽

Species Distribution ◽

Small Sample Size ◽

Basal Area ◽

Fagus Grandifolia ◽

Small Sample ◽

Environmental Data ◽

Continuous Variables ◽

American Beech ◽

Boosted Regression Tree

The ability to predict American beech distribution (Fagus grandifolia Ehrh.) from environmental data was tested by using a geographic information system (GIS) in tandem with species distribution models (SDMs). The study was conducted in Butler and Preble counties in Ohio, USA. Topography, soils, and disturbance were approximated through 15 predictor variables with presence/absence and basal area serving as the response variables. Using a generalized linear model (GLM) and a boosted regression tree (BRT) model, curvature, elevation, and tasseled cap greenness were shown to be significant predictors of beech presence. Each of these variables was positively related to beech presence. A linear model using presence only data was not effective in predicting basal area due to a small sample size. This study demonstrates that SDMs can be used successfully to advance one's understanding of the relationship between tree species presence and environmental factors. Large sample sizes are needed to successfully model continuous variables.

Download Full-text

On Small Sample Properties of the Wald, LR and LM Tests in a Linear Model with AR(1) Errors

Communications in Statistics - Simulation and Computation ◽

10.1080/03610919008812921 ◽

1990 ◽

Vol 19 (4) ◽

pp. 1361-1375

Author(s):

Hideo Kozumi

Keyword(s):

Linear Model ◽

Small Sample ◽

Lm Tests ◽

Small Sample Properties

Download Full-text

A Comparison of Different Methods for LAD Regression

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.143-144.1328 ◽

2010 ◽

Vol 143-144 ◽

pp. 1328-1331

Author(s):

Hai Jun Chen ◽

Xiao Ling Liu ◽

Ling Hui Liu

Keyword(s):

Least Squares ◽

Linear Model ◽

Least Squares Method ◽

Small Sample ◽

Least Absolute Deviation ◽

Absolute Deviation ◽

Simple Alternative ◽

Dual Forms

The least squares method is very sensitive to outliers, one of the simple alternative is the least absolute deviation, i.e. L1 regression, which is less sensitive to outliers, so which is more suitable the small sample and much noise situation. In this paper, the L1 problem of linear model is discussed, the previous work is reviewed systematically, different algorithms is compared, it is proved that the dual forms of different algorithms are the same.

Download Full-text

Reliability of Regression-Corrected Climate Forecasts

Journal of Climate ◽

10.1175/jcli-d-13-00565.1 ◽

2014 ◽

Vol 27 (9) ◽

pp. 3393-3404 ◽

Cited By ~ 10

Author(s):

Michael K. Tippett ◽

Timothy DelSole ◽

Anthony G. Barnston

Keyword(s):

Sample Size ◽

Climate Model ◽

Sampling Error ◽

Small Sample Size ◽

Small Sample ◽

Climate Forecast ◽

Regression Parameters ◽

Climate Forecasts ◽

Probability Forecasts ◽

The Impact

Abstract Regression is often used to calibrate climate model forecasts with observations. Reliability is an aspect of forecast quality that refers to the degree of correspondence between forecast probabilities and observed frequencies of occurrence. While regression-corrected climate forecasts are reliable in principle, the estimated regression parameters used in practice are affected by sampling error. The low skill and small sample sizes typically encountered in climate prediction imply substantial sampling error in the estimated regression parameters. Here the reliability of regression-corrected climate forecasts is analyzed for the case of joint-Gaussian distributed ensemble forecasts and observations with regression parameters estimated by least squares. Hypothesis testing of the regression parameters provides direct information about the skill and reliability of the uncorrected ensemble-based probability forecasts. However, the regression-corrected probability forecasts with estimated parameters are systematically “overconfident” because sampling error causes a positive bias in the regression forecast signal variance, despite the fact that the estimates of the regression parameters are themselves unbiased. An analytical description of the reliability diagram of a generic regression-corrected climate forecast is derived and is shown to depend on sample size and population correlation skill, with small sample size and low skill being factors that increase overconfidence. The analytical reliability estimate is shown to capture the effect of sampling error in synthetic data experiments and in a 29-yr dataset of NOAA Climate Forecast System version 2 predictions of seasonal precipitation totals over the Americas. The impact of sampling error on the reliability of regression-corrected forecast has been previously unrecognized and affects all regression-based forecasts. The use of regression parameters estimated by shrinkage methods such as ridge regression substantially reduces overconfidence.

Download Full-text