Effects of Monotone and Nonmonotone Attrition on Parameter Estimates in Regression Models with Educational Data: Demographic Effects on Achievement, Aspirations, and Attitudes

This article presents a method and program for identifying poorly fitting observations for maximum-likelihood regression models for categorical dependent variables. After estimating a model, the program leastlikely will list the observations that have the lowest predicted probabilities of observing the value of the outcome category that was actually observed. For example, when run after estimating a binary logistic regression model, leastlikely will list the observations with a positive outcome that had the lowest predicted probabilities of a positive outcome and the observations with a negative outcome that had the lowest predicted probabilities of a negative outcome. These can be considered the observations in which the outcome is most surprising given the values of the independent variables and the parameter estimates and, like observations with large residuals in ordinary least squares regression, may warrant individual inspection. Use of the program is illustrated with examples using binary and ordered logistic regression.
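The selection rule described in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the leastlikely program itself: the function name `least_likely`, its arguments, and the sample probabilities are all hypothetical, and the predicted probabilities are assumed to come from a model that has already been fitted.

```python
def least_likely(outcomes, probs, n=5):
    """Return, for each observed outcome category, the indices of the n
    observations whose predicted probability of that category is lowest.

    outcomes: list of observed category labels (hypothetical input)
    probs:    list of dicts mapping each category to its predicted probability
    """
    by_outcome = {}
    for idx, (y, p) in enumerate(zip(outcomes, probs)):
        # Keep the probability the model assigned to what actually happened.
        by_outcome.setdefault(y, []).append((p[y], idx))
    # Sort each group by that probability (ascending) and keep the first n.
    return {y: [idx for _, idx in sorted(rows)[:n]]
            for y, rows in by_outcome.items()}


# Made-up binary example: predicted probability of a positive outcome.
outcomes = [1, 1, 0, 0, 1]
p_positive = [0.9, 0.2, 0.1, 0.7, 0.6]
probs = [{1: p, 0: 1 - p} for p in p_positive]
result = least_likely(outcomes, probs, n=2)
```

In this made-up example, observation 1 (a positive outcome predicted with probability 0.2) and observation 3 (a negative outcome predicted with probability 0.3) are the most surprising cases in their respective groups.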

A Comparison Study on the Parameter Estimates and Prediction Performance of Penalized Regression Models for the Multilevel Data

Korean Society for Educational Evaluation ◽

10.31158/jeev.2021.34.1.183 ◽

2021 ◽

Vol 34 (1) ◽

pp. 183-205

Author(s):

Hyewon Chung ◽

Soyoung Park

Keyword(s):

Regression Models ◽

Penalized Regression ◽

Prediction Performance ◽

Parameter Estimates ◽

Comparison Study ◽

Multilevel Data

Effect of Berkson measurement error on parameter estimates in Cox regression models

Lifetime Data Analysis ◽

10.1007/s10985-007-9036-2 ◽

2007 ◽

Vol 13 (2) ◽

pp. 261-272 ◽

Cited By ~ 16

Author(s):

Helmut Küchenhoff ◽

Ralf Bender ◽

Ingo Langner

Keyword(s):

Measurement Error ◽

Regression Models ◽

Cox Regression ◽

Parameter Estimates

A Regionalised Neyman-Scott Model of Rainfall with Convective and Stratiform Cells

Hydrology and Earth System Sciences ◽

10.5194/hess-1-71-1997 ◽

1997 ◽

Vol 1 (1) ◽

pp. 71-80 ◽

Cited By ~ 14

Author(s):

P. S. P. Cowpertwait ◽

P. E. O'Connell

Keyword(s):

Cluster Model ◽

Regression Models ◽

Parameter Estimates ◽

Model Parameter ◽

Lack Of Fit ◽

Dependent Variables ◽

The UK ◽

Poisson Cluster ◽

Residual Errors ◽

Urban Sites

Abstract. A single-site Neyman-Scott Poisson cluster model of rainfall, with convective and stratiform cells, is fitted to data for 112 sites scattered throughout the UK using harmonic variables to account for seasonality. The model is regionalised by regressing the estimates of the harmonic variables on site dependent variables (e.g. altitude) to enable rainfall to be simulated at any ungauged site in the UK. An assessment of the residual errors indicates that the regression models can be used with reasonable confidence for urban sites. Furthermore, the regional variations of the model parameter estimates are found to be in agreement with meteorological knowledge and observation. Simulated 1 h extreme rainfalls are found to compare favourably with observed historical values, although some lack-of-fit is evident for higher aggregation levels.

Parameter estimates in regression models

IEEE Transactions on Automatic Control ◽

10.1109/tac.1964.1105785 ◽

1964 ◽

Vol 9 (4) ◽

pp. 589-589 ◽

Cited By ~ 3

Author(s):

V. Levadi

Keyword(s):

Regression Models ◽

Parameter Estimates

Testing for Consistency using Artificial Regressions

Econometric Theory ◽

10.1017/s0266466600012573 ◽

1989 ◽

Vol 5 (3) ◽

pp. 363-384 ◽

Cited By ~ 90

Author(s):

Russell Davidson ◽

James G. MacKinnon

Keyword(s):

Linear Regression ◽

Regression Models ◽

Information Matrix ◽

Binary Choice ◽

Parameter Estimates ◽

Linear Regression Models ◽

Score Tests ◽

Binary Choice Models ◽

Lagrange Multiplier Tests ◽

Information Matrix Test

We consider several issues related to Durbin-Wu-Hausman tests; that is, tests based on the comparison of two sets of parameter estimates. We first review a number of results about these tests in linear regression models, discuss what determines their power, and propose a simple way to improve power in certain cases. We then show how in a general nonlinear setting they may be computed as “score” tests by means of slightly modified versions of any artificial linear regression that can be used to calculate Lagrange multiplier tests, and explore some of the implications of this result. In particular, we show how to create a variant of the information matrix test that tests for parameter consistency. We examine the conventional information matrix test and our new version in the context of binary-choice models, and provide a simple way to compute both tests using artificial regressions.

Modelling and analysis of incomplete and short lactations

Animal Science ◽

10.1017/s1357729800053273 ◽

2003 ◽

Vol 76 (1) ◽

pp. 19-25 ◽

Cited By ~ 1

Author(s):

F Jaffrézic ◽

P Minini

Keyword(s):

Regression Models ◽

Ad Hoc ◽

Real Data ◽

Random Regression ◽

Accurate Estimation ◽

Parameter Estimates ◽

Longitudinal Models ◽

Lactation Length ◽

Cumulative Production ◽

Daily Milk

Abstract. Advantages of the use of test-day records for genetic evaluation of dairy cattle are now widely accepted. In particular, longitudinal models such as random regression avoid using ad hoc extrapolation procedures to reconstruct complete lactations as they provide individual predictions even for incomplete data. However, these predictions and parameter estimates obtained in the model do not take into account the lactation length. This can be an important drawback for phenotypic and genetic analysis of milk production of cows with shorter lactations. The aim of this paper is to propose a methodology that would correct these predictions, weighting them by the probability at each point in time of each cow being dried off. The proposed procedure is easy to implement and calculations are fast to compute. A simulation study and an application on real data for daily milk records show that the proposed methodology provides a more accurate estimation for individual cumulative production as well as genetic values, and avoids predicting negative productions at the end of the lactation as is often the case with random regression models.

Measuring Bandwidth Uncertainty in Multiscale Geographically Weighted Regression Using Akaike Weights

10.31219/osf.io/h2qmy ◽

2019 ◽

Author(s):

Ziqi Li ◽

Alexander Stewart Fotheringham ◽

Taylor M. Oshan ◽

Levi John Wolf

Keyword(s):

Geographically Weighted Regression ◽

Regression Models ◽

Bandwidth Selection ◽

Information Criterion ◽

Weighted Regression ◽

Parameter Estimates ◽

Multi Scale ◽

Single Scale ◽

Heterogeneous Processes ◽

Key Parameter

Bandwidth, a key parameter in geographically weighted regression models, is closely related to the spatial scale at which the underlying spatially heterogeneous processes being examined take place. Generally, a single optimal bandwidth (geographically weighted regression) or a set of covariate-specific optimal bandwidths (multiscale geographically weighted regression) is chosen based on some criterion such as the Akaike Information Criterion (AIC), and then parameter estimation and inference are conditional on the choice of this bandwidth. In this paper, we find that bandwidth selection is subject to uncertainty in both single-scale and multi-scale geographically weighted regression models and demonstrate that this uncertainty can be measured and accounted for. Based on simulation studies and an empirical example of obesity rates in Phoenix, we show that bandwidth uncertainties can be quantitatively measured by Akaike weights, and confidence intervals for bandwidths can be obtained. Understanding bandwidth uncertainty offers important insights about the scales over which different processes operate, especially when comparing covariate-specific bandwidths. Additionally, unconditional parameter estimates that account for bandwidth selection uncertainty can be computed based on Akaike weights.
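For readers unfamiliar with Akaike weights, the sketch below shows the standard definition: each candidate's AIC difference from the best candidate is converted to a relative likelihood, and the relative likelihoods are normalised to sum to one. It is not the authors' implementation; the function names, the candidate AIC values, and the per-bandwidth coefficient estimates are hypothetical and chosen only for illustration.

```python
import math


def akaike_weights(aic_values):
    """Convert a list of AIC values into Akaike weights that sum to 1."""
    best = min(aic_values)
    # Relative likelihood of each candidate given its AIC difference.
    rel = [math.exp(-0.5 * (a - best)) for a in aic_values]
    total = sum(rel)
    return [r / total for r in rel]


def averaged_estimate(estimates, weights):
    """Weight-averaged (unconditional) estimate across candidates."""
    return sum(w * b for w, b in zip(weights, estimates))


# Hypothetical AIC values for three candidate bandwidths, with a
# hypothetical coefficient estimate obtained under each bandwidth.
aics = [100.0, 102.0, 110.0]
betas = [0.50, 0.40, 0.10]
weights = akaike_weights(aics)
beta_avg = averaged_estimate(betas, weights)
```

Candidates with similar AIC values receive similar weights, so a wide spread of weight across bandwidths signals that the selected bandwidth is uncertain.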

Methods to verify parameter equality in nonlinear regression models

Scientia Agricola ◽

10.1590/s0103-90162010000200014 ◽

2010 ◽

Vol 67 (2) ◽

pp. 218-222 ◽

Cited By ~ 3

Author(s):

Lídia Raquel de Carvalho ◽

Sheila Zambello de Pinho ◽

Martha Maria Mischan

Keyword(s):

Nonlinear Regression ◽

Regression Models ◽

Growth Curves ◽

Original Data ◽

Minimum Variance ◽

Ratio Method ◽

Parameter Estimates ◽

Nonlinear Regression Models ◽

Sample Data ◽

Parameter Estimators

In biological experiments in which growth curves are fitted to sample data, treatments applied to the experimental material can affect the parameter estimates. In these cases the interest is in comparing the growth functions in order to distinguish treatments. Three methods that verify the equality of parameters in nonlinear regression models were compared: (i) developed by Carvalho in 1996, performing ANOVA on estimates of parameters of individual fits; (ii) suggested by Regazzi in 2003, using the likelihood ratio method; and (iii) constructing a pooled variance from individual variances. The parametric tests, F and Tukey, were employed when the parameter estimators came close to exhibiting the properties of linear model estimators, that is, unbiasedness, normal distribution and minimum variance. The first and second methods presented similar results, but the third method is simpler in calculations and uses all information contained in the original data.

Linear regression models for biomass table construction, using cluster samples

Canadian Journal of Forest Research ◽

10.1139/x89-103 ◽

1989 ◽

Vol 19 (5) ◽

pp. 664-673 ◽

Cited By ~ 1

Author(s):

Andrew J. R. Gillespie ◽

Tiberius Cunia

Keyword(s):

Least Squares ◽

Regression Models ◽

Predictor Variable ◽

Ordinary Least Squares ◽

Cluster Sampling ◽

Parameter Estimates ◽

Least Squares Regression ◽

Linear Regression Models ◽

Estimation Procedures ◽

Biased Estimates

Biomass tables are often constructed from cluster samples by means of ordinary least squares regression estimation procedures. These procedures assume that sample observations are uncorrelated, which ignores the intracluster correlation of cluster samples and results in underestimates of the model error. We tested alternative estimation procedures by simulation under a variety of cluster sampling methods, to determine combinations of sampling and estimation procedures that yield accurate parameter estimates and reliable estimates of error. Modified, generalized, and jack-knife least squares procedures gave accurate parameter and error estimates when sample trees were selected with equal probability. Regression models that did not include height as a predictor variable yielded biased parameter estimates when sample trees were selected with probability proportional to tree size. Models that included height did not yield biased estimates. There was no discernible gain in precision associated with sampling with probability proportional to size. Random coefficient regressions generally gave biased point estimates with poor precision, regardless of sampling method.
