Mixed models for count data

Keyword(s):  
2015 ◽  
Vol 57 (1) ◽  
pp. 1-19
Author(s):  
Norbert Mielenz ◽  
Joachim Spilke ◽  
Eberhard von Borell

Population-averaged and subject-specific models are available to evaluate count data when repeated observations per subject are present. The latter are also known in the literature as generalised linear mixed models (GLMM). In GLMM repeated measures are taken into account explicitly through random animal effects in the linear predictor. In this paper the relevant GLMMs are presented based on conditional Poisson or negative binomial distribution of the response variable for given random animal effects. Equations for the repeatability of count data are derived assuming normal distribution and logarithmic gamma distribution for the random animal effects. Using count data on aggressive behaviour events of pigs (barrows, sows and boars) in mixed-sex housing, we demonstrate the use of the Poisson »log-gamma intercept«, the Poisson »normal intercept« and the »normal intercept« model with negative binomial distribution. Since not all count data can definitely be seen as Poisson or negative-binomially distributed, questions of model selection and model checking are examined. Emanating from the example, we also interpret the least squares means, estimated on the link as well as the response scale. Options provided by the SAS procedure NLMIXED for estimating model parameters and for estimating marginal expected values are presented.


Author(s):  
Olumide S. Adesina ◽  
Dawud A. Agunbiade ◽  
Pelumi E. Oguntunde ◽  
Tolulope F. Adesina

It is important to fit count data with suitable model(s), models such as Poisson Regression, Quassi Poisson, Negative Binomial, to mention but a few have been adopted by researchers to fit zero truncated count data in the past. In recent times, dedicated models for fitting zero truncated count data have been developed, and they are considered sufficient. This study proposed Bayesian multi-level Poisson and Bayesian multi-level Geometric model, Bayesian Monte Carlo Markov Chain Generalized linear Mixed Models (MCMCglmms) of zero truncated Poisson and MCMCglmms Poisson regression model to fit health count data that is truncated at zero. Suitable model selection criteria were used to determine preferred models for fitting zero truncated data. Results obtained showed that Bayesian multi-level Poisson outperformed Bayesian multi-level Poisson Geometric model; also MCMCglmms of zero truncated Poisson outperformed MCMCglmms Poisson.


2018 ◽  
Author(s):  
Ulrich Knief ◽  
Wolfgang Forstmeier

AbstractWhen data are not normally distributed (e.g. skewed, zero-inflated, binomial, or count data) researchers are often uncertain whether it may be legitimate to use tests that assume Gaussian errors (e.g. regression, t-test, ANOVA, Gaussian mixed models), or whether one has to either model a more specific error structure or use randomization techniques.Here we use Monte Carlo simulations to explore the pros and cons of fitting Gaussian models to non-normal data in terms of risk of type I error, power and utility for parameter estimation.We find that Gaussian models are remarkably robust to non-normality over a wide range of conditions, meaning that P-values remain fairly reliable except for data with influential outliers judged at strict alpha levels. Gaussian models also perform well in terms of power and they can be useful for parameter estimation but usually not for extrapolation. Transformation of data before analysis is often advisable and visual inspection for outliers and heteroscedasticity is important for assessment. In strong contrast, some non-Gaussian models and randomization techniques bear a range of risks that are often insufficiently known. High rates of false-positive conclusions can arise for instance when overdispersion in count data is not controlled appropriately or when randomization procedures ignore existing non-independencies in the data.Overall, we argue that violating the normality assumption bears risks that are limited and manageable, while several more sophisticated approaches are relatively error prone and difficult to check during peer review. Hence, as long as scientists and reviewers are not fully aware of the risks, science might benefit from preferentially trusting Gaussian mixed models in which random effects account for non-independencies in the data in a transparent way.Tweetable abstractGaussian models are remarkably robust to even dramatic violations of the normality assumption.


2017 ◽  
Vol 18 (1) ◽  
Author(s):  
Xinyan Zhang ◽  
Himel Mallick ◽  
Zaixiang Tang ◽  
Lei Zhang ◽  
Xiangqin Cui ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document