An introductory guide to statistical analysis-generalized linear models for count data using R.

Yoshiko Shimono

doi:10.3719/weed.55.287

Evaluation and Comparison of Patterns of Maternal Complications Using Generalized Linear Models of Count Data Time Series

International Journal of Statistics in Medical Research ◽

10.6000/1929-6029.2019.08.05 ◽

2019 ◽

Vol 8 ◽

pp. 32-39

Author(s):

Collins Odhiambo ◽

◽

Freda Kinoti

Keyword(s):

Time Series ◽

Generalized Linear Models ◽

Count Data ◽

Linear Models ◽

Maternal Complications ◽

Count Data Time Series

Download Full-text

Analyzing over-dispersed count data in two-way cross-classification problems using generalized linear models

Journal of Statistical Computation and Simulation ◽

10.1080/00949659908811956 ◽

1999 ◽

Vol 63 (3) ◽

pp. 263-281 ◽

Cited By ~ 3

Author(s):

Nancy L. Campbell ◽

Linda J. Young ◽

George A. Capuano

Keyword(s):

Generalized Linear Models ◽

Count Data ◽

Linear Models ◽

Classification Problems ◽

Cross Classification

Download Full-text

More generalized linear modelling.

Practical R for biologists: an introduction ◽

10.1079/9781789245349.0171 ◽

2021 ◽

pp. 171-186

Author(s):

Donald Quicke ◽

Buntika A. Butcher ◽

Rachel Kruft Welton

Keyword(s):

Generalized Linear Models ◽

Count Data ◽

Binary Data ◽

Linear Models ◽

Explanatory Variable ◽

Response Variable ◽

Explanatory Variables ◽

Continuous Response ◽

Generalized Linear Modelling ◽

Linear Modelling

Abstract This chapter employs generalized linear modelling using the function glm when we know that variances are not constant with one or more explanatory variables and/or we know that the errors cannot be normally distributed, for example, they may be binary data, or count data where negative values are impossible, or proportions which are constrained between 0 and 1. A glm seeks to determine how much of the variation in the response variable can be explained by each explanatory variable, and whether such relationships are statistically significant. The data for generalized linear models take the form of a continuous response variable and a combination of continuous and discrete explanatory variables.

Download Full-text

Infection of Baltic herring (Clupea harengus membras) with Anisakis simplex larvae, 1992–1999: a statistical analysis using generalized linear models

ICES Journal of Marine Science ◽

10.1006/jmsc.2002.1323 ◽

2003 ◽

Vol 60 (1) ◽

pp. 85-93 ◽

Cited By ~ 22

Author(s):

M Podolska

Keyword(s):

Statistical Analysis ◽

Generalized Linear Models ◽

Linear Models ◽

Anisakis Simplex ◽

Clupea Harengus ◽

Baltic Herring

Download Full-text

Changes in the distribution of Poplar Box (Eucalyptus populnea) on major soil groups: An application of the log-linear model.

The Rangeland Journal ◽

10.1071/rj9810033 ◽

1981 ◽

Vol 3 (1) ◽

pp. 33 ◽

Cited By ~ 1

Author(s):

RB Cunningham ◽

AA Webb ◽

A Mortlock

Keyword(s):

Statistical Analysis ◽

Linear Model ◽

Generalized Linear Models ◽

Gradient Analysis ◽

Linear Models ◽

Geographic Location ◽

Significance Tests ◽

Log Linear ◽

Eucalyptus Populnea ◽

Soil Groups

The association of poplar box (Eucalyptus populnea) with five main soil groups is examined. A statistical analysis, using a log- linear model, indicated that the relative frequencies of poplar box sites occumng on major soil groups changed with geographic location. The change in distribution is shown to relate to climate, as indicated by summer and winter moisture indices and the diff- erence between them. This study illustrates the use of log-linear models in ecology; such models, and more generally, Generalized Linear Models, in providing significance tests, have advantages over the non-statistical methods of gradient analysis.

Download Full-text

A novel statistical analysis of cnidocysts in acontiarian sea anemones (Cnidaria, Actiniaria) using generalized linear models with gamma errors

Zoologischer Anzeiger ◽

10.1016/j.jcz.2004.06.002 ◽

2004 ◽

Vol 243 (1-2) ◽

pp. 47-52 ◽

Cited By ~ 11

Author(s):

Fabián H. Acuña ◽

Lila Ricci ◽

Adriana C. Excoffon ◽

Mauricio O. Zamponi

Keyword(s):

Statistical Analysis ◽

Generalized Linear Models ◽

Linear Models ◽

Sea Anemones

Download Full-text

A Statistical Analysis of the Lake Levels at Lake Neusiedl

Austrian Journal of Statistics ◽

10.17713/ajs.v37i2.296 ◽

2016 ◽

Vol 37 (2) ◽

Cited By ~ 2

Author(s):

Johannes Ledolter

Keyword(s):

Statistical Analysis ◽

Generalized Linear Models ◽

Lake Level ◽

Linear Models ◽

Daily Precipitation ◽

Lake Levels ◽

Lake Level Changes ◽

Daily Data ◽

Eastern Border ◽

Logistic Regressions

A long record of daily data is used to study the lake levels of Lake Neusiedl, a large steppe lake at the eastern border of Austria. Daily lake level changes are modeled as functions of precipitation, temperature, and wind conditions. The occurrence and the amount of daily precipitation are modeled with logistic regressions and generalized linear models.

Download Full-text

An introductory guide to statistical analysis-generalized linear models for proportion data using R.

Journal of Weed Science and Technology ◽

10.3719/weed.55.275 ◽

2010 ◽

Vol 55 (4) ◽

pp. 275-286 ◽

Cited By ~ 5

Author(s):

Toshiyuki Imaizumi

Keyword(s):

Statistical Analysis ◽

Generalized Linear Models ◽

Linear Models

Download Full-text

Generalized Linear Models for Count Data

Discrete Data Analysis with R ◽

10.1201/b19022-14 ◽

2015 ◽

pp. 429-504

Author(s):

Michael Friendly ◽

David Meyer ◽

Achim Zeileis

Keyword(s):

Generalized Linear Models ◽

Count Data ◽

Linear Models

Download Full-text

glmGamPoi: Fitting Gamma-Poisson Generalized Linear Models on Single Cell Count Data

Bioinformatics ◽

10.1093/bioinformatics/btaa1009 ◽

2020 ◽

Author(s):

Constantin Ahlmann-Eltze ◽

Wolfgang Huber

Keyword(s):

Single Cell ◽

Poisson Distribution ◽

Generalized Linear Models ◽

Count Data ◽

Linear Models ◽

Differential Expression Analysis ◽

Source Code ◽

Principal Component ◽

R Package ◽

Single Cell Rna Sequencing

Abstract Motivation The Gamma-Poisson distribution is a theoretically and empirically motivated model for the sampling variability of single cell RNA-sequencing counts (Grün et al., 2014; Svensson, 2020; Silverman et al., 2018; Hafemeister and Satija, 2019) and an essential building block for analysis approaches including differential expression analysis (Robinson et al., 2010; McCarthy et al., 2012; Anders and Huber, 2010; Love et al., 2014), principal component analysis (Townes et al., 2019) and factor analysis (Risso et al., 2018). Existing implementations for inferring its parameters from data often struggle with the size of single cell datasets, which can comprise millions of cells; at the same time, they do not take full advantage of the fact that zero and other small numbers are frequent in the data. These limitations have hampered uptake of the model, leaving room for statistically inferior approaches such as logarithm(-like) transformation. Results We present a new R package for fitting the Gamma-Poisson distribution to data with the characteristics of modern single cell datasets more quickly and more accurately than existing methods. The software can work with data on disk without having to load them into RAM simultaneously. Availability The package glmGamPoi is available from Bioconductor for Windows, macOS, and Linux, and source code is available on github.com/const-ae/glmGamPoi under a GPL-3 license.

Download Full-text