Regression Models for Count Data in R

Achim Zeileis, Christian Kleiber, Simon Jackman

Publication: Working/Discussion PaperWU Working Paper

81 Downloads (Pure)

Abstract

The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After reviewing the conceptual and computational features of these methods, a new implementation of zero-inflated and hurdle regression models in the functions zeroinfl() and hurdle() from the package pscl is introduced. It re-uses design and functionality of the basic R functions just as the underlying conceptual tools extend the classical models. Both model classes are able to incorporate over-dispersion and excess zeros - two problems that typically occur in count data sets in economics and the social and political sciences - better than their classical counterparts. Using cross-section data on the demand for medical care, it is illustrated how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice. (author's abstract)

Publication series

SeriesResearch Report Series / Department of Statistics and Mathematics
Number53

Bibliographical note

Earlier version

WU Working Paper Series

  • Research Report Series / Department of Statistics and Mathematics

Cite this