Regression Models for Count Data in R

Achim Zeileis, Christian Kleiber, Simon Jackman

Publication: Scientific journalJournal articlepeer-review

3 Downloads (Pure)

Abstract

The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After reviewing the conceptual and computational features of these methods, a new implementation of hurdle and zero-inflated regression models in the functions hurdle() and zeroinfl() from the package pscl is introduced. It re-uses design and functionality of the basic R functions just as the underlying conceptual tools extend the classical models. Both hurdle and zero-inflated model, are able to incorporate over-dispersion and excess zeros-two problems that typically occur in count data sets in economics and the social sciences-better than their classical counterparts. Using cross-section data on the demand for medical care, it is illustrated how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice.
Original languageEnglish
Pages (from-to)1 - 25
JournalJournal of Statistical Software
Volume27
Issue number8
DOIs
Publication statusPublished - 2008

Bibliographical note

Updated version

Cite this