Article

Count Data Regression Analysis: Concepts, Overdispersion Detection, Zero-inflation Identification, and Applications with R

Authors
  • Luiz Paulo Fávero (University of São Paulo)
  • Rafael de Freitas Souza (University of São Paulo)
  • Patrícia Belfiore (Federal University of ABC)
  • Hamilton Luiz Corrêa (University of São Paulo)
  • Michel F. C. Haddad (University of Cambridge)

Abstract

In this paper is proposed a straightforward model selection approach that indicates the most suitable count regression model based on relevant data characteristics. The proposed selection approach includes four of the most popular count regression models (i.e. Poisson, negative binomial, and respective zero-inflated frameworks). Moreover, it addresses two of the most relevant problems commonly found in real-world count datasets, namely overdispersion and zero-inflation. The entire selection approach may be performed using the programme language R, being all commands used throughout the paper availabe for practical purposes. It is worth mentioning that counting regression models are still not widespread within the social sciences.

Keywords: count data, count regression, model selection, overdispersion, zero-inflation

How to Cite:

Fávero, L., Souza, R. d., Belfiore, P., Corrêa, H. & Haddad, M. F., (2021) “Count Data Regression Analysis: Concepts, Overdispersion Detection, Zero-inflation Identification, and Applications with R”, Practical Assessment, Research, and Evaluation 26(1): 13. doi: https://doi.org/10.7275/44nn-cj68

Downloads:
Download PDF
View PDF

369 Views

87 Downloads

Published on
10 Jun 2021