
The k-ZIG: flexible modeling for zero-inflated counts. (English) Zbl 1271.62044

Summary: Many applications involve count data from a process that yields an excess number of zeros. Zero-inflated count models, in particular zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) models, along with Poisson hurdle models, are commonly used to address this problem. However, these models struggle to explain an extreme incidence of zeros (say, more than 80%), and in particular to identify important covariates. In fact, the ZIP may struggle even when the proportion of zeros is not extreme. To redress this problem, we propose the class of k-ZIG models. These models allow more flexible modeling of both the zero-inflation and the nonzero counts, and allow interplay between these two components. We develop the properties of this new class of models, including reparameterization to a natural link function. The models are straightforwardly fitted within a Bayesian framework. The methodology is illustrated with simulated data examples as well as a forest seedling data set obtained from the USDA Forest Service's Forest Inventory and Analysis program.
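
For readers unfamiliar with the baseline that the summary refers to, the following is a minimal sketch of the standard ZIP model, written as a mixture of a point mass at zero and a Poisson count. It is not the k-ZIG model, which the summary does not specify. The names `zip_pmf`, `zip_zero_prob`, `lam` (Poisson rate) and `pi` (zero-inflation weight) are illustrative assumptions, not notation from the paper.

```python
import math


def zip_pmf(y, lam, pi):
    """P(Y = y) for a zero-inflated Poisson (standard textbook form).

    With probability pi the count is a structural zero; otherwise it is
    drawn from Poisson(lam). `lam` and `pi` are hypothetical names.
    """
    poisson = math.exp(-lam) * lam ** y / math.factorial(y)
    structural_zero = pi if y == 0 else 0.0
    return structural_zero + (1.0 - pi) * poisson


def zip_zero_prob(lam, pi):
    """Total probability of observing a zero: pi + (1 - pi) * exp(-lam)."""
    return zip_pmf(0, lam, pi)


def zip_mean(lam, pi):
    """Mean of the ZIP distribution: (1 - pi) * lam."""
    return (1.0 - pi) * lam


if __name__ == "__main__":
    # Illustrative values only: a heavy zero-inflation weight.
    lam, pi = 2.0, 0.8
    print("P(Y = 0):", zip_zero_prob(lam, pi))
    print("E[Y]:", zip_mean(lam, pi))
```

In this form a single weight `pi` governs the excess zeros and a single rate `lam` governs the counts, which gives a concrete sense of the limited flexibility that the summary says the k-ZIG class is intended to relax.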

MSC:

62F15 Bayesian inference
62P12 Applications of statistics to environmental and related topics
