Accounting for Excess Zeros and Sample Selection in Poisson and Negative Binomial Regression Models

Source: RePEc

ABSTRACT We present several modifications of the Poisson and negative binomial models for count data to accommodate cases in which the number of zeros in the data exceed what would typically be predicted by either model. The excess zeros can masquerade as overdispersion. We present a new test procedure for distinguishing between zero inflation and overdispersion. We also develop a model for sample selection which is analogous to the Heckman style specification for continuous choice models. An application is presented to a data set on consumer loan behavior in which both of these phenomena are clearly present.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We analyze the claims database of a large malpractice insurer covering more than 8,000 physicians and 9,300 claims. Applying empirical Bayes methods in a regression setting, we construct a predictor of each physician's underlying propensity to incur malpractice claims. Our explanatory factors are physician demographics (age, sex, specialty, training) and physician practice pattern characteristics (practice setting, procedures performed, practice intensity, special risk factors, and characteristics of hospital(s) on staff of). We divide physicians into medical and surgical/ancillary specialty categories and fit separate models to each. In the surgical/ancillary specialty group, physician characteristics can effectively distinguish between more and less claims-prone physicians. Physician characteristics have somewhat less predictive power in the medical specialty group. As measured by predictive information, physician characteristics are superior to 10 years of claims history. Insofar as medical malpractice claims can be thought of as extreme indicators of poor-quality care, this finding suggests that easily gathered physician characteristics can be helpful in designing targeted quality of care improvement policies.
    Journal of Empirical Legal Studies 04/2007; · 1.40 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Recently, the sport of ice climbing has seen a dramatic increase in popularity. This paper uses the travel cost method to estimate the demand for ice climbing in Hyalite Canyon, Montana, one of the premier ice climbing venues in North America. Access to Hyalite and other ice climbing destinations have been put at risk due to liability issues, public land management agendas, and winter road conditions. To this point, there has been no analysis on the economic benefits of ice climbing. In addition to the novel outdoor recreation application, this study applies econometric methods designed to deal with "excess zeros" in the data. Depending upon model specification, per person per trip values are estimated to be in the range of $76 to $135.
    Journal of Environmental Management 11/2009; 91(4):1012-20. · 3.06 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Using a dataset of greenfield investments for the period of 1997-2008, the paper by three Dutch researchers seeks to determine to what extent Chinese and Indian foreign direct investment (FDI) in Europe is attracted to specific regional location factors. The authors utilize descriptive statistics and a negative binominal estimation method to analyze the number of greenfield investments, in an effort to explain why Chinese and Indian FDI is quite unevenly distributed across Europe. Support is marshaled for the hypothesis that Chinese and Indian FDI is more horizontal than vertical in character, and that divergence over time between current core European locations and more peripheral ones is increasing.
    Eurasian Geography and Economics 03/2010; 51(2):254-273. · 1.69 Impact Factor

Full-text (2 Sources)

Available from
Jun 2, 2014