Zero-state Markov switching count-data models: an empirical assessment.

School of Civil Engineering, 550 Stadium Mall Drive, Purdue University, West Lafayette, IN 47907, USA.
Accident; analysis and prevention (Impact Factor: 1.65). 01/2010; 42(1):122-30. DOI: 10.1016/j.aap.2009.07.012
Source: PubMed

ABSTRACT In this study, a two-state Markov switching count-data model is proposed as an alternative to zero-inflated models to account for the preponderance of zeros sometimes observed in transportation count data, such as the number of accidents occurring on a roadway segment over some period of time. For this accident-frequency case, zero-inflated models assume the existence of two states: one of the states is a zero-accident count state, which has accident probabilities that are so low that they cannot be statistically distinguished from zero, and the other state is a normal-count state, in which counts can be non-negative integers that are generated by some counting process, for example, a Poisson or negative binomial. While zero-inflated models have come under some criticism with regard to accident-frequency applications - one fact is undeniable - in many applications they provide a statistically superior fit to the data. The Markov switching approach we propose seeks to overcome some of the criticism associated with the zero-accident state of the zero-inflated model by allowing individual roadway segments to switch between zero and normal-count states over time. An important advantage of this Markov switching approach is that it allows for the direct statistical estimation of the specific roadway-segment state (i.e., zero-accident or normal-count state) whereas traditional zero-inflated models do not. To demonstrate the applicability of this approach, a two-state Markov switching negative binomial model (estimated with Bayesian inference) and standard zero-inflated negative binomial models are estimated using five-year accident frequencies on Indiana interstate highway segments. It is shown that the Markov switching model is a viable alternative and results in a superior statistical fit relative to the zero-inflated models.

  • Cell Biology International 01/2008; 32(3). · 1.64 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The analysis of highway-crash data has long been used as a basis for influencing highway and vehicle designs, as well as directing and implementing a wide variety of regulatory policies aimed at improving safety. And, over time there has been a steady improvement in statistical methodologies that have enabled safety researchers to extract more information from crash databases to guide a wide array of safety design and policy improvements. In spite of the progress made over the years, important methodological barriers remain in the statistical analysis of crash data and this, along with the availability of many new data sources, present safety researchers with formidable future challenges, but also exciting future opportunities. This paper provides guidance in defining these challenges and opportunities by first reviewing the evolution of methodological applications and available data in highway-accident research. Based on this review, fruitful directions for future methodological developments are identified and the role that new data sources will play in defining these directions is discussed. It is shown that new methodologies that address complex issues relating to unobserved heterogeneity, endogeneity, risk compensation, spatial and temporal correlations, and more, have the potential to significantly expand our understanding of the many factors that affect the likelihood and severity (in terms of personal injury) of highway crashes. This in turn can lead to more effective safety countermeasures that can substantially reduce highway-related injuries and fatalities.
    Analytic Methods in Accident Research. 01/2013;
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In traditional identification of hot spots, often known as the sites with black spots or accident-prone locations, methodologies are developed based on the total number of accidents. These criteria provide no consideration of whether the accidents were caused or could be averted by road improvements. These traditional methods result in misidentification of locations that are not truly hazardous from a road safety authority perspective and consequently may lead to a misapplication of safety improvement funding. We consider a mixture of the zero-inflated Poisson and the Poisson regression models to analyze zero-inflated data sets drawn from traffic accident studies. Based on the membership probabilities, observations are well separated into two clusters. One is the ZIP cluster; the other is the standard Poisson cluster. A simulation study and real data analysis are performed to demonstrate model fitting performances of the proposed model. The Bayes factor and the Bayesian information criterion are used to compare the proposed model with several competing models. Ultimately, this model could detect accident-prone spots.
    KSCE Journal of Civil Engineering 16(3). · 0.38 Impact Factor

Full-text (2 Sources)

Available from
Jul 25, 2014