CProb: A Computational Tool for Conducting Conditional Probability Analysis

U.S. Environmental Protection Agency, Office of Research and Development, National Health and Environmental Effects Research Lab., Atlantic Ecology Div., 27 Tarzwell Drive, Narragansett, RI 02882, USA.
Journal of Environmental Quality (Impact Factor: 2.65). 11/2008; 37(6):2392-6. DOI: 10.2134/jeq2007.0536
Source: PubMed


Conditional probability is the probability of observing one event given that another event has occurred. In an environmental context, conditional probability helps to assess the association between an environmental contaminant (i.e., the stressor) and the ecological condition of a resource (i.e., the response). These analyses, when combined with controlled experiments and other methodologies, show great promise in evaluating ecological conditions from observational data and in defining water quality and other environmental criteria. Current applications of conditional probability analysis (CPA) are largely done via scripts or cumbersome spreadsheet routines, which may prove daunting to end-users and do not provide access to the underlying scripts. Combining spreadsheets with scripts eases computation through a familiar interface (i.e., Microsoft Excel) and creates a transparent process through full accessibility to the scripts. With this in mind, we developed a software application, CProb, as an Add-in for Microsoft Excel with R, R(D)com Server, and Visual Basic for Applications. CProb calculates and plots scatterplots, empirical cumulative distribution functions, and conditional probability. In this short communication, we describe CPA, our motivation for developing a CPA tool, and our implementation of CPA as a Microsoft Excel Add-in. Further, we illustrate the use of our software with two examples: a water quality example and a landscape example. CProb is freely available for download at http://www.epa.gov/emap/nca/html/regions/cprob.


Available from: Henry A Walker
  • Source
    • "Increasingly, efforts to develop a variety of water quality criteria have focused on the use of field data to develop relationships between biological responses and their stressors, and then to identify levels of the stressors that preserve the desired biological conditions. Examples of stressors other than nutrients that have been investigated in this way include river sediments, pathogens, metals, and specific conductance (Shine et al. 2003; Paul and McDonald 2005; Cormier et al. 2008, 2013; Hollister et al. 2008; Nevers and Whitman 2011; USEPA 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: High levels of the nutrients nitrogen and phosphorus can cause unhealthy biological or ecological conditions in surface waters, and prevent the attainment of their designated uses. Regulatory agencies are developing numeric criteria for these nutrients in an effort to ensure that the surface waters in their jurisdictions remain healthy and productive, and that water quality standards are met. These criteria are often derived using field measurements that relate nutrient concentrations and other water quality conditions to expected biological responses such as undesirable growth or changes in aquatic plant and animal communities. Ideally, these numeric criteria can be used to accurately "diagnose" ecosystem health and guide management decisions. However, the degree to which numeric nutrient criteria are useful for decision-making depends on how accurately they reflect the status or risk of nutrient-related biological impairments. Numeric criteria that have little predictive value are not likely to be useful for managing nutrient concerns. This paper presents information on the role of numeric nutrient criteria as biological health indicators, and the potential benefits of sufficiently accurate criteria for nutrient management. In addition, it describes approaches being proposed or adopted in states such as Florida and Maine to improve the accuracy of numeric criteria and criteria-based decisions. This includes a preference for developing site-specific criteria where sufficient data are available, and the use of nutrient concentration and biological response criteria together in a framework to support designated use attainment decisions. Together with systematic planning during criteria development, the accuracy of field-derived numeric nutrient criteria can be assessed and maximized as a part of an overall effort to manage nutrient water quality concerns. Integr Environ Assess Manag © 2013 SETAC.
    Integrated Environmental Assessment and Management 01/2014; 10(1). DOI:10.1002/ieam.1485 · 1.38 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Often when various estuarine benthic indices disagree in their assessments of benthic condition, they are reflecting different aspects of benthic condition. We describe a process to screen indices for associations and, after identifying candidate metrics, evaluate metrics individually against the indices. We utilize radar plots as a multi-metric visualization tool, and conditional probability plots and receiver operating characteristic curves to evaluate associations seen in the plots. We investigated differences in two indices, the US EPA Environmental Monitoring and Assessment Program's benthic index for the Virginian Province and the New York Harbor benthic index of biotic integrity using data collected in New York Harbor and evaluated overall agreement of the indices and associations between each index and measures of habitat and sediment contamination. The indices agreed in approximately 78% of the cases. The New York Harbor benthic index of biotic integrity showed stronger associations with sediment metal contamination and grain size.
    Marine Pollution Bulletin 01/2009; 59(1-3):65-71. DOI:10.1016/j.marpolbul.2008.11.009 · 2.99 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: There is global interest in recovering locally extirpated carnivore species. Successful efforts to recover Louisiana black bear in Louisiana have prompted interest in recovery throughout the species' historical range. We evaluated support for three potential black bear recovery strategies prior to public release of a black bear conservation and management plan for eastern Texas, United States. Data were collected from 1,006 residents living in proximity to potential recovery locations, particularly Big Thicket National Preserve. In addition to traditional logistic regression analysis, we used conditional probability analysis to statistically and visually evaluate probabilities of public support for potential black bear recovery strategies based on socioeconomic characteristics. Allowing black bears to repopulate the region on their own (i.e., without active reintroduction) was the recovery strategy with the greatest probability of acceptance. Recovery strategy acceptance was influenced by many socioeconomic factors. Older and long-time local residents were most likely to want to exclude black bears from the area. Concern about the problems that black bears may cause was the only variable significantly related to support or non-support across all strategies. Lack of personal knowledge about black bears was the most frequent reason for uncertainty about preferred strategy. In order to reduce local uncertainty about possible recovery strategies, we suggest that wildlife managers focus outreach efforts on providing local residents with general information about black bears, as well as information pertinent to minimizing the potential for human-black bear conflict.
    Environmental Management 06/2010; 45(6):1299-311. DOI:10.1007/s00267-010-9485-3 · 1.72 Impact Factor
Show more