Geocoding public health data [Letter}

Virginia Commonwealth University, Ричмонд, Virginia, United States
American Journal of Public Health (Impact Factor: 4.23). 06/2003; 93(5):699; author reply 699-700. DOI: 10.2105/AJPH.93.5.699
Source: PubMed
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The necessity of aggregating health data over areas can impede our understanding of health determinants. We demonstrate the possibility of creating anonymous links between individual residences and the local environment using digital map data and a data linkage system. Digital map data were used successfully to anonymously link 1.3 million addresses to the local environment. The data linkage system allows detailed environment data surrounding each residence to be linked both to each resident therein and to their medical records. Local environment data specific to each house can be effectively and anonymously linked to the population registered with the National Health Service. Our integrated approach potentially enables flexible fine-scale, large-area observational studies of communities and health.
    Journal of Public Health 06/2009; 31(4):582-8. DOI:10.1093/pubmed/fdp041 · 2.30 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Area-level socioeconomic status (SES) may play an important role in drug abuse patterns, including related health outcomes. This may be particularly relevant for gamma-hydroxybutyrate (GHB), which is prototypical of "party" drug abuse. We retrospectively reviewed GHB-related cases reported to the California Poison Control System (CPCS; January 1, 1999 through June 30, 2007). We limited analysis to CPCS calls containing a residential zip code (ZC). The CPCS data were extracted for key case characteristics, including the residential ZC. We linked cases to corresponding 2000 U.S. Census data for area-level measures of SES and demographics. We used multiple logistic regression analysis to test the associations between area-level SES and GHB case severity, taking into account area-level demographics and individual-level GHB high-risk behaviors. We analyzed 210 cases. Taking into account area-level demographics (age and racial mix; urbanicity) and GHB-related high-risk behaviors (use of GHB congeners; GHB-dependence; co-ingestion of other agents), we associated higher area-level SES with greater GHB case severity. There was 40% increased likelihood of major GHB adverse health outcomes for every $100,000 incremental increase in median home values (OR 1.41; 95% CI 1.1-1.8). For median annual household income (per $10,000), the association was similar (OR 1.39; 95% CI 1.0-1.9). Higher area-level SES is associated with greater GHB-related case severity. This study may serve as a model using a geographic information system (GIS) approach to study the population-based correlates of drugs of abuse reported through poison control surveillance.
    Clinical Toxicology 10/2008; 47(1):48-57. DOI:10.1080/15563650802022839 · 3.12 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The need to estimate the distance from an individual to a service provider is common in public health research. However, estimated distances are often imprecise and, we suspect, biased due to a lack of specific residential location data. In many cases, to protect subject confidentiality, data sets contain only a ZIP Code or a county. This paper describes an algorithm, known as "the probabilistic sampling method" (PSM), which was used to create a distribution of estimated distances to a health facility for a person whose region of residence was known, but for which demographic details and centroids were known for smaller areas within the region. From this distribution, the median distance is the most likely distance to the facility. The algorithm, using Monte Carlo sampling methods, drew a probabilistic sample of all the smaller areas (Census blocks) within each participant's reported region (ZIP Code), weighting these areas by the number of residents in the same age group as the participant. To test the PSM, we used data from a large cross-sectional study that screened women at a clinic for intimate partner violence (IPV). We had data on each woman's age and ZIP Code, but no precise residential address. We used the PSM to select a sample of census blocks, then calculated network distances from each census block's centroid to the closest IPV facility, resulting in a distribution of distances from these locations to the geocoded locations of known IPV services. We selected the median distance as the most likely distance traveled and computed confidence intervals that describe the shortest and longest distance within which any given percent of the distance estimates lie. We compared our results to those obtained using two other geocoding approaches. We show that one method overestimated the most likely distance and the other underestimated it. Neither of the alternative methods produced confidence intervals for the distance estimates. The algorithm was implemented in R code. The PSM has a number of benefits over traditional geocoding approaches. This methodology improves the precision of estimates of geographic access to services when complete residential address information is unavailable and, by computing the expected distribution of possible distances for any respondent and associated distance confidence limits, sensitivity analyses on distance access measures are possible. Faulty or imprecise distance measures may compromise decisions about service location and misdirect scarce resources.
    International Journal of Health Geographics 01/2011; 10:4. DOI:10.1186/1476-072X-10-4 · 2.62 Impact Factor