Linking inpatient clinical registry data to Medicare claims data using indirect identifiers
ABSTRACT Inpatient clinical registries generally have limited ability to provide a longitudinal perspective on care beyond the acute episode. We present a method to link hospitalization records from registries with Medicare inpatient claims data, without using direct identifiers, to create a unique data source that pairs rich clinical data with long-term outcome data.
The method takes advantage of the hospital clustering observed in each database by demonstrating that different combinations of indirect identifiers within hospitals yield a large proportion of unique patient records. This high level of uniqueness also allows linking without advance knowledge of the Medicare provider number of each registry hospital. We applied this method to 2 inpatient databases and were able to identify 81% of 39,178 records in a large clinical registry of patients with heart failure and 91% of 6,581 heart failure records from a hospital inpatient database. The quality of the link is high, and reasons for incomplete linkage are explored. Finally, we discuss the unique opportunities afforded by combining claims and clinical data for specific analyses.
In the absence of direct identifiers, it is possible to create a high-quality link between inpatient clinical registry data and Medicare claims data. The method will allow researchers to use existing data to create a linked claims-clinical database that capitalizes on the strengths of both types of data sources.
Full-textDOI: · Available from: Bradley G Hammill, Jun 10, 2015
- Cardiology in the Young 12/2012; 22(6):823-830. DOI:10.1017/S1047951112001552 · 0.86 Impact Factor
- [Show abstract] [Hide abstract]
ABSTRACT: The National Cardiovascular Data Registry CathPCI Registry was recently linked with longitudinal Centers for Medicare & Medicaid (CMS) claims data. The degree to which this linked cohort is representative of the overall CathPCI Registry and CMS PCI populations is unknown. CathPCI Registry records were linked to CMS inpatient claims using indirect identifiers. We examined the degree to which hospitals and patients in the linked cohort are representative of the elderly (≥65 years) CathPCI Registry and CMS populations. From 2004 to 2006, 1492 hospitals filed CMS PCI claims and 663 contributed CathPCI Registry data. Of these hospitals, 643 (97%) were linked across data sources. Compared with all CMS PCI hospitals, the linked data set contained fewer governmental, northeastern, southern, and low-volume (<200 beds) sites. Among CMS beneficiaries, 993,351 PCI procedures were performed, including 398,508 (40.1%) at centers in the linked database. Of these, 341,916 (86%) were linked to CathPCI Registry records. Linked and unlinked CMS patients had similar demographic and clinical features. In the CathPCI Registry database, 477,456 elderly patients underwent PCI, with 359,077 (75%) linked to CMS claims. Linked and unlinked National Cardiovascular Data Registry patients were similar, except for less commercial or health maintenance organization insurance in the linked cohort. By using deterministic matching strategies, a large and representative cohort with detailed clinical data from the CathPCI Registry and longitudinal follow-up from CMS claims has been created.Circulation Cardiovascular Quality and Outcomes 01/2012; 5(1):134-40. DOI:10.1161/CIRCOUTCOMES.111.963280 · 5.04 Impact Factor
- [Show abstract] [Hide abstract]
ABSTRACT: There is increasing interest in reporting risk-standardized outcomes for Medicare beneficiaries hospitalized with acute ischemic stroke, but whether it is necessary to include adjustment for initial stroke severity has not been well studied. To evaluate the degree to which hospital outcome ratings and potential eligibility for financial incentives are altered after including initial stroke severity in a claims-based risk model for hospital 30-day mortality for acute ischemic stroke. Data were analyzed from 782 Get With The Guidelines-Stroke participating hospitals on 127,950 fee-for-service Medicare beneficiaries with ischemic stroke who had a score documented for the National Institutes of Health Stroke Scale (NIHSS, a 15-item neurological examination scale with scores from 0 to 42, with higher scores indicating more severe stroke) between April 2003 and December 2009. Performance of claims-based hospital mortality risk models with and without inclusion of NIHSS scores for 30-day mortality was evaluated and hospital rankings from both models were compared. Model discrimination, hospital 30-day mortality outcome rankings, and value-based purchasing financial incentive categories. Across the study population, the mean (SD) NIHSS score was 8.23 (8.11) (median, 5; interquartile range, 2-12). There were 18,186 deaths (14.5%) within the first 30 days, including 7430 deaths (5.8%) during the index hospitalization. The hospital mortality model with NIHSS scores had significantly better discrimination than the model without (C statistic, 0.864; 95% CI, 0.861-0.867, vs 0.772; 95% CI, 0.769-0.776; P < .001). Among hospitals ranked in the top 20% or bottom 20% of performers by the claims model without NIHSS scores, 26.3% were ranked differently by the model with NIHSS scores. Of hospitals initially classified as having "worse than expected" mortality, 57.7% were reclassified to "as expected" by the model with NIHSS scores. The net reclassification improvement (93.1%; 95% CI, 91.6%-94.6%; P < .001) and integrated discrimination improvement (15.0%; 95% CI, 14.6%-15.3%; P < .001) indexes both demonstrated significant enhancement of model performance after the addition of NIHSS. Explained variance and model calibration was also improved with the addition of NIHSS scores. Adding stroke severity as measured by the NIHSS to a hospital 30-day risk model based on claims data for Medicare beneficiaries with acute ischemic stroke was associated with considerably improved model discrimination and change in mortality performance rankings for a substantial portion of hospitals.JAMA The Journal of the American Medical Association 07/2012; 308(3):257-64. DOI:10.1001/jama.2012.7870 · 30.39 Impact Factor