Evaluation of a preliminary physical function item bank supported the expected advantages of the Patient-Reported Outcomes Measurement Information System (PROMIS).

Health Assessment Lab and QualityMetric Incorporated, 275 Wyman Street, Waltham, MA 02451, USA.
Journal of Clinical Epidemiology (Impact Factor: 5.48). 01/2008; 61(1):17-33. DOI: 10.1016/j.jclinepi.2006.06.025
Source: PubMed

ABSTRACT The Patient-Reported Outcomes Measurement Information System (PROMIS) was initiated to improve precision, reduce respondent burden, and enhance the comparability of health outcomes measures. We used item response theory (IRT) to construct and evaluate a preliminary item bank for physical function assuming four subdomains.
Data from seven samples (N=17,726) using 136 items from nine questionnaires were evaluated. A generalized partial credit model was used to estimate item parameters, which were normed to a mean of 50 (SD=10) in the US population. Item bank properties were evaluated through Computerized Adaptive Test (CAT) simulations.
IRT requirements were fulfilled by 70 items covering activities of daily living, lower extremity, and central body functions. The original item context partly affected parameter stability. Items on upper body function, and need for aid or devices did not fit the IRT model. In simulations, a 10-item CAT eliminated floor and decreased ceiling effects, achieving a small standard error (< 2.2) across scores from 20 to 50 (reliability >0.95 for a representative US sample). This precision was not achieved over a similar range by any comparable fixed length item sets.
The methods of the PROMIS project are likely to substantially improve measures of physical function and to increase the efficiency of their administration using CAT.


Available from: Jakob B Bjorner, Apr 28, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Racial and ethnic disparities persist in stroke occurrence, recurrence, morbidity and mortality. Uncontrolled hypertension (HTN) is the most important modifiable risk factor for stroke risk. Home health care organizations care for many patients with uncontrolled HTN and history of stroke; however, recurrent stroke prevention has not been a home care priority. We are conducting a randomized controlled trial (RCT) to compare the effectiveness, relative to usual home care (UHC), of two Community Transitions Interventions (CTIs). The CTIs aim to reduce recurrent stroke risk among post-stroke patients via home-based transitional care focused on better HTN management.Methods/DesignThis 3-arm trial will randomly assign 495 black and Hispanic post-stroke home care patients with uncontrolled systolic blood pressure (SBP) to one of three arms: UHC, UHC complemented by nurse practitioner-delivered transitional care (UHC¿+¿NP) or UHC complemented by an NP plus health coach (UHC¿+¿NP¿+¿HC). Both intervention arms emphasize: 1) linking patients to continuous, responsive preventive and primary care, 2) increasing patients¿/caregivers¿ ability to manage a culturally and individually tailored BP reduction plan, and 3) facilitating the patient¿s reintegration into the community after home health care discharge. The primary hypothesis is that both NP-only and NP¿+¿HC transitional care will be more effective than UHC alone in achieving a SBP reduction. The primary outcome is change in SPB at 3 and 12 months. The study also will examine cost-effectiveness, quality of life and moderators (for example, race/ethnicity) and mediators (for example, changes in health behaviors) that may affect treatment outcomes. All outcome data are collected by staff blinded to group assignment.DiscussionThis study targets care gaps affecting a particularly vulnerable black/Hispanic population characterized by persistent stroke disparities. It focuses on care transitions, a juncture when patients are particularly susceptible to adverse events. The CTI is innovative in adapting for stroke patients an established transitional care model shown to be effective for HF patients, pairing the professional NP with a HC, implementing a culturally tailored intervention, and placing primary emphasis on longer-term risk factor reduction and community reintegration rather than shorter-term transitional care outcomes.Trial NCT01918891; Registered 5 August 2013.
  • [Show abstract] [Hide abstract]
    ABSTRACT: To compare the psychometric functioning of multidimensional disease-specific, multiitem generic, and single-item measures of fatigue in patients with rheumatoid arthritis (RA). Confirmatory factor analysis (CFA) and longitudinal item response theory (IRT) modeling were used to evaluate the measurement structure and local reliability of the Bristol RA Fatigue Multi-Dimensional Questionnaire (BRAF-MDQ), the Medical Outcomes Study Short Form-36 (SF-36) vitality scale, and the BRAF Numerical Rating Scales (BRAF-NRS) in a sample of 588 patients with RA. A 1-factor CFA model yielded a similar fit to a 5-factor model with subscale-specific dimensions, and the items from the different instruments adequately fit the IRT model, suggesting essential unidimensionality in measurement. The SF-36 vitality scale outperformed the BRAF-MDQ at lower levels of fatigue, but was less precise at moderate to higher levels of fatigue. At these levels of fatigue, the living, cognition, and emotion subscales of the BRAF-MDQ provide additional precision. The BRAF-NRS showed a limited measurement range with its highest precision centered on average levels of fatigue. The different instruments appear to access a common underlying domain of fatigue severity, but differ considerably in their measurement precision along the continuum. The SF-36 vitality scale can be used to measure fatigue severity in samples with relatively mild fatigue. For samples expected to have higher levels of fatigue, the multidimensional BRAF-MDQ appears to be a better choice. The BRAF-NRS are not recommended if precise assessment is required, for instance in longitudinal settings.
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background: The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. Purpose: To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. Results: All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. Conclusion: The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future research should investigate the LE CAT for wider use in different populations.