Barbara Gandek’s research while affiliated with University of Massachusetts Chan Medical School and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (84)


Effects of MCC impact adjustment methods on PCS outcome predictions by OA-specific QOL impact
Improving multimorbidity measurement using individualized disease-specific quality of life impact assessments: predictive validity of a new comorbidity index
  • Article
  • Full-text available

July 2022

·

20 Reads

·

6 Citations

Health and Quality of Life Outcomes

·

Barbara Gandek

·

John E. Ware

Background Interpretation of health-related quality of life (QOL) outcomes requires improved methods to control for the effects of multiple chronic conditions (MCC). This study systematically compared legacy and improved method effects of aggregating MCC on the accuracy of predictions of QOL outcomes. Methods Online surveys administered generic physical (PCS) and mental (MCS) QOL outcome measures, the Charlson Comorbidity Index (CCI), an expanded chronic condition checklist (CCC), and individualized QOL Disease-specific Impact Scale (QDIS) ratings in a developmental sample (N = 5490) of US adults. Controlling for sociodemographic variables, regression models compared 12- and 35-condition checklists, mortality vs. population QOL-weighting, and population vs. individualized QOL weighting methods. Analyses were cross-validated in an independent sample (N = 1220) representing the adult general population. Models compared estimates of variance explained (adjusted R ² ) and model fit (AIC) for generic PCS and MCS across aggregation methods at baseline and nine-month follow-up. Results In comparison with sociodemographic-only regression models (MCS R ² = 0.08, PCS = 0.09) and Charlson CCI models (MCS R ² = 0.12, PCS = 0.16), increased variance was accounted for using the 35-item CCC (MCS R ² = 0.22, PCS = 0.31), population MCS/PCS QOL weighting (R ² = 0.31–0.38, respectively) and individualized QDIS weighting (R ² = 0.33 & 0.42). Model R ² and fit were replicated upon cross-validation. Conclusions Physical and mental outcomes were more accurately predicted using an expanded MCC checklist, population QOL rather than mortality CCI weighting, and individualized rather than population QOL weighting for each reported condition. The 3-min combination of CCC and QDIS ratings (QDIS-MCC) warrant further testing for purposes of predicting and interpreting QOL outcomes affected by MCC.

Download

Developing a New Version of the SF-6D Health State Classification System From the SF-36v2: SF-6Dv2

January 2020

·

858 Reads

·

111 Citations

Medical Care

John E Brazier

·

·

·

[...]

·

Objective: The objective of this study was to develop the classification system for version of the SF-6D (SF-6Dv2) from the SF-36v2. SF-6Dv2 is an improved version of SF-6D, one of the most widely used generic measures of health for the calculation of quality-adjusted life years. Study design and setting: A 3-step process was undertaken to generate a new classification system: (1) factor analysis to establish dimensionality; (2) Rasch analysis to understand item performance; and (3) tests of differential item function. To evaluate robustness, Rasch analyses were performed in multiple subsets of 2 large cross-sectional datasets from recently discharged hospital patients and online patient samples. Results: On the basis of factor analysis, other psychometric evidence, cross-cultural considerations, and amenability to valuation, the 6-dimension classification used in SF-6D was maintained. SF-6Dv2 resulted in the following modifications to SF-6D: a simpler classification of physical function with clearer separation between levels; a more detailed 5-level description of role limitations; using negative wording to describe vitality; and using pain severity rather than pain interference. Conclusions: The SF-6Dv2 classification system describes more distinct levels of health than SF-6D, changes the descriptions used for a number of dimensions and provides clearer wording for health state valuation. The second stage of the study has developed a utility value set using discrete choice methods so that the measure can be used in health technology assessment. Further work should investigate the psychometric characteristics of the new instrument.



Varying the item format improved the range of measurement in patient-reported outcome measures assessing physical function

March 2017

·

397 Reads

·

20 Citations

Arthritis Research & Therapy

Background: Physical function (PF) is a core patient-reported outcome domain in clinical trials in rheumatic diseases. Frequently used PF measures have ceiling effects, leading to large sample size requirements and low sensitivity to change. In most of these instruments, the response category that indicates the highest PF level is the statement that one is able to perform a given physical activity without any limitations or difficulty. This study investigates whether using an item format with an extended response scale, allowing respondents to state that the performance of an activity is easy or very easy, increases the range of precise measurement of self-reported PF. Methods: Three five-item PF short forms were constructed from the Patient-Reported Outcomes Measurement Information System (PROMIS®) wave 1 data. All forms included the same physical activities but varied in item stem and response scale: format A (“Are you able to …”; “without any difficulty”/“unable to do”); format B (“Does your health now limit you …”; “not at all”/“cannot do”); format C (“How difficult is it for you to …”; “very easy”/“impossible”). Each short-form item was answered by 2217–2835 subjects. We evaluated unidimensionality and estimated a graded response model for the 15 short-form items and remaining 119 items of the PROMIS PF bank to compare item and test information for the short forms along the PF continuum. We then used simulated data for five groups with different PF levels to illustrate differences in scoring precision between the short forms using different item formats. Results: Sufficient unidimensionality of all short-form items and the original PF item bank was supported. Compared to formats A and B, format C increased the range of reliable measurement by about 0.5 standard deviations on the positive side of the PF continuum of the sample, provided more item information, and was more useful in distinguishing known groups with above-average functioning. Conclusions: Using an item format with an extended response scale is an efficient option to increase the measurement range of self-reported physical function without changing the content of the measure or affecting the latent construct of the instrument.


Reducing ceiling effects in PROMIS® Physical Function measures by using an extended response scale

October 2016

·

165 Reads

Objective: To investigate whether an extended response scale increases the range of measurement in static PROMIS® Physical Function (PF) measures. Methods: In the course of PROMIS wave 1 data collection, a 5-item PF short form was presented in three different item formats. Two of these formats are used in the PROMIS PF item bank, both utilizing a response scale with five response categories in which the highest PF category indicates performing an activity “without any difficulty” or degree of limitation expressed as “not at all”. In this study, a third item format was presented using six response categories in which the highest levels of function in performing an activity are “easy” and “very easy”. We compared format-specific item information curves between the three versions of the 5-item short form, using a Graded Response Model which included the different versions of the 5-item short form and all remaining items of the PROMIS PF item bank. We additionally simulated response patterns for five groups with different PF levels (“very low” to “very high”; n=10,000/group) to calculate relative validity (RV) coefficients for each format-specific short form indicating the relative power to distinguish between known groups compared to the full item bank (RV=1.0). Results: More than n=8,500 participants responded to a subset of the five items presented in different item formats. Compared to the two standard item formats (difficulty/limitation), the extended six-category format increased the range of measurement by more than 0.5 standard deviations on the latent PF continuum. It also showed higher power to distinguish between groups with “high” and “very high” PF than the difficulty/limitation formats. Conclusions: Using an item format with an extended response scale is a promising and efficient way to increase measurement range in patient-reported PF measures with the potential to reduce ceiling effects.


Figure 1. Comparison of health information brokers versus nonbrokers within low-income respondents (<$20,000 annually; n=744).  
Figure 2. Comparison of health information brokers versus nonbrokers among respondents born outside the United States (n=508).  
Standardizing disease-specific quality of life measures across multiple chronic conditions: Development and initial evaluation of the QOL Disease Impact Scale (QDIS®)

June 2016

·

795 Reads

·

78 Citations

Health and Quality of Life Outcomes

Background To document the development and evaluation of the Quality of life Disease Impact Scale (QDIS®), a measure that standardizes item content and scoring across chronic conditions and provides a summary, norm-based QOL impact score for each disease. MethodsA bank of 49 disease impact items was constructed from previously-used descriptions of health impact to represent ten frequently-measured quality of life (QOL) content areas and operational definitions successfully utilized in generic QOL surveys. In contrast to health in general, all items were administered with attribution to a specific disease (osteoarthritis, rheumatoid arthritis, angina, myocardial infarction, congestive heart failure, chronic kidney disease (CKD), diabetes, asthma, or COPD). Responses from 5418 adults were analyzed as five disease groups: arthritis, cardiovascular, CKD, diabetes, and respiratory. Unidimensionality, item parameter and scale-level invariance, reliability, validity and responsiveness to change during 9-month follow-up were evaluated by disease group and for all groups combined using multi-group confirmatory factor analysis (MGCFA), item response theory (IRT) and analysis of variance methods. QDIS was normed in an independent chronically ill US population sample (N = 4120). ResultsMGCFA confirmed a 1-factor model, justifying a summary score estimated using equal parameters for each item across disease groups. In support of standardized IRT-based scoring, correlations were very high between disease-specific and standardized IRT item slopes (r = 0.88–0.96), thresholds (r = 0.93–0.99) and person-level scores (r ≥ 0.99). Internal consistency, test-retest and person-level IRT reliability were consistently satisfactory across groups. In support of interpreting QDIS as a disease-specific measure, in comparison with generic measures, QDIS consistently discriminated markedly better across disease severity levels, correlated higher with other disease-specific measures in cross-sectional tests, and was more responsive in comparisons of groups with better, same or worse evaluations of disease-specific outcomes at the 9-month follow-up. Conclusions Standardization of content and scoring across diseases was shown to be justified psychometrically and enabled the first summary measure of disease-specific QOL impact normed in the chronically ill population. This disease-specific approach substantially improves discriminant validity and responsiveness over generic measures and provides a basis for better understanding the relative QOL impact of multiple chronic conditions in research and clinical practice.


The impact of item format on item information and resulting person parameters in patient-reported outcomes measuring physical function

October 2015

·

65 Reads

Quality of Life Research

Aim: An essential step in the development of patient-reported outcomes (PROs) is the determination of the specific item format, including item content and wording. Using the example of measuring physical function (PF), respective item content can focus on either a person’s capability or on a person’s performance, while respective item wording may either be deficit- or resource-oriented. The aim of this study was to investigate if these aspects of item structure affect item information and resulting person parameters (thetas). Methods: In the course of the PROMIS wave 1 data collection, participants rated their PF status based on five daily activities presented in four different item formats, resulting in a set of 20 items. Three formats aimed at capability (“Are you able to…?”, “How difficult is it for you to…?”, “Does your health now limit you to…?”); one asked for performance (“Over the last 7 days, did you…?”). Using Item-Response Theory (IRT), we applied a Graded Response Model (GRM) and a Generalized Partial Credit Model (GPCM) and compared the format-specific item information curves for each of the five activities. By fitting an IRT-based mixed-effect model, we examined the impact of the item format on the theta estimates. Results: Using an incomplete blocked design, data were used from n=15,721 participants who responded to a subset of the PROMIS PF item bank. Item information was similar for capability item formats with a maximum information between 4 and 9, depending on rated activity and the underlying IRT model. The performance item format led to considerably lower item information with a maximum information of <0.5. Capability formats did not differ in resulting theta estimates, while the performance format systematically underestimated person parameters (p<.0001). The range of high item information along the theta continuum was broader for the GRM, while the maximum information was higher for the GPCM. Conclusions: Different capability item formats used for the assessment of physical function lead to similarly high item information and to equal theta estimations. In contrast, a performance item format does not seem to be appropriate for measuring the PROMIS PF construct.



Figure 1. Improved score distribution for new and SF-36 physical functioning measures. Note. Percentages may not add to 100 due to rounding. 
Figure 2 of 2
Preliminary Evaluation of a New German Translated Tobacco Quality of Life Impact Tool to Discriminate Between Healthy Current and Former Smokers and to Explore the Effect of Switching Smokers to a Reduced Toxicant Prototype Cigarette

April 2015

·

101 Reads

·

4 Citations

Nicotine & Tobacco Research

Assessment of health-related quality of life (HRQoL) is well established in clinical research, but ceiling effects in validated tools might prevent detection of changes in well respondents. Tobacco Quality of Life Impact Tool (TQOLITv1) uses conceptual and psychometric advances to enhance detection of HRQoL changes. In a 6-month, forced-switch study, the German TQOLITv1 was assessed in healthy adult (age 23-55 years) current and matched former-smokers. At baseline, smokers were switched to reduced toxicant prototype (RTP) or conventional cigarette for 6 months. TQOLITv1 responses were collected at baseline, 3 and 6 months from current smokers whilst former smokers completed it at the latter two time points. TQOLITv1 includes SF-36v2 and new smoking-specific, physical and general-health measures. Reliability at baseline was good (Cronbach's coefficient alpha > 0.70) for all measures. The baseline percentage with the best possible score (ceiling effect) for former and current smokers was substantially better for the new physical function than SF-36 physical function measure (35% vs. 59% at ceiling, respectively). New smoking-specific measures discriminated current from former smokers better than general health measures. Smoking-specific symptoms (r = 0.73) were more stable from baseline to 6 months than other measures (r = 0.38-0.54) particularly more than the SF-36 mental component score (r = 0.24). Although both product smoking groups worsened in most HRQoL measures, changes in general and smoking-specific HRQoL impact measures favored RTP smokers. The German TQOLITv1 is sufficiently reliable and valid to assess HRQoL and may be more useful than SF-36v2 in evaluation of interventions in well smoking populations including those consuming RTPs. © The Author 2015. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco.



Citations (75)


... Healthcare 2025, 13, 832 2 of 13 and complex challenges that healthcare systems must face, necessitating the development of different approaches in both patient management and care. In this context of growing demand for healthcare services, the use of tools that allow for the measurement of the quality of healthcare services provided becomes even more essential [3,4]. Quality can be assessed subjectively, based on the perceptions, expectations, and demands of various stakeholders. ...

Reference:

How to Assess Health Gains
Improving multimorbidity measurement using individualized disease-specific quality of life impact assessments: predictive validity of a new comorbidity index

Health and Quality of Life Outcomes

... Cost utility analysis is informed by the Quality Adjusted Life Year (QALY) Current measurement systems for use in cost utility analysis such as the EQ-5D-5 L [5] or SF-6D [6][7][8][9] focus on measuring the construct of health-related quality of life (HRQoL). Constructs are theoretical concepts that are measured empirically and can relate to single or multiple dimensions. ...

Developing a New Version of the SF-6D Health State Classification System From the SF-36v2: SF-6Dv2

Medical Care

... This finding reported decades ago 23 has been replicated recently and independently. 24,25 The current study is the first to standardize this solution across multiple functioning (PF, RP, SF, RE) domains. For feelings conceptualized and modeled as bipolar (ie, having opposite poles), for example, vitality, short-form measurement was achieved using multiple "energy" and "fatigue" items 26 or relied on a much more narrowly defined unipolar fatigue 11 or energy 22 item. ...

Varying the item format improved the range of measurement in patient-reported outcome measures assessing physical function

Arthritis Research & Therapy

... However, disease-specific scales, such as ours, often rely on the presence of diseaserelated events, as opposed to the absence of these events. As a result, the score for scales such as these is often based on frequency and severity of these disease-specific events, yielding a higher number that correlates with a lower overall QOL [33,34]. Given these established precedents, in addition to our inclusion of the global index of severity question item for each domain, our scoring directionality for the RhizoQOL is reasonable for a disease-specific QOL survey instrument. ...

Standardizing disease-specific quality of life measures across multiple chronic conditions: Development and initial evaluation of the QOL Disease Impact Scale (QDIS®)

Health and Quality of Life Outcomes

... During treatment, quality of life was impaired. On the other hand, after reaching SVR, the quality of life of these patients was improved compared to their status before reaching SVR [17][18][19][20][21][22][23]. To date, there are no studies which examined the effect of DAAs on the long-term quality of life of these patients. ...

ASSESSMENT OF THE HEALTH-RELATED QUALITY-OF-LIFE (HQL) OF PATIENTS WITH CHRONIC HEPATITIS-C (CHC)
  • Citing Article
  • April 1994

Gastroenterology

... [24,26] This adaptive logic is the next step when more reliable individualized estimates (e.g., likelihood of treatment relief ) are needed [16]. Feasibility, respondent burden reduction, and clinical utility of such adaptive logic were supported in a national registry pilot study before and after joint replacement, where responsiveness and high correlations between QDIS-OA, QDIS-MCC, and generic PCS outcomes were statistically significant despite a very small sample [83]. Other findings suggest there are points beyond which additional measurement precision may not be worth the burden and cost [26,41]. ...

Cutting edge solutions to improving the efficiency of PRO measurement: from real-data simulations to pilot testing before and after total joint replacement in a national registry
  • Citing Article
  • October 2015

Quality of Life Research

... From baseline to week 25, participants in the BUP-XR treatment groups showed statistically significant improvement in health status and HRQoL measures. The differences between treatment and placebo groups on both physical and mental outcomes measured using the SF-36v2 were greater than the clinically relevant threshold of 2-3 points reported in the published literature (Ware et al., 2007(Ware et al., , 2008. Additionally, our results are similar to those previously published demonstrating the beneficial effect of pharmacologic therapy for OUD on HRQoL (De Jong et al., 2007;Carpentier et al., 2009;Schafer et al., 2009;Oviedo-Joekes et al., 2010;Griffin et al., 2015). ...

Determining important differences in scores
  • Citing Article
  • January 2000

... Tool 3: The health-related quality of life (SF-12 health survey): It contains twelve items divided into eight items that Ware et al. (2002) adopted. The SF-36 health survey, which is one of the most widely used general instruments for evaluating functioning associated with mental and physical health (MCS-12 and PCS-12, respectively), serves as the foundation for the SF-12 (Ware et al., 1996). ...

How to score SF-12 items