Donna Spiegelman’s research while affiliated with Yale University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (818)


Power and Sample Size Calculations for Cluster Randomized Hybrid Type 2 Effectiveness-Implementation Studies
  • Article

February 2025

·

3 Reads

Statistics in Medicine

Melody A Owen

·

Geoffrey M Curran

·

·

[...]

·

Donna Spiegelman

Hybrid studies allow investigators to simultaneously study an intervention effectiveness outcome and an implementation research outcome. In particular, type 2 hybrid studies support research that places equal importance on both outcomes rather than focusing on one and secondarily on the other (i.e., type 1 and type 3 studies). Hybrid 2 studies introduce the statistical issue of multiple testing, complicated by the fact that they are typically also cluster randomized trials. Standard statistical methods do not apply in this scenario. Here, we describe the design methodologies available for validly powering hybrid type 2 studies and producing reliable sample size calculations in a cluster‐randomized design with a focus on binary outcomes. Through a literature search, 18 publications were identified that included methods relevant to the design of hybrid 2 studies. Five methods were identified, two of which did not account for clustering but are extended in this article to do so, namely the combined outcomes approach and the single 1‐degree of freedom combined test. Procedures for powering hybrid 2 studies using these five methods are described and illustrated using input parameters inspired by a study from the Community Intervention to Reduce CardiovascuLar Disease in Chicago (CIRCL‐Chicago) Implementation Research Center. In this illustrative example, the intervention effectiveness outcome was controlled blood pressure, and the implementation outcome was reach. The conjunctive test resulted in higher power than the popular p value adjustment methods, and the newly extended combined outcomes and single 1‐DF test were found to be the most powerful among all of the tests.


Identifying Key Influencers using an Egocentric Network-based Randomized Design

February 2025

·

2 Reads

Behavioral health interventions, such as trainings or incentives, are implemented in settings where individuals are interconnected, and the intervention assigned to some individuals may also affect others within their network. Evaluating such interventions requires assessing both the effect of the intervention on those who receive it and the spillover effect on those connected to the treated individuals. With behavioral interventions, spillover effects can be heterogeneous in that certain individuals, due to their social connectedness and individual characteristics, are more likely to respond to the intervention and influence their peers' behaviors. Targeting these individuals can enhance the effectiveness of interventions in the population. In this paper, we focus on an Egocentric Network-based Randomized Trial (ENRT) design, wherein a set of index participants is recruited from the population and randomly assigned to the treatment group, while concurrently collecting outcome data on their nominated network members, who remina untreated. In such design, spillover effects on network members may vary depending on the characteristics of the index participant. Here, we develop a testing method, the Multiple Comparison with Best (MCB), to identify subgroups of index participants whose treatment exhibits the largest spillover effect on their network members. Power and sample size calculations are then provided to design ENRTs that can detect key influencers. The proposed methods are demonstrated in a study on network-based peer HIV prevention education program, providing insights into strategies for selecting peer educators in peer education interventions.


Construction of a Large, Public, HIV-Related Database in Support of Ending the HIV Epidemic Initiative: Learnings on Approach, Process, and Tools (Preprint)

February 2025

·

2 Reads

BACKGROUND The HIV epidemic is a national focus within the United States (U.S.), Ending the HIV Epidemic (EHE) Initiative aiming to reduce new HIV infections by 90% by 2023 through scaling of prevention and treatment strategies. Data are crucial to understanding HIV-related needs, barriers to care, and the effectiveness of interventions. While several publicly available datasets exist, few integrate multiple topic domains such as HIV outcomes, social determinants of health (SDOH), and community-level factors. The lack of unified data and difficulty linking datasets across these domains hampers efforts to tailor HIV management and treatment strategies. Existing datasets are often siloed and difficult to analyze together, limiting potential for comprehensive research. Combining and integrating data across geographic and thematic strata offers significant potential to better understand the factors influencing HIV outcomes and to optimize intervention strategies. OBJECTIVE This study explores the feasibility of combining public HIV and community data to identify factors influencing HIV outcomes and interventions. We described the process of sourcing, extracting, and linking variables into a unified database to highlight key community factors and demonstrate the potential of this integrated approach for overcoming data access barriers in HIV research. METHODS A team of two researchers was integrated within a larger network to undergo this task. Approximately 350 hours were spent developing a single unified database. The build was conducted across three phases: 1) initial sourcing of datasets, 2) extracting and linking data, and 3) quality control/quality assurance. Experts were consulted to identify relevant data sources, and data was extracted and linked across multiple geographics levels. Quality control included manual spot checks and web scraping with Web Scraper, a third-party tool to verify accuracy of the extracted data. Data were uploaded to SAS for further analysis. RESULTS The resulting database comprises 242 variables sources from eight publicly available datasets. These variables were linked to 104 Ryan White-associated clinics across five U.S. states, covering geographic levels from state to individual clinic. All data were sourced and scraped between August 2022 and January 2023. Following data preparation, descriptive tables were generated, grouped into three domains: environmental variables, HIV-related variables, and implementation outcome variables. The final database and data dictionary are publicly accessible on the Yale Center for Implementation and Prevention Science (CMIPS) website for future research. CONCLUSIONS This integrated HIV and SDOH database overcomes key challenges in accessing fragmented data, offering valuable insights for addressing HIV outcomes. Despite barriers like technology learning curves and data availability, the project successfully created a resource that can inform EHE goals and guide future interventions. Expanding the database will enable deeper analysis and support ongoing effort to end the HIV epidemic.


Socio-demographic characteristics of the study participants (n = 426)
CASS mean score in each domain among participants (n = 426)
CASS mean score in each domain for selected variable (n = 426)
Factors associated with cancer stigma score among participants (n = 426)
Socio-economic factors associated with cancer stigma among apparently healthy women in two selected municipalities Nepal
  • Article
  • Full-text available

December 2024

·

23 Reads

Introduction Cancer is the primary cause of death globally, and despite the significant advancements in treatment and survival rates, it is still stigmatized in many parts of the world. However, there is limited public health research on cancer stigma among the general female population in Nepal. Therefore, this study aims to determine the prevalence of cancer stigma and its associated factors in this group. Methods We conducted a cross-sectional study among 426 healthy women aged 30 to 60 years who were residents of Dhulikhel and Banepa in central Nepal. We measured cancer stigma using the Cancer Stigma Scale (CASS). CASS measures cancer stigma in six domains (awkwardness, avoidance, severity, personal responsibility, policy opposition, financial discrimination) on a 6-point Likert scale (strongly disagree to agree strongly) with higher mean stigma scores correlating with higher levels of stigma. We utilized Generalized Estimating Equations (GEE) with multivariable linear regression to identify the socio-demographic factors associated with the CASS score. Results Overall, the level of cancer stigma was low, with a mean stigma score of 2.6 (0.6), but it was still present among participants. Stigma related to personal responsibility had the highest levels, with a mean score of 3.9 (1.3), followed by severity with a mean score of 3.2 (1.3), and financial discrimination with a mean score of 2.9 (1.6). There was a significant association between the mean CASS score and older age (mean difference in stigma score: 0.11 points; 95% CI: 0.02–0.20) as well as lower education (difference: -0.02 points; 95% CI: -0.03 to -0.003), after adjusting for age, ethnicity, education, marital status, religion, occupation, and parity. Conclusion While overall cancer stigma was low, some domains of stigma were higher among women in a suburban area in central Nepal; thus, indicating that cancer stigma persists in this region despite its low overall prevalence.

Download

Figure 1. The significance level for a Type 2 Hybrid Design (Q=2) using the D/AP Method
Description of Input Parameters Motivated by CIRCL 44,45 (2 Co-Primary Outcomes)
Power and Sample Size Calculations for Cluster Randomized Hybrid Type 2 Effectiveness-Implementation Studies

November 2024

·

25 Reads

Hybrid studies allow investigators to simultaneously study an intervention effectiveness outcome and an implementation research outcome. In particular, type 2 hybrid studies support research that places equal importance on both outcomes rather than focusing on one and secondarily on the other (i.e., type 1 and type 3 studies). Hybrid 2 studies introduce the statistical issue of multiple testing, complicated by the fact that they are typically also cluster randomized trials. Standard statistical methods do not apply in this scenario. Here, we describe the design methodologies available for validly powering hybrid type 2 studies and producing reliable sample size calculations in a cluster-randomized design with a focus on binary outcomes. Through a literature search, 18 publications were identified that included methods relevant to the design of hybrid 2 studies. Five methods were identified, two of which did not account for clustering but are extended in this article to do so, namely the combined outcomes approach and the single 1-degree of freedom combined test. Procedures for powering hybrid 2 studies using these five methods are described and illustrated using input parameters inspired by a study from the Community Intervention to Reduce CardiovascuLar Disease in Chicago (CIRCL-Chicago) Implementation Research Center. In this illustrative example, the intervention effectiveness outcome was controlled blood pressure, and the implementation outcome was reach. The conjunctive test resulted in higher power than the popular p-value adjustment methods, and the newly extended combined outcomes and single 1-DF test were found to be the most powerful among all of the tests.


Barriers and Facilitators to Patient Utilization of Non-Communicable Disease Services in Primary Healthcare Facilities in Nepal: A Qualitative Study

October 2024

·

232 Reads

·

1 Citation

Background The Nepalese government endorsed and implemented the Package of Essential Non-Communicable Disease Interventions (PEN) by the World Health Organization (WHO) to prevent and manage four major non-communicable diseases (NCDs): cardiovascular disease (CVD), diabetes, cancers, and chronic respiratory diseases. This study explored barriers and facilitators to patient utilization of NCD services at primary healthcare facilities in Nepal. Methodology: We conducted a qualitative study with a 35 purposive sample of patients living with one or more NCDs (hypertension, diabetes, chronic obstructive pulmonary disease (COPD/ asthma) who sought healthcare at primary healthcare facilities in 14 randomly selected districts in seven provinces in Nepal that implemented PEN. Trained qualitative researchers conducted in-depth interviews in-person in a private setting using a semi-structured interview guide developed based on the Health Belief Model in the local language. The interviews were audio-recorded, transcribed verbatim, coded inductively and deductively, and analyzed by a framework approach using Dedoose software. Results From the perspectives of patients, key facilitators of service utilization encompassed free medicines, low-cost services, geographical and financial accessibility, less waiting time, positive interactions with health service providers, experiencing improvements in their health conditions, and support from family and peers. Barriers to utilizing services included inadequate health services (e.g., lack of medications and equipment), inaccessibility and affordability, inadequate health-related information from health service providers, low knowledge of NCD care, and lack of reminders or follow ups. Conclusion Enhancing NCD service utilization is potentially attainable through interventions that address patients’ knowledge, self-motivation, and misconceptions. Furthermore, strengthening the availability and accessibility of crucial services such as laboratory investigations, medications, equipment, and the patient-provider relationship is crucial for sustainable implementation of PEN.


Measurement Error Correction for Spatially Defined Environmental Exposures in Survival Analysis

October 2024

·

7 Reads

Environmental exposures are often defined using buffer zones around geocoded home addresses, but these static boundaries can miss dynamic daily activity patterns, leading to biased results. This paper presents a novel measurement error correction method for spatially defined environmental exposures within a survival analysis framework using the Cox proportional hazards model. The method corrects high-dimensional surrogate exposures from geocoded residential data at multiple buffer radii by applying principal component analysis for dimension reduction and leveraging external GPS-tracked validation datasets containing true exposure measurements. It also derives the asymptotic properties and variances of the proposed estimators. Extensive simulations are conducted to evaluate the performance of the proposed estimators, demonstrating its ability to improve accuracy in estimated exposure effects. An illustrative application assesses the impact of greenness exposure on depression incidence in the Nurses' Health Study (NHS). The results demonstrate that correcting for measurement error significantly enhances the accuracy of exposure estimates. This method offers a critical advancement for accurately assessing the health impacts of environmental exposures, outperforming traditional static buffer approaches.


Causal Inference with Double/Debiased Machine Learning for Evaluating the Health Effects of Multiple Mismeasured Pollutants

September 2024

·

27 Reads

One way to quantify exposure to air pollution and its constituents in epidemiologic studies is to use an individual's nearest monitor. This strategy results in potential inaccuracy in the actual personal exposure, introducing bias in estimating the health effects of air pollution and its constituents, especially when evaluating the causal effects of correlated multi-pollutant constituents measured with correlated error. This paper addresses estimation and inference for the causal effect of one constituent in the presence of other PM2.5 constituents, accounting for measurement error and correlations. We used a linear regression calibration model, fitted with generalized estimating equations in an external validation study, and extended a double/debiased machine learning (DML) approach to correct for measurement error and estimate the effect of interest in the main study. We demonstrated that the DML estimator with regression calibration is consistent and derived its asymptotic variance. Simulations showed that the proposed estimator reduced bias and attained nominal coverage probability across most simulation settings. We applied this method to assess the causal effects of PM2.5 constituents on cognitive function in the Nurses' Health Study and identified two PM2.5 constituents, Br and Mn, that showed a negative causal effect on cognitive function after measurement error correction.




Citations (52)


... Enhancing the availability and utilization of NCD screening services necessitates the provision of laboratory investigations, medications, equipment, and healthcare providers, along with fostering a strong patient-provider relationship for sustainable health outcomes. 6 Effective intersectoral and intergovernmental coordination is essential to ensure the success of these campaigns in the timely identification and management of NCDs, ultimately reducing their burden on the population. ...

Reference:

Burden of Non-Communicable Diseases and Emerging Attention in Gandaki Province, Nepal
Barriers and Facilitators to Patient Utilization of Non-Communicable Disease Services in Primary Healthcare Facilities in Nepal: A Qualitative Study

... This study is part of the Barmaids PrEP project in Ubungo Municipality in Dar es Salaam (26). This qualitative case study used a phenomenological approach to explore female barmaids' awareness of HIV, risky behaviors, protection practices and PrEP, and willingness to use long-acting injectable HIV PrEP and delivery models given their experiences with healthcare services using a semi-structured in-depth interview guide (27)(28)(29)(30). ...

Demonstrating service delivery models for effective initiation and retention on pre- exposure prophylaxis (PrEP) among female bar workers in Dar es Salaam, Tanzania: A double randomized intervention study protocol

... Preclinical studies suggest that exercise can partially counteract cisplatin-induced muscle atrophy and restore normal food intake in mice, potentially through the regulation of appetiterelated hormones such as ghrelin (204). Clinical research further supports that increasing physical activity during and after breast cancer chemotherapy improves dietary intake and improves overall quality of life (205). While nutritional therapy alone may not be sufficient to significantly increase muscle mass (206,207) and typically produces slower effects (208), its combination with exercise has been shown to enhance exercise capacity, facilitate muscle adaptation to training, and reduce muscle atrophy (209, 210). ...

Improving lifestyle behaviors during chemotherapy for breast cancer: The Lifestyle, Exercise, and Nutrition Early After Diagnosis (LEANer) Trial

... The mechanisms driving weight gain during cancer treatment remain under investigation. Weight gain during breast cancer treatment is most often observed among patients who undergo chemotherapy, especially when treatment is of longer duration [38,[40][41][42][43], though some studies do challenge this notion and suggest other potential drivers of weight gain [44,45]. Proposed mechanisms linking chemotherapy to weight gain include, reduced energy expenditure (e.g., reduced physical activity and lean muscle mass during treatment), changes in sex hormone concentrations, such as estrogen, due to treatment-associated amenorrhea or menopause, and insulin resistance that promotes fat storage in response to chemotherapy [41]. ...

Post-diagnosis weight trajectories and mortality among women with breast cancer

npj Breast Cancer

... Prior studies in India, consistent with global literature, indicate that peers significantly influence injection initiation, continued substance use, and receipt of harm reduction services among young PWID [33,38,70]. Peer-delivered interventions that leverage strengths of social networks have been utilized to find and link PWID to services and support recovery in many settings [71][72][73][74][75]. Specific to ICCs, social network strategies such as respondent driven sampling methods have been successful in finding and linking PWID to ICCs in our studies [76][77][78]. ...

Overall, Direct, Spillover, and Composite Effects of Components of a Peer-Driven Intervention Package on Injection Risk Behavior Among People Who Inject Drugs in the HPTN 037 Study

AIDS and Behavior

... Nutritional factors may play a pivotal role not only in cancer occurrence but also in mitigating comorbidities and enhancing quality of life [12][13][14]. Even though growing interest in dietary risk factors for cancer has led to numerous investigations into the association between dietary habits and breast cancer [15][16][17], the dietary landscape and its impact on the prognosis of long-term breast cancer survivors remain understudied. Considering the dramatically increased number of breast cancer survivors and the increased comorbidity burden of long-term breast cancer survivors, this study aimed to assess the nutrition status and risk of co-morbid chronic diseases among breast cancer survivors in China. ...

Randomized Trial Evaluating a Self-Guided Lifestyle Intervention Delivered via Evidence-Based Materials versus a Waitlist Group on Changes in Body Weight, Diet Quality, Physical Activity, and Quality of Life among Breast Cancer Survivors

... 13 These interactions with the health system serve as opportunities for health system level interventions to address this social risk, such as engagement in interventions to prevent incarceration (initiation of medications for opioid use disorder, violence intervention programs) or prevent poor outcomes after release (engagement into primary care programs), though screening directly can be stigmatizing. [14][15][16][17] Additionally, systematically implementing broader health systems level interventions, such as medical legal partnerships, and quality of care analyses necessitate an ability to identify those with a history of incarceration within health system information systems, but currently there is no reliable way to do this. ...

Trusted residents and housing assistance to decrease violence exposure in New Haven (TRUE HAVEN): a strengths-based and community-driven stepped-wedge intervention to reduce gun violence

BMC Public Health

... These programs still predominantly rely on conventional cytology [14,15]. The lack of adherence to current guidelines and the procedures for referring patients with abnormal cytological findings to colposcopy are major hurdles in the effectiveness and timeliness of screening and treatment [16]. These observations underscore the continued need for improved strategies to lower barriers to participation, utilize accurate diagnostics, and ensure prompt diagnosis and treatment. ...

Factors associated with receiving results and attending colposcopy in patients with positive HPV screens in Mexico City

Preventive Medicine Reports

... However, most of these methods are not suitable for the main study (MS)/validation study (VS) design, where the MS provides the primary dataset for analysis and the VS provides additional data for measurement error correction (Cai et al., 2023;Hart et al., 2015;Liao et al., 2018;Rosner, Spiegelman and Willett, 1990;Spiegelman, Rosner and Logan, 2000). Unlike spatio-temporal models that use smoothing to estimate ambient air pollution exposure at individual residences, which would be the gold standard if individuals never left their houses, the VS collects personal exposure measurements, representing the actual exposure people receive, whether at home or not (Zhang et al., 2024). ...

Correcting for Bias Due to Mismeasured Exposure History in Longitudinal Studies with Continuous Outcomes

Biometrics

... Our team previously conducted two studies on cancer stigma. The first assessed the validity of the Cancer Stigma Scale (CASS) in the Nepalese context, [27] and the second examined the relationship between cancer stigma and cervical cancer screening uptake, finding that women with higher stigma were less likely to participate in screening [28]. To better understand this, it is important to examine the level and distribution of stigma, identify the domains where stigma is most prevalent, and analyze which socio-economic groups are more affected. ...

Association between cancer stigma and cervical cancer screening uptake among women of Dhulikhel and Banepa, Nepal