James Carpenter’s research while affiliated with MRC Clinical Trials Unit at UCL and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (287)


Associations of age and sexuality variables with STI reinfection and condom use at 1 year
Age, sex and sexual orientation effects in the Safetxt trial: secondary data analysis of a randomised controlled trial
  • Article
  • Full-text available

November 2024

·

7 Reads

Sexually Transmitted Infections

·

Melissa J Palmer

·

Ford Colin Ian Hickson

·

[...]

·

Background Increasing rates of sexually transmitted infections (STIs) and antimicrobial resistance among young people underscore the urgent need for preventative interventions. Interventions should be evidence-based and tailored to the unique risks and needs associated with varying age, sex and sexual orientation. We used data from the Safetxt trial to explore whether young people’s age, sex and sexual orientation influence (1) their risk of STI reinfection and condom use and (2) the effect of the Safetxt intervention on STI reinfection and condom use. Methods We conducted exploratory secondary analyses of data from the Safetxt trial that evaluated a theory-based digital sexual health intervention tailored according to sex and sexual orientation. We recruited 6248 young people with STIs from 92 UK sexual health clinics and assessed outcomes after 1 year, including the cumulative incidence of STI reinfection and condom use at last sex. We used adjusted logistic regression and margins plots to visualise effect modification. Results There were differences in STI reinfection and condom use by age, sex and sexuality. Age was associated with STI reinfection (OR 0.90, 95% CI 0.87 to 0.94) with evidence for interaction between age and sexuality (p<0.001). Our findings suggest that the risk of STI reinfection decreases with age among young heterosexuals but increases among men-who-have-sex-with-men (MSM). Overall, MSM had the highest likelihood of reinfection (OR 3.53, 95% CI 2.66 to 4.68) despite being more likely to use condoms (OR 1.50, 95% CI 1.18 to 1.91). Among MSM, age modified the intervention effect on condom use at 1 year with highest benefits among participants aged 16–18, moderate to minor benefits among those aged 18–21 and no effect among participants aged 22–24 years. Conclusions Future digital health interventions tailored for diverse sexuality groups need to target young people early enough to have an impact on sexual behaviour. Specific novel interventions are needed for older MSM. Trial registration number ISRCTN64390461 .

Download

Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random

October 2024

·

13 Reads

·

1 Citation

Background Epidemiological and clinical studies often have missing data, frequently analysed using multiple imputation (MI). In general, MI estimates will be biased if data are missing not at random (MNAR). Bias due to data MNAR can be reduced by including other variables (“auxiliary variables”) in imputation models, in addition to those required for the substantive analysis. Common advice is to take an inclusive approach to auxiliary variable selection (i.e. include all variables thought to be predictive of missingness and/or the missing values). There are no clear guidelines about the impact of this strategy when data may be MNAR. Methods We explore the impact of including an auxiliary variable predictive of missingness but, in truth, unrelated to the partially observed variable, when data are MNAR. We quantify, algebraically and by simulation, the magnitude of the additional bias of the MI estimator for the exposure coefficient (fitting either a linear or logistic regression model), when the (continuous or binary) partially observed variable is either the analysis outcome or the exposure. Here, “additional bias” refers to the difference in magnitude of the MI estimator when the imputation model includes (i) the auxiliary variable and the other analysis model variables; (ii) just the other analysis model variables, noting that both will be biased due to data MNAR. We illustrate the extent of this additional bias by re-analysing data from a birth cohort study. Results The additional bias can be relatively large when the outcome is partially observed and missingness is caused by the outcome itself, and even larger if missingness is caused by both the outcome and the exposure (when either the outcome or exposure is partially observed). Conclusions When using MI, the naïve and commonly used strategy of including all available auxiliary variables should be avoided. We recommend including the variables most predictive of the partially observed variable as auxiliary variables, where these can be identified through consideration of the plausible casual diagrams and missingness mechanisms, as well as data exploration (noting that associations with the partially observed variable in the complete records may be distorted due to selection bias).


Unleashing the full potential of digital outcome measures in clinical trials: eight questions that need attention

September 2024

·

14 Reads

BMC Medicine

The use of digital health technologies to measure outcomes in clinical trials opens new opportunities as well as methodological challenges. Digital outcome measures may provide more sensitive and higher-frequency measurements but pose vital statistical challenges around how such outcomes should be defined and validated and how trials incorporating digital outcome measures should be designed and analysed. This article presents eight methodological questions, exploring issues such as the length of measurement period, choice of summary statistic and definition and handling of missing data as well as the potential for new estimands and new analyses to leverage the time series data from digital devices. The impact of key issues highlighted by the eight questions on a primary analysis of a trial are illustrated through a simulation study based on the 2019 Bellerophon INOPulse trial which had time spent in MVPA as a digital outcome measure. These eight questions present broad areas where methodological guidance is needed to enable wider uptake of digital outcome measures in trials.


The elements of the metadata model captured in NHS England’s Central Metastore, reproduced from the study’s operating manual/guidance.23,24
business lineage view of the Admitted Patient Care (APC), Outpatients (OP), and Critical Care (CC) datasets of Hospital Episode Statistics (HES). 23 The data journey from left (submission from hospitals using business rules via an XML schema) to right (production stage where derivations and processing rules are applied) to form the final releasable HES schema containing APC, OP and CC tables in the data access environment (far right).
Field level lineage of the Civil Registration of Deaths (CRD) with an example of derivation to confirm NHS number. 23 The data journey moves from left (submission via Message Exchange for Social Care and Health) to right (production stage, then to releasable in the data access environment).
Example of data item details captured in the Central Metastore at NHS England (diagnosis code, Hospital Episode Statistics Admitted Patient Care).
Demonstrating the data integrity of routinely collected healthcare systems data for clinical trials (DEDICaTe): A proof-of-concept study

September 2024

·

46 Reads

Introduction/aims: Healthcare systems data (also known as real-world or routinely collected health data) could transform the conduct of clinical trials. Demonstrating integrity and provenance of these data is critical for clinical trials, to enable their use where appropriate and avoid duplication using scarce trial resources. Building on previous work, this proof-of-concept study used a data intelligence tool, the “Central Metastore,” to provide metadata and lineage information of nationally held data. Methods: The feasibility of NHS England’s Central Metastore to capture detailed records of the origins, processes, and methods that produce four datasets was assessed. These were England’s Hospital Episode Statistics (Admitted Patient Care, Outpatients, Critical Care) and the Civil Registration of Deaths (England and Wales). The process comprised: information gathering; information ingestion using the tool; and auto-generation of lineage diagrams/content to show data integrity. A guidance document to standardise this process was developed. Results/Discussion: The tool can ingest, store and display data provenance in sufficient detail to support trust and transparency in using these datasets for trials. The slowest step was information gathering from multiple sources, so consistency in record-keeping is essential.



Breakdown of the number of in-person attendees at the event by background (top left) and by institution (top right). Perspectives that were not represented in the event, and would be important to have representation for future events, are indicated in the grey pie chart (bottom left). PPI, Patient and Public Involvement; RDS, National Institute of Health and Care Research (NIHR) Research Support Service; CRO, Clinical Research Organization
Perspectives of two PPI contributors
Details of Knowledge Exchange event
Digital endpoints in clinical trials: emerging themes from a multi-stakeholder Knowledge Exchange event

August 2024

·

63 Reads

·

1 Citation

Trials

Background Digital technologies, such as wearable devices and smartphone applications (apps), can enable the decentralisation of clinical trials by measuring endpoints in people’s chosen locations rather than in traditional clinical settings. Digital endpoints can allow high-frequency and sensitive measurements of health outcomes compared to visit-based endpoints which provide an episodic snapshot of a person’s health. However, there are underexplored challenges in this emerging space that require interdisciplinary and cross-sector collaboration. A multi-stakeholder Knowledge Exchange event was organised to facilitate conversations across silos within this research ecosystem. Methods A survey was sent to an initial list of stakeholders to identify potential discussion topics. Additional stakeholders were identified through iterative discussions on perspectives that needed representation. Co-design meetings with attendees were held to discuss the scope, format and ethos of the event. The event itself featured a cross-disciplinary selection of talks, a panel discussion, small-group discussions facilitated via a rolling seating plan and audience participation via Slido. A transcript was generated from the day, which, together with the output from Slido, provided a record of the day’s discussions. Finally, meetings were held following the event to identify the key challenges for digital endpoints which emerged and reflections and recommendations for dissemination. Results Several challenges for digital endpoints were identified in the following areas: patient adherence and acceptability; algorithms and software for devices; design, analysis and conduct of clinical trials with digital endpoints; the environmental impact of digital endpoints; and the need for ongoing ethical support. Learnings taken for next generation events include the need to include additional stakeholder perspectives, such as those of funders and regulators, and the need for additional resources and facilitation to allow patient and public contributors to engage meaningfully during the event. Conclusions The event emphasised the importance of consortium building and highlighted the critical role that collaborative, multi-disciplinary, and cross-sector efforts play in driving innovation in research design and strategic partnership building moving forward. This necessitates enhanced recognition by funders to support multi-stakeholder projects with patient involvement, standardised terminology, and the utilisation of open-source software.


Handling Partially Observed Trial Data After Treatment Withdrawal: Introducing Retrieved Dropout Reference‐Base Centred Multiple Imputation

July 2024

·

4 Reads

Pharmaceutical Statistics

The ICH E9(R1) Addendum (International Council for Harmonization 2019) suggests treatment‐policy as one of several strategies for addressing intercurrent events such as treatment withdrawal when defining an estimand. This strategy requires the monitoring of patients and collection of primary outcome data following termination of randomised treatment. However, when patients withdraw from a study early before completion this creates true missing data complicating the analysis. One possible way forward uses multiple imputation to replace the missing data based on a model for outcome on‐ and off‐treatment prior to study withdrawal, often referred to as retrieved dropout multiple imputation. This article introduces a novel approach to parameterising this imputation model so that those parameters which may be difficult to estimate have mildly informative Bayesian priors applied during the imputation stage. A core reference‐based model is combined with a retrieved dropout compliance model, using both on‐ and off‐treatment data, to form an extended model for the purposes of imputation. This alleviates the problem of specifying a complex set of analysis rules to accommodate situations where parameters which influence the estimated value are not estimable, or are poorly estimated leading to unrealistically large standard errors in the resulting analysis. We refer to this new approach as retrieved dropout reference‐base centred multiple imputation.


Figure 1: V3+ framework from the Digital Medicine Society. Original source: Bakker et al. [2024], reprinted with permission.
Figure 2: Seasonality. Plots show estimated mean of treatment effect (top) and its standard error (bottom) without interaction between season and treatment (left) and with an interaction (right). The seasonal effect varies between 0 and 10. Error bars indicate 1.96× Monte Carlo error. The black confidence interval indicates the scenario under no seasonal effect. Dark blue, purple and teal lines indicate that the proportion of patients recruited in winter are 0.1, 0.2 and 0.5, respectively. Results are based on 10,000 simulations.
Figure 4: Scenario 3 (a): Data missing completely at random. Plots show the change in the estimated mean of treatment effect (left) and its standard error (right) as the proportion of days that are missing completely at random varies between 0.05 and 9.5. Error bars indicate 1.96× Monte Carlo error. The black confidence interval indicates the scenario under complete data. Dark green, purple and light green lines indicate that the proportion of patients with missing data are 0.1, 0.2 and 0.5, respectively. Results are based on 10,000 simulations.
Unleashing the Full Potential of Digital Endpoints: Eight Questions that Need Attention

June 2024

·

156 Reads

The use of Digital Health Technologies to measure endpoints in clinical trials opens new opportunities as well as methodological challenges. Digital endpoints may provide more sensitive and higher-frequency measurements of participants' health outcomes, but pose vital statistical challenges around how such endpoints should be defined and validated and how trials incorporating digital endpoints should be designed and analysed. This article presents eight methodological questions, exploring issues around decision making on key issues such as the length of measurement period, choice of summary statistic and definition and handling of missing data, as well as the potential for new estimands and new analyses to leverage the time series data from digital devices. A selection of the challenges are illustrated through a simulation study based on the 2019 Bellerophon INOPulse trial which used time spent in MVPA as a digital endpoint. These eight questions present broad areas where methodological guidance is needed to enable wider uptake of digital endpoints in trials.


Comments on ‘standard and reference‐based conditional mean imputation’: Regulators and trial statisticians be aware!

April 2024

·

13 Reads

·

1 Citation

Pharmaceutical Statistics

Accurate frequentist performance of a method is desirable in confirmatory clinical trials, but is not sufficient on its own to justify the use of a missing data method. Reference‐based conditional mean imputation, with variance estimation justified solely by its frequentist performance, has the surprising and undesirable property that the estimated variance becomes smaller the greater the number of missing observations; as explained under jump‐to‐reference it effectively forces the true treatment effect to be exactly zero for patients with missing data.


Assessing efficacy in non-inferiority trials with non-adherence to interventions: Are intention-to-treat and per-protocol analyses fit for purpose?

April 2024

·

18 Reads

·

1 Citation

Statistics in Medicine

Background Non‐inferiority trials comparing different active drugs are often subject to treatment non‐adherence. Intention‐to‐treat (ITT) and per‐protocol (PP) analyses have been advocated in such studies but are not guaranteed to be unbiased in the presence of differential non‐adherence. Methods The REMoxTB trial evaluated two 4‐month experimental regimens compared with a 6‐month control regimen for newly diagnosed drug‐susceptible TB. The primary endpoint was a composite unfavorable outcome of treatment failure or recurrence within 18 months post‐randomization. We conducted a simulation study based on REMoxTB to assess the performance of statistical methods for handling non‐adherence in non‐inferiority trials, including: ITT and PP analyses, adjustment for observed adherence, multiple imputation (MI) of outcomes, inverse‐probability‐of‐treatment weighting (IPTW), and a doubly‐robust (DR) estimator. Results When non‐adherence differed between trial arms, ITT, and PP analyses often resulted in non‐trivial bias in the estimated treatment effect, which consequently under‐ or over‐inflated the type I error rate. Adjustment for observed adherence led to similar issues, whereas the MI, IPTW and DR approaches were able to correct bias under most non‐adherence scenarios; they could not always eliminate bias entirely in the presence of unobserved confounding. The IPTW and DR methods were generally unbiased and maintained desired type I error rates and statistical power. Conclusions When non‐adherence differs between trial arms, ITT and PP analyses can produce biased estimates of efficacy, potentially leading to the acceptance of inferior treatments or efficacious regimens being missed. IPTW and the DR estimator are relatively straightforward methods to supplement ITT and PP approaches.


Citations (66)


... Memantine is also a possibility although it was neutral in MND-SMART. 42 A recent network metanalysis provides mixed results as to whether AChE-I improve cognition in VCI/VaD. 43 However, it remains unclear whether AChE-i and memantine only modulate symptoms in AD, also have some disease modifying/neuroprotective properties or even address AD pathology which may often be present in VCI/VaD. ...

Reference:

Developing treatments for cerebral small vessel disease: a scoping review of licensed interventions for potential repurposing
Safety and efficacy of memantine and trazodone versus placebo for motor neuron disease (MND SMART): stage two interim analysis from the first cycle of a phase 3, multiarm, multistage, randomised, adaptive platform trial
  • Citing Article
  • September 2024

The Lancet Neurology

... While there are some design and analysis strategies to mitigate consequences illustrated in the simulation study, further methodological work and guidance is needed. Approaches to reduce the impact of seasonality at the design stage include recruiting at appropriate times of the year and stratifying for season in the randomisation scheme [62]. Furthermore, treatment imbalance within seasons could exacerbate the impact of a potential seasonal effect. ...

Digital endpoints in clinical trials: emerging themes from a multi-stakeholder Knowledge Exchange event

Trials

... However, a recent review into NI antibiotic trials suggested ITT may sometimes be the more conservative which the authors state may be due to lower success rates and larger variance in the ITT analysis [44]. Given this uncertainty, research suggests using additional analysis populations (inverse-probability-of-treatment weightings and doubly-robust estimators) to supplement the ITT and PP analyses [45] to enhance the robustness of results. One key consideration should be the estimand of interest to inform the most appropriate research question for the NI trials and help interpret if there are any differences between the results of analysis populations [43]. ...

Assessing efficacy in non-inferiority trials with non-adherence to interventions: Are intention-to-treat and per-protocol analyses fit for purpose?
  • Citing Article
  • April 2024

Statistics in Medicine

... To be confident in RCTs' results, outcome data need to be as accurate, complete, and unbiased as possible, resulting in valid and reliable estimates of treatments' effects. For RCTs using HCSD to identify outcomes, utility assessments comparing HCSD to RCTs' alternative primary methods of outcome detection should inform whether HCSD meet these requirements, but few such assessments exist in the UK or elsewhere [14,15]. A systematic review of ten RCTs of cardiovascular event prevention found good agreement between HCSD and adjudicated outcomes for death and some cardiovascular outcomes, with similar directions and magnitudes of treatment effects [16]. ...

Getting our ducks in a row: The need for data utility comparisons of healthcare systems data for clinical trials
  • Citing Article
  • March 2024

Contemporary Clinical Trials

... This is the strategy we adopted in the simulation study where the auxiliary was a complete variable. Such a strategy may also be useful in settings where the auxiliary variable should not be included in the outcome imputation model because doing so may cause bias due to the causal structure, e.g. by opening a collider path [42,43]. However, it is unclear whether this strategy violates compatibility when the auxiliary variable has missing data and needs to be imputed [6]. ...

Multiple imputation of missing data under missing at random: including a collider as an auxiliary variable in the imputation model can induce bias

Frontiers in Epidemiology

... It is a dynamic clinical trial that uses a master protocol (i.e., one overarching protocol) to evaluate multiple treatments in an ongoing manner, where new treatments can be added to the platform as they become available, and existing ones can leave the platform based on a finding of futility or efficacy. Such a trial structure not only streamlines operations but also has the potential to improve statistical efficiency by facilitating the sharing of data between different treatment evaluations (Mehta et al., 2023). The effectiveness of platform trials in addressing urgent medical challenges quickly is evidenced by the hallmark RECOVERY platform trial, which was designed to rapidly evaluate COVID-19 therapies during the pandemic (Normand, 2021;Buenconsejo et al., 2023). ...

The role of placebo control in clinical trials for neurodegenerative diseases
  • Citing Article
  • September 2023

Nature Medicine

... For example in 2014, the item asked, 'During 2013, how much income did you receive from wages, salary, commissions or tips from all jobs, before deductions for taxes or anything else?' Using the respondents' age, we were able to determine their income each year from age 18 to 36, which was used to calculate each respondent's annual average income between age 18 to 36. Due to an inherent interest in maintaining the sample size, gaps in income were address through the implementation of random forests multiple imputation (Carpenter et al. 2023). Exactly 8,226 individuals reported their income more than 2 times during the study period, 6,681 reported their income more than 7 times during the study period and 4,360 reported their income more than 10 times during the study period. ...

Multiple Imputation and its Application 2e
  • Citing Book
  • August 2023

... Finally, two main strategies to deal with non-wear periods are to omit or impute data during these periods. Several imputation techniques have been proposed [26][27][28], but there is no clear evidence of simple or more sophisticated methods to perform better, the critical point being proceeding with an imputation at the epoch level [23,29,30]. ...

Multiple imputation approaches for epoch-level accelerometer data in trials
  • Citing Article
  • July 2023

... On one hand, owing to factors such as seasonal temperature variations and vehicular dynamic loads, the tunnel SHM data is nonlinearity, displaying periodic patterns [13,19,24]. Conventional imputation methods struggle to capture the nonlinear evolutionary features within the data, particularly the complexities introduced by continuous missing data cases, and the magnitude of accuracy degradation increases with the extent of continuous missing percent [9]. On the other hand, each time series records the mechanical information of a singular measurement point [5]. ...

Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified
  • Citing Article
  • June 2023

Journal of Clinical Epidemiology

... As Rehal et al. recently reported "a key aspect of the estimand framework is that missing data is a problem for the estimator not the estimand…. missing data is not viewed as an [ICE per se] but there may also be missing data as a consequence of an [ICE]" [italics added] [4]. ...

Handling intercurrent events and missing data in non-inferiority trials using the estimand framework: A tuberculosis case study
  • Citing Article
  • June 2023

Clinical Trials