Article

Simulation studies of surrogate endpoint validation using single trial and multitrial statistical approaches.

Department of Rheumatology, St. George Hospital, University of New South Wales, Sydney, Australia.
The Journal of Rheumatology (impact factor: 3.69). 04/2007; 34(3):616-9. pp.616-9
Source: PubMed

ABSTRACT A schema was recently proposed for assessing the levels of evidence for surrogate validity that included 4 domains: Target, Study Design, Statistical Strength, and Penalties. This report examines one component of the schema. It surveys the literature on methods of statistical validation of surrogate markers and compares these methods head-to-head using simulated datasets.
Simulated datasets (continuous, multivariate normal) were generated to capture 3 possible relationships of surrogate (S) and true (T) outcome (none, weakly positive, strongly positive) each applied to 4 treatment effects (effect on both surrogate and true outcome, effect on neither, effect on surrogate only, and effect on true outcome only). These datasets were analyzed using single and multitrial statistical approaches, and the results were provided to participants for discussion.
The multitrial surrogate threshold effect seemed to capture best the requirement that surrogate validation is demonstrated by a treatment-associated change in the surrogate predicting a treatment-associated change in the outcome.
There was general agreement that neither a single trial nor any of the single trial statistical methods was adequate to establish surrogate validity. These exercises also showed that summary statistics developed specifically to establish surrogate validity, such as the proportion of the effect explained, were problematic. A sizable statistical research agenda remains, which includes investigating the additional advantage obtained with modeling subject-level data compared to modeling with only trial-level data; and developing and testing multitrial statistical approaches robust to settings with only a few trials.

0 0
 · 
0 Bookmarks
 · 
32 Views
  • Source
    Article: Does chemotherapy improve survival in advanced breast cancer? A statistical overview.
    [show abstract] [hide abstract]
    ABSTRACT: The relative efficacies of cytotoxic chemotherapy regimens in the treatment of advanced breast cancer are generally assessed by comparing response rates in randomised trials. Treatment attempts to prolong survival but trials rarely demonstrate a statistically significant survival advantage: it has been argued that chemotherapy does not prolong survival. The correlation between response rates and survival has been examined by reviewing 79 comparisons between arms with unequal response rates in 50 published trials of chemotherapy in advanced breast cancer. In 73% of comparisons the group with the higher response rate also demonstrated the longer median survival (P less than 0.001). Weighted linear regression showed a statistically significant relationship between relative response rates and survival (P less than 0.001). The number of patients in a comparison did not influence this relationship.
    British Journal of Cancer 07/1988; 57(6):615-8. · 5.04 Impact Factor
  • Article: Statistical validation of intermediate endpoints for chronic diseases.
    [show abstract] [hide abstract]
    ABSTRACT: We discuss the implementation of a criterion due to Prentice for the statistical validation of intermediate endpoints for chronic disease. The criterion involves examining in a cohort or intervention study whether an exposure or intervention effect, adjusted for the intermediate endpoint, is reduced to zero. For example, to examine whether serum cholesterol level is an intermediate endpoint for coronary heart disease (CHD), we may investigate the effect of the cholesterol lowering drug cholestyramine on CHD incidence adjusted for serum cholesterol levels. We show that use of this criterion will usually demand some form of model selection. When the unadjusted exposure or treatment effect is less than four times its standard error, the analysis can usually lead only to a weak form of validation, a conclusion that the data are not inconsistent with the validation criterion. More significant unadjusted exposure effects offer the potential for stronger types of validation statement such as 'the intermediate endpoint explains at least 50 per cent (or 75 per cent) of the exposure effect'.
    Statistics in Medicine 02/1992; 11(2):167-78. · 1.88 Impact Factor
  • Source
    Article: Perspective: validating surrogate markers--are we being naive?
    [show abstract] [hide abstract]
    ABSTRACT: Because of the difficulties in conducting studies of clinical efficacy of new therapies for human immunodeficiency virus infection and other diseases, there is increasing interest in using measures of biologic activity as surrogates for clinical end points. A widely used criterion for evaluating whether such measures are reliable as surrogates requires that the putative surrogate fully captures the "net effect"-the effect aggregated over all mechanisms of action-of the treatment on the clinical end point. The variety of proposed metrics for evaluating the degree to which this criterion is met are subject to misinterpretation because of the multiplicity of mechanisms by which drugs operate. Without detailed understanding of these mechanisms, metrics of "surrogacy" are not directly interpretable. Even when all of the mechanisms are understood, these metrics are associated with a high degree of uncertainty unless either treatment effects are large in moderate-size studies or sample sizes are large in studies of moderately effective treatments.
    The Journal of Infectious Diseases 03/1997; 175(2):237-46. · 6.41 Impact Factor

Full-text

View
2 Downloads
Available from

Keywords

additional advantage
 
capture 3 possible relationships
 
datasets
 
included 4 domains
 
methods head-to-head
 
modeling subject-level data
 
multitrial statistical approaches
 
multitrial surrogate threshold effect
 
report examines
 
Simulated datasets
 
single trial statistical methods
 
sizable statistical research agenda
 
Statistical Strength
 
summary statistics
 
surrogate validation
 
surrogate validity
 
testing multitrial statistical approaches robust
 
trial-level data
 
true outcome only
 
weakly positive