Thomas H. Scheike

University of Copenhagen, Copenhagen, Capital Region, Denmark

Are you Thomas H. Scheike?

Claim your profile

Publications (35)45.21 Total impact

  • Article: Competing risks with missing covariates: effect of haplotypematch on hematopoietic cell transplant patients.
    [show abstract] [hide abstract]
    ABSTRACT: In this paper we consider a problem from hematopoietic cell transplant (HCT) studies where there is interest on assessing the effect of haplotype match for donor and patient on the cumulative incidence function for a right censored competing risks data. For the HCT study, donor's and patient's genotype are fully observed and matched but their haplotypes are missing. In this paper we describe how to deal with missing covariates of each individual for competing risks data. We suggest a procedure for estimating the cumulative incidence functions for a flexible class of regression models when there are missing data, and establish the large sample properties. Small sample properties are investigated using simulations in a setting that mimics the motivating haplotype matching problem. The proposed approach is then applied to the HCT study.
    Lifetime Data Analysis 09/2012; · 0.92 Impact Factor
  • Article: Absolute risk regression for competing risks: interpretation, link functions, and prediction.
    Thomas A Gerds, Thomas H Scheike, Per K Andersen
    [show abstract] [hide abstract]
    ABSTRACT: In survival analysis with competing risks, the transformation model allows different functions between the outcome and explanatory variables. However, the model's prediction accuracy and the interpretation of parameters may be sensitive to the choice of link function. We review the practical implications of different link functions for regression of the absolute risk (or cumulative incidence) of an event. Specifically, we consider models in which the regression coefficients β have the following interpretation: The probability of dying from cause D during the next t years changes with a factor exp(β) for a one unit change of the corresponding predictor variable, given fixed values for the other predictor variables. The models have a direct interpretation for the predictive ability of the risk factors. We propose some tools to justify the models in comparison with traditional approaches that combine a series of cause-specific Cox regression models or use the Fine-Gray model. We illustrate the methods with the use of bone marrow transplant data. Copyright © 2012 John Wiley & Sons, Ltd.
    Statistics in Medicine 08/2012; · 1.88 Impact Factor
  • Article: On cross-odds ratio for multivariate competing risks data.
    Thomas H Scheike, Yanqing Sun
    [show abstract] [hide abstract]
    ABSTRACT: The cross-odds ratio is defined as the ratio of the conditional odds of the occurrence of one cause-specific event for one subject given the occurrence of the same or a different cause-specific event for another subject in the same cluster over the unconditional odds of occurrence of the cause-specific event. It is a measure of the association between the correlated cause-specific failure times within a cluster. The joint cumulative incidence function can be expressed as a function of the marginal cumulative incidence functions and the cross-odds ratio. Assuming that the marginal cumulative incidence functions follow a generalized semiparametric model, this paper studies the parametric regression modeling of the cross-odds ratio. A set of estimating equations are proposed for the unknown parameters and the asymptotic properties of the estimators are explored. Non-parametric estimation of the cross-odds ratio is also discussed. The proposed procedures are applied to the Danish twin data to model the associations between twins in their times to natural menopause and to investigate whether the association differs among monozygotic and dizygotic twins and how these associations have changed over time.
    Biostatistics 06/2012; 13(4):680-94. · 2.14 Impact Factor
  • Article: The additive risk model for estimation of effect of haplotype match in BMT studies.
    Thomas H Scheike, Torben Martinussen, Mei-Jie Zhang
    [show abstract] [hide abstract]
    ABSTRACT: In this paper we consider a problem from bone marrow transplant (BMT) studies where there is interest on assessing the effect of haplotype match for donor and patient on the overall survival. The BMT study we consider is based on donors and patients that are genotype matched, and this therefore leads to a missing data problem. We show how Aalen's additive risk model can be applied in this setting with the benefit that the time-varying haplo-match effect can be easily studied. This problem has not been considered before, and the standard approach where one would use the EM-algorithm cannot be applied for this model because the likelihood is hard to evaluate without additional assumptions. We suggest an approach based on multivariate estimating equations that are solved using a recursive structure. This approach leads to an estimator where the large sample properties can be developed using product-integration theory. Small sample properties are investigated using simulations in a setting that mimics the motivating haplo-match problem.
    Scandinavian Journal of Statistics 09/2011; 38(3):409-423. · 1.12 Impact Factor
  • Source
    Article: Analyzing Competing Risk Data Using the R timereg Package.
    Thomas H Scheike, Mei-Jie Zhang
    [show abstract] [hide abstract]
    ABSTRACT: In this paper we describe flexible competing risks regression models using the comp.risk() function available in the timereg package for R based on Scheike et al. (2008). Regression models are specified for the transition probabilities, that is the cumulative incidence in the competing risks setting. The model contains the Fine and Gray (1999) model as a special case. This can be used to do goodness-of-fit test for the subdistribution hazards' proportionality assumption (Scheike and Zhang 2008). The program can also construct confidence bands for predicted cumulative incidence curves.We apply the methods to data on follicular cell lymphoma from Pintilie (2007), where the competing risks are disease relapse and death without relapse. There is important non-proportionality present in the data, and it is demonstrated how one can analyze these data using the flexible regression models.
    Journal of statistical software 01/2011; 38(2). · 4.01 Impact Factor
  • Source
    Article: A semiparametric random effects model for multivariate competing risks data
    [show abstract] [hide abstract]
    ABSTRACT: We propose a semiparametric random effects model for multivariate competing risks data when the failures of a particular type are of interest. Under this model, the marginal cumulative incidence functions follow a generalized semiparametric additive model. The associations between the cause-specific failure times can be studied through dependence parameters of copula functions that are allowed to depend on cluster-level covariates. A cross-odds ratio-type measure is proposed to describe the associations between cause-specific failure times, and its relationship to the dependence parameters is explored. We develop a two-stage estimation procedure where the marginal models are estimated in the first stage and the dependence parameters are estimated in the second stage. The large sample properties of the proposed estimators are derived. The proposed procedures are applied to Danish twin data to model the cumulative incidence for the age of natural menopause and to investigate the association in the onset of natural menopause between monozygotic and dizygotic twins. Copyright 2010, Oxford University Press.
    Biometrika 01/2010; 97(1):133-145. · 1.91 Impact Factor
  • Article: Estimating haplotype effects for survival data.
    Thomas H Scheike, Torben Martinussen, Jeremy D Silver
    [show abstract] [hide abstract]
    ABSTRACT: Genetic association studies often investigate the effect of haplotypes on an outcome of interest. Haplotypes are not observed directly, and this complicates the inclusion of such effects in survival models. We describe a new estimating equations approach for Cox's regression model to assess haplotype effects for survival data. These estimating equations are simple to implement and avoid the use of the EM algorithm, which may be slow in the context of the semiparametric Cox model with incomplete covariate information. These estimating equations also lead to easily computable, direct estimators of standard errors, and thus overcome some of the difficulty in obtaining variance estimators based on the EM algorithm in this setting. We also develop an easily implemented goodness-of-fit procedure for Cox's regression model including haplotype effects. Finally, we apply the procedures presented in this article to investigate possible haplotype effects of the PAF-receptor on cardiovascular events in patients with coronary artery disease, and compare our results to those based on the EM algorithm.
    Biometrics 09/2009; 66(3):705-15. · 1.83 Impact Factor
  • Article: Flexible survival regression modelling.
    [show abstract] [hide abstract]
    ABSTRACT: Regression analysis of survival data, and more generally event history data, is typically based on Cox's regression model. We here review some recent methodology, focusing on the limitations of Cox's regression model. The key limitation is that the model is not well suited to represent time-varying effects. We start by considering classical and also more recent goodness-of-fit procedures for the Cox model that will reveal when the Cox model does not capture important aspects of the data, such as time-varying effects. We present recent regression models that are able to deal with and describe such time-varying effects. The introduced models are all applied to data on breast cancer from the Norwegian cancer registry, and these analyses clearly reveal the shortcomings of Cox's regression model and the need for other supplementary analyses with models such as those we present here.
    Statistical Methods in Medical Research 08/2009; 19(1):5-28. · 2.44 Impact Factor
  • Chapter: Failure Time Analysis
    03/2009: pages 259 - 277; , ISBN: 9780470744116
  • Source
    Article: Covariate Selection for the Semiparametric Additive Risk Model
    TORBEN MARTINUSSEN, THOMAS H. SCHEIKE
    [show abstract] [hide abstract]
    ABSTRACT: This paper considers covariate selection for the additive hazards model. This model is particularly simple to study theoretically and its practical implementation has several major advantages to the similar methodology for the proportional hazards model. One complication compared with the proportional model is, however, that there is no simple likelihood to work with. We here study a least squares criterion with desirable properties and show how this criterion can be interpreted as a prediction error. Given this criterion, we define ridge and Lasso estimators as well as an adaptive Lasso and study their large sample properties for the situation where the number of covariates "p" is smaller than the number of observations. We also show that the adaptive Lasso has the oracle property. In many practical situations, it is more relevant to tackle the situation with large "p" compared with the number of observations. We do this by studying the properties of the so-called Dantzig selector in the setting of the additive risk model. Specifically, we establish a bound on how close the solution is to a true sparse signal in the case where the number of covariates is large. In a simulation study, we also compare the Dantzig and adaptive Lasso for a moderate to small number of covariates. The methods are applied to a breast cancer data set with gene expression recordings and to the primary biliary cirrhosis clinical data. Copyright (c) 2009 Board of the Foundation of the Scandinavian Journal of Statistics.
    Scandinavian Journal of Statistics 01/2009; 36(4):602-619. · 1.12 Impact Factor
  • Article: Dynamic regression hazards models for relative survival.
    Giuliana Cortese, Thomas H Scheike
    [show abstract] [hide abstract]
    ABSTRACT: A natural way of modelling relative survival through regression analysis is to assume an additive form between the expected population hazard and the excess hazard due to the presence of an additional cause of mortality. Within this context, the existing approaches in the parametric, semiparametric and non-parametric setting are compared and discussed. We study the additive excess hazards models, where the excess hazard is on additive form. This makes it possible to assess the importance of time-varying effects for regression models in the relative survival framework. We show how recent developments can be used to make inferential statements about the non-parametric version of the model. This makes it possible to test the key hypothesis that an excess risk effect is time varying in contrast to being constant over time. In case some covariate effects are constant, we show how the semiparametric additive risk model can be considered in the excess risk setting, providing a better and more useful summary of the data. Estimators have explicit form and inference based on a resampling scheme is presented for both the non-parametric and semiparametric models. We also describe a new suggestion for goodness of fit of relative survival models, which consists on statistical and graphical tests based on cumulative martingale residuals. This is illustrated on the semiparametric model with proportional excess hazards. We analyze data from the TRACE study using different approaches and show the need for more flexible models in relative survival.
    Statistics in Medicine 08/2008; 27(18):3563-84. · 1.88 Impact Factor
  • Article: Modeling cumulative incidence function for competing risks data.
    Mei-Jie Zhang, Xu Zhang, Thomas H Scheike
    [show abstract] [hide abstract]
    ABSTRACT: A frequent occurrence in medical research is that a patient is subject to different causes of failure, where each cause is known as a competing risk. The cumulative incidence curve is a proper summary curve, showing the cumulative failure rates over time due to a particular cause. A common question in medical research is to assess the covariate effects on a cumulative incidence function. The standard approach is to construct regression models for all cause-specific hazard rate functions and then model a covariate-adjusted cumulative incidence curve as a function of all cause-specific hazards for a given set of covariates. New methods have been proposed in recent years, emphasizing direct assessment of covariate effects on cumulative incidence function. Fine and Gray proposed modeling the effects of covariates on a subdistribution hazard function. A different approach is to directly model a covariate-adjusted cumulative incidence function, including a pseudovalue approach by Andersen and Klein and a direct binomial regression by Scheike, Zhang and Gerds. In this paper, we review the standard and new regression methods for modeling a cumulative incidence function, and give the sources of computer packages/programs that implement these regression models. A real bone marrow transplant data set is analyzed to illustrate various regression methods.
    Expert Review of Clinical Pharmacology 05/2008; 1(3):391-400.
  • Article: Time trends in human fecundability in Sweden.
    [show abstract] [hide abstract]
    ABSTRACT: Trends in biologic fertility are elusive. Possible negative trends in male reproductive health are still debated, and their effect on human fertility might be negligible. Time-to-pregnancy (TTP) is a functional measure of couple fecundability. We analyzed data on TTP among 832,000 primiparous women 20 years of age and older in the nationwide Swedish Medical Birth Registry from 1983 through 2002. This age restriction led to an exclusion of 10% of primiparous pregnancies. Subfertility (TTP > or =1 year) was analyzed as a function of maternal age, calendar time at initiation of attempt, and birth cohort-taking into account the truncation problems that are inherent in birth-based retrospective sampling. Subfertility generally decreased over successive birth cohorts. When studied as a period effect, a transient increase in subfertility was seen in the early 1990s. Subfertility increased with age, except that for women in their late 1930s, an apparent decrease was observed, particularly among the early cohorts. We found decreasing subfertility over time. We speculate that these patterns might be related to a Sweden-specific decrease over time in sexually transmitted diseases, to changes in sexual behavior induced by socioeconomic conditions, or to broader biologic or educational trends.
    Epidemiology 04/2008; 19(2):191-6. · 5.57 Impact Factor
  • Article: The Mizon-Richard encompassing test for the Cox and Aalen additive hazards models.
    Torben Martinussen, Odd O Aalen, Thomas H Scheike
    [show abstract] [hide abstract]
    ABSTRACT: The Cox hazards model (Cox, 1972, Journal of the Royal Statistical Society, Series B34, 187-220) for survival data is routinely used in many applied fields, sometimes, however, with too little emphasis on the fit of the model. A useful alternative to the Cox model is the Aalen additive hazards model (Aalen, 1980, in Lecture Notes in Statistics-2, 1-25) that can easily accommodate time changing covariate effects. It is of interest to decide which of the two models that are most appropriate to apply in a given application. This is a nontrivial problem as these two classes of models are nonnested except only for special cases. In this article we explore the Mizon-Richard encompassing test for this particular problem. It turns out that it corresponds to fitting of the Aalen model to the martingale residuals obtained from the Cox regression analysis. We also consider a variant of this method, which relates to the proportional excess model (Martinussen and Scheike, 2002, Biometrika 89, 283-298). Large sample properties of the suggested methods under the two rival models are derived. The finite-sample properties of the proposed procedures are assessed through a simulation study. The methods are further applied to the well-known primary biliary cirrhosis data set.
    Biometrics 04/2008; 64(1):164-71. · 1.83 Impact Factor
  • Source
    Article: Predicting cumulative incidence probability by direct binomial regression
    Thomas H. Scheike, Mei-Jie Zhang, Thomas A. Gerds
    [show abstract] [hide abstract]
    ABSTRACT: We suggest a new simple approach for estimation and assessment of covariate effects for the cumulative incidence curve in the competing risks model. We consider a semiparametric regression model where some effects may be time-varying and some may be constant over time. Our estimator can be implemented by standard software. Our simulation study shows that the estimator works well and has finite-sample properties comparable with the subdistribution approach. We apply the method to bone marrow transplant data and estimate the cumulative incidence of death in complete remission following a bone marrow transplantation. Here death in complete remission and relapse are two competing events. Copyright 2008, Oxford University Press.
    Biometrika 01/2008; 95(1):205-220. · 1.91 Impact Factor
  • Source
    Article: Maximum likelihood estimation for tied survival data under Cox regression model via EM-algorithm.
    Thomas H Scheike, Yanqing Sun
    [show abstract] [hide abstract]
    ABSTRACT: We consider tied survival data based on Cox proportional regression model. The standard approaches are the Breslow and Efron approximations and various so called exact methods. All these methods lead to biased estimates when the true underlying model is in fact a Cox model. In this paper we review the methods and suggest a new method based on the missing-data principle using EM-algorithm that leads to a score equation that can be solved directly. This score has mean zero. We also show that all the considered methods have the same asymptotic properties and that there is no loss of asymptotic efficiency when the tie sizes are bounded or even converge to infinity at a given rate. A simulation study is conducted to compare the finite sample properties of the methods.
    Lifetime Data Analysis 10/2007; 13(3):399-420. · 0.92 Impact Factor
  • Article: Direct Modelling of Regression Effects for Transition Probabilities in Multistate Models
    THOMAS H. SCHEIKE, MEI-JIE ZHANG
    [show abstract] [hide abstract]
    ABSTRACT: A simple and standard approach for analysing multistate model data is to model all transition intensities and then compute a summary measure such as the transition probabilities based on this. This approach is relatively simple to implement but it is difficult to see what the covariate effects are on the scale of interest. In this paper, we consider an alternative approach that directly models the covariate effects on transition probabilities in multistate models. Our new approach is based on binomial modelling and inverse probability of censoring weighting techniques and is very simple to implement by standard software. We show how to do flexible regression models with possibly time-varying covariate effects. Copyright 2007 Board of the Foundation of the Scandinavian Journal of Statistics..
    Scandinavian Journal of Statistics 01/2007; 34(1):17-32. · 1.12 Impact Factor
  • Article: A flexible semiparametric transformation model for survival data.
    Thomas H Scheike
    [show abstract] [hide abstract]
    ABSTRACT: I suggest an extension of the semiparametric transformation model that specifies a time-varying regression structure for the transformation, and thus allows time-varying structure in the data. Special cases include a stratified version of the usual semiparametric transformation model. The model can be thought of as specifying a first order Taylor expansion of a completely flexible baseline. Large sample properties are derived and estimators of the asymptotic variances of the regression coefficients are given. The method is illustrated by a worked example and a small simulation study. A goodness of fit procedure for testing if the regression effects lead to a satisfactory fit is also suggested.
    Lifetime Data Analysis 01/2007; 12(4):461-80. · 0.92 Impact Factor
  • Article: Aalen Additive Hazards Change-Point Model
    Torben Martinussen, Thomas H. Scheike
    [show abstract] [hide abstract]
    ABSTRACT: We study a test comparing the full Aalen additive hazards model and the change-point model, and suggest how to estimate the parameters of the change-point model. We also study a test for no change-point effect. Both tests are provided with large sample properties and a resampling method is applied to obtain p-values. The finite-sample properties of the proposed inference procedures and estimators are assessed through a simulation study. The methods are further applied to a dataset concerning myocardial infarction. Copyright 2007, Oxford University Press.
    Biometrika 01/2007; 94(4):861-872. · 1.91 Impact Factor
  • Article: Design and analysis of time-to-pregnancy.
    Thomas H Scheike, Niels Keiding
    [show abstract] [hide abstract]
    ABSTRACT: Time-to-pregnancy(TTP), the duration that a couple waits from initiating attempts to conceive until conception occurs, is regarded as one of the direct measures of natural fecundity. Statistical tools for designing and analysing TTP studies belong to the general area of survival analysis, but several special features have been developed: it is customary to work in discrete time, and random heterogeneity between couples has always played a prominent role. This review works on this background with focus on how to perform valid analyses, under various prospective, retrospective and cross-sectional sampling frames. We illustrate using examples from our own experience.
    Statistical Methods in Medical Research 05/2006; 15(2):127-40. · 2.44 Impact Factor