A semi-parametric generalization of the Cox proportional hazards regression model: Inference and applications

Division of Population Science, Fox Chase Cancer Center, Philadelphia, PA 19111.
Computational Statistics & Data Analysis (Impact Factor: 1.15). 01/2011; 55(1):667-676. DOI: 10.1016/j.csda.2010.06.010
Source: RePEc

ABSTRACT The assumption of proportional hazards (PH) fundamental to the Cox PH model sometimes may not hold in practice. In this paper, we propose a generalization of the Cox PH model in terms of the cumulative hazard function taking a form similar to the Cox PH model, with the extension that the baseline cumulative hazard function is raised to a power function. Our model allows for interaction between covariates and the baseline hazard and it also includes, for the two sample problem, the case of two Weibull distributions and two extreme value distributions differing in both scale and shape parameters. The partial likelihood approach can not be applied here to estimate the model parameters. We use the full likelihood approach via a cubic B-spline approximation for the baseline hazard to estimate the model parameters. A semi-automatic procedure for knot selection based on Akaike's information criterion is developed. We illustrate the applicability of our approach using real-life data.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Practical estimation procedures for the local linear estimation of an unrestricted failure rate when more information is available than just time are developed. This extra information could be a covariate and this covariate could be a time series. Time dependent covariates are sometimes called markers, and failure rates are sometimes called hazards, intensities or mortalities. It is shown through simulations and a practical example that the fully local linear estimation procedure exhibits an excellent practical performance. Two different bandwidth selection procedures are developed. One is an adaptation of classical cross-validation, and the other one is indirect cross-validation. The simulation study concludes that classical cross-validation works well on continuous data while indirect cross-validation performs only marginally better. However, cross-validation breaks down in the practical data application to old-age mortality. Indirect cross-validation is thus shown to be superior when selecting a fully feasible estimation method for marker dependent hazard estimation.
    Computational Statistics & Data Analysis 12/2013; 68:155-169. DOI:10.1016/j.csda.2013.06.010 · 1.15 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: This article applies general engineering rules for describing the reliability of devices working under variable stresses. The approach is based on imposing completeness and physicality. Completeness refers to the model's capability for studying as many stated conditions as possible, and physicality refers to the model's capability for incorporating explanatory variables specified and related each other by the physical laws. The proposed reliability model has as many explanatory variables as necessary but only three unknown parameters, and hence, it allows the engineer to collect reliability data from different tests campaigns, and to extrapolate reliability results towards other operational and design points.
    Communication in Statistics- Theory and Methods 05/2014; 43(10-12). DOI:10.1080/03610926.2013.775303 · 0.28 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Multifactor software reliability modeling with software test metrics data is well known to be useful for predicting the software reliability with higher accuracy, because it utilizes not only software fault count data but also software testing metrics data observed in the development process. In this paper we generalize the existing Cox proportional hazards regression-based software reliability model by introducing more generalized hazards representation, and improve the goodness-of-fit and predictive performances. In numerical examples with real software development project data, we show that our generalized model can significantly outperform several logistic regression-based models as well as the existing Cox proportional hazards regression-based model.
    Proceedings of the 2013 IEEE 19th Pacific Rim International Symposium on Dependable Computing; 12/2013


Available from