Heinze G, Schemper M. A solution to the problem of separation in logistic regression. Stat Med 21: 2409-2419

Section of Clinical Biometrics, Department of Medical Computer Sciences, University of Vienna, Spitalgasse 23, A-1090 Vienna, Austria.
Statistics in Medicine (Impact Factor: 1.83). 08/2002; 21(16):2409-19. DOI: 10.1002/sim.1047
Source: PubMed


The phenomenon of separation or monotone likelihood is observed in the fitting process of a logistic model if the likelihood converges while at least one parameter estimate diverges to +/- infinity. Separation primarily occurs in small samples with several unbalanced and highly predictive risk factors. A procedure by Firth originally developed to reduce the bias of maximum likelihood estimates is shown to provide an ideal solution to separation. It produces finite parameter estimates by means of penalized maximum likelihood estimation. Corresponding Wald tests and confidence intervals are available but it is shown that penalized likelihood ratio tests and profile penalized likelihood confidence intervals are often preferable. The clear advantage of the procedure over previous options of analysis is impressively demonstrated by the statistical analysis of two cancer studies.

Download full-text


Available from: Georg Heinze, Dec 22, 2014
  • Source
    • "How - ever , when the occurrence probability is extremely small ( or large) , or if the number of samples is not enough for the number of parameters , it is sometimes impossible to obtain accurate estimates with this method . The exact method is one way of resolving these issues ( Mehta and Patel 1995 ) ( Heinze and Schemper 2002) . "
    [Show abstract] [Hide abstract]
    ABSTRACT: In 2014, we published an article titled "Novel uterine sarcoma preoperative diagnosis score predicts the need for surgery in patients presenting with a uterine mass" on the preoperative diagnosis of uterine sarcoma, in the SpringerPlus (Nagai et al. in SpringerPlus 2014, 3:678. doi:10.1186/2193-1801-3-678). Subsequently, we received several suggestions from readers, which were used to modify the statistical analysis methods and create a more precise preoperative diagnostic scoring system, which we present here as a supplemental report. The subjects were 63 patients who underwent surgical therapy for suspected uterine sarcoma (sarcoma group: 15 patients, benign group: 48 patients). Logistic regression analysis using the exact method was performed considering the subjects' preoperative age, serum lactate dehydrogenase levels, magnetic resonance imaging findings, and endometrial cytology findings. We then used parameter estimates obtained from this analysis to revise the PREoperative Sarcoma Score (PRESS). The revised PRESS (rPRESS) has a maximum score of 10 points and an optimal cut-off value of 4 points, as derived from a receiver operating characteristic curve. Using this, the accuracy, positive predictive value, and negative predictive value were 93.7, 92.3, and 94.0 %, respectively. The diagnostic precision of the rPRESS is better than that of the original PRESS.
    SpringerPlus 09/2015; 4(1):520. DOI:10.1186/s40064-015-1318-7
  • Source
    • "The problem of potential dependency could be resolved in generalized linear mixed models (GLMM) by including female identity and pond as random factors. However, our data included several incidences of perfect prediction, that is, when all of the females in a pond scored either positive or negative assortative, which interferes with parameter estimation in GLMM (Heinze and Schemper 2002). An appropriate correction for the bias introduced by perfect prediction has not yet been developed for mixed models (G. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Assortative mating promotes reproductive isolation and allows allopatric specia-tion processes to continue in secondary contact. As mating patterns are determined by mate preferences and intrasexual competition, we investigated male–male competition and behavioral isolation in simulated secondary contact among allopatric populations. Three allopatric color morphs of the cichlid fish Tropheus were tested against each other. Dyadic male–male contests revealed dominance of red males over bluish and yellow-blotch males. Reproductive isolation in the presence of male–male competition was assessed from genetic parent-age in experimental ponds and was highly asymmetric among pairs of color morphs. Red females mated only with red males, whereas the other females performed variable degrees of heteromorphic mating. Discrepancies between mating patterns in ponds and female preferences in a competition-free, two-way choice paradigm suggested that the dominance of red males interfered with positive assortative mating of females of the subordinate morphs and provoked asymmet-ric hybridization. Between the nonred morphs, a significant excess of negative assortative mating by yellow-blotch females with bluish males did not coincide with asymmetric dominance among males. Hence, both negative assortative mating preferences and interference of male–male competition with positive assorta-tive preferences forestall premating isolation, the latter especially in environments unsupportive of competition-driven spatial segregation.
    Ecology and Evolution 04/2015; DOI:10.1002/ece3.1372 · 2.32 Impact Factor
  • Source
    • "For example, according to the general bullying item, none of the children with OI or with moderate ID were bullied. Given this constancy , we decided to remove the OI and moderate ID variables in the MLR analyses rather than using the Haldane correction where the 0 is replaced with a value of 1 (Bull, Mak, & Greenwood, 2002; Heinze & Schemper, 2002). However, it was not necessary to remove these students in the calculation of prevalence rates. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Prevalence rates for bullying victimization among children with disabilities have varied greatly in the research literature. Two reasons for such variability were the focus of this study: (a) rates vary as a function of disability type, and (b) rates vary based on the bullying measure and criteria used to classify students as bullying victims. The sample consisted of 1,027 parents or guardians of children with disabilities and 11,500 parents or guardians of children without disabilities who reported the frequency with which their children experienced bullying in general and 12 specific behaviors associated with verbal, physical, and social–relational bullying. Prevalence rates and odds ratios (ORs) differed considerably based not only on disability type but also on the classification criteria used. For both conceptual and practical reasons, it is recommended that bullying victims be considered those who experience bullying-related behaviors frequently and repetitively as opposed to only sometimes.
    School psychology review 03/2015; 44(1):98-116. DOI:10.17105/SPR44-1.98-116 · 1.85 Impact Factor
Show more