Article

# A simple method of sample size calculation for linear and logistic regression.

CSPCC, Department of Veterans Affairs, Palo Alto Health Care System (151-K), California 94304, USA.
Statistics in Medicine (Impact Factor: 2.04). 08/1998; 17(14):1623-34. DOI:10.1002/(SICI)1097-0258(19980730)17:143.0.CO;2-S
Source: PubMed

ABSTRACT A sample size calculation for logistic regression involves complicated formulae. This paper suggests use of sample size formulae for comparing means or for comparing proportions in order to calculate the required sample size for a simple logistic regression model. One can then adjust the required sample size for a multiple logistic regression model by a variance inflation factor. This method requires no assumption of low response probability in the logistic model as in a previous publication. One can similarly calculate the sample size for linear regression models. This paper also compares the accuracy of some existing sample-size software for logistic regression with computer power simulations. An example illustrates the methods.

0 0
·
0 Bookmarks
·
383 Views
• ##### Article: Sample Size for Logistic Regression with Small Response Probability
[hide abstract]
ABSTRACT: The Fisher information matrix for the estimated parameters in a multiple logistic regression can be approximated by the augmented Hessian matrix of the moment-generating function for the covariates. The approximation is valid when the probability of response is small. With its use one can obtain a simple closed-form estimate of the asymptotic covariance matrix of the maximum likelihood parameter estimates, and thus approximate sample sizes needed to test hypotheses about the parameters. The method is developed for selected distributions of a single covariate and for a class of exponential-type distributions of several covariates. It is illustrated with an example concerning risk factors for coronary heart disease.
Journal of The American Statistical Association - J AMER STATIST ASSN. 01/1981; 76(373):27-32.
• Source
##### Article: Sample size tables for logistic regression.
[hide abstract]
ABSTRACT: Sample size tables are presented for epidemiologic studies which extend the use of Whittemore's formula. The tables are easy to use for both simple and multiple logistic regressions. Monte Carlo simulations are performed which show three important results. Firstly, the sample size tables are suitable for studies with either high or low event proportions. Secondly, although the tables can be inaccurate for risk factors having double exponential distributions, they are reasonably adequate for normal distributions and exponential distributions. Finally, the power of a study varies both with the number of events and the number of individuals at risk.
Statistics in Medicine 08/1989; 8(7):795-802. · 2.04 Impact Factor
• ##### Article: Sample size calculations for studies with correlated observations.
[hide abstract]
ABSTRACT: Correlated data occur frequently in biomedical research. Examples include longitudinal studies, family studies, and ophthalmologic studies. In this paper, we present a method to compute sample sizes and statistical powers for studies involving correlated observations. This is a multivariate extension of the work by Self and Mauritsen (1988, Biometrics 44, 79-86), who derived a sample size and power formula for generalized linear models based on the score statistic. For correlated data, we appeal to a statistic based on the generalized estimating equation method (Liang and Zeger, 1986, Biometrika 73, 13-22). We highlight the additional assumptions needed to deal with correlated data. Some special cases that are commonly seen in practice are discussed, followed by simulation studies.
Biometrics 10/1997; 53(3):937-47. · 1.41 Impact Factor