The power computing formula is based on Shieh and O’Brien (1998); Shieh (2000); Self, Mauritsen, and Ohara (1992), and Hsieh (1989).
Define the following notation for a logistic regression analysis:
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
The logistic regression model is
![]() |
The hypothesis test of the first predictor variable is
![]() |
![]() |
![]() |
![]() |
Assuming independence among all predictor variables, is defined as follows:
![]() |
where is calculated according to the following algorithm:
![]() |
![]() |
![]() |
![]() |
![]() |
This algorithm causes the elements of the transposed vector to vary fastest to slowest from right to left as m increases, as shown in the following table of
values:
![]() |
The values are determined in a completely analogous manner.
The discretization is handled as follows (unless the distribution is ordinal, or binomial with sample size parameter at least
as large as requested number of bins): for , generate
quantiles at evenly spaced probability values such that each such quantile is at the midpoint of a bin with probability
. In other words,
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
The primary noncentrality for the power computation is
![]() |
where
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
where
![]() |
![]() |
![]() |
![]() |
The power is
![]() |
The factor is the adjustment for correlation between the predictor that is being tested and other predictors, from Hsieh (1989).
Alternative input parameterizations are handled by the following transformations:
![]() |
![]() |
![]() |
![]() |