The power computing formula is based on Shieh and O’Brien (1998); Shieh (2000); Self, Mauritsen, and Ohara (1992), and Hsieh (1989).
Define the following notation for a logistic regression analysis:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The logistic regression model is
|
The hypothesis test of the first predictor variable is
|
|
|
|
Assuming independence among all predictor variables, is defined as follows:
|
where is calculated according to the following algorithm:
|
|
|
|
|
This algorithm causes the elements of the transposed vector to vary fastest to slowest from right to left as m increases, as shown in the following table of values:
|
The values are determined in a completely analogous manner.
The discretization is handled as follows (unless the distribution is ordinal, or binomial with sample size parameter at least as large as requested number of bins): for , generate quantiles at evenly spaced probability values such that each such quantile is at the midpoint of a bin with probability . In other words,
|
|
|
|
|
|
The primary noncentrality for the power computation is
|
where
|
|
|
|
|
|
|
|
where
|
|
|
|
The power is
|
The factor is the adjustment for correlation between the predictor that is being tested and other predictors, from Hsieh (1989).
Alternative input parameterizations are handled by the following transformations:
|
|
|
|