Let Y be the response variable that takes values
. Let j index all observations and
be the value of response for the jth observation. Consider the multinomial variable
such that
and . With
denoting the probability that the jth observation has response value i, the expected value of
is
, and
. The covariance matrix of
is
, which is the covariance matrix of a multinomial random variable for one trial with parameter vector
. Let
be the vector of regression parameters—for example,
for cumulative logit model. Let
be the matrix of partial derivatives of
with respect to
. The estimating equation for the regression parameters is
where , and
and
are the WEIGHT and FREQ values of the jth observation.
With a starting value of , the pseudo-estimate of
is obtained iteratively as
where ,
, and
are evaluated at the ith iteration
. The expression after the plus sign is the step size. If the log likelihood evaluated at
is less than that evaluated at
, then
is recomputed by step-halving or ridging. The iterative scheme continues until convergence is obtained—that is, until
is sufficiently close to
. Then the maximum likelihood estimate of
is
.
By default, starting values are zero for the slope parameters, and starting values are the observed cumulative logits (that is, logits of the observed cumulative proportions of response) for the intercept parameters. Alternatively, the starting values can be specified with the INEST= option in the PROC SURVEYLOGISTIC statement.