The CATMOD Procedure

Background: The Underlying Model

The CATMOD procedure analyzes data that can be represented by a two-dimensional contingency table. The rows of the table correspond to populations (or samples) formed on the basis of one or more independent variables. The columns of the table correspond to observed responses formed on the basis of one or more dependent variables. The frequency in the $(i,j)$ cell is the number of subjects in the ith population that have the jth response. The frequencies in the table are assumed to follow a product multinomial distribution, corresponding to a sampling design in which a simple random sample is taken for each population. The contingency table can be represented as shown in Table 32.1.

Table 32.1: Contingency Table Representation

	Response
Sample	1	2	$\cdots$	r	Total
1	$n_{11}$	$n_{12}$	$\cdots$	$n_{1r}$	$n_1$
2	$n_{21}$	$n_{22}$	$\cdots$	$n_{2r}$	$n_2$
$\vdots$	$\vdots$	$\vdots$	$\ddots$	$\vdots$	$\vdots$
s	$n_{s1}$	$n_{s2}$	$\cdots$	$n_{sr}$	$n_ s$

For each sample i, the probability of the jth response ( $\pi _{ij}$ ) is estimated by the sample proportion, $p_{ij}=n_{ij}/n_ i$ . The vector ( $\mb{p}$ ) of all such proportions is then transformed into a vector of functions, denoted by $\mb{F} = \mb{F}(\mb{p})$ . If $\bpi$ denotes the vector of true probabilities for the entire table, then the functions of the true probabilities, denoted by $\bF (\bpi )$ , are assumed to follow a linear model

$\mb{E_ A}(\bF ) = \bF (\bpi ) = \mb{X} \bbeta$

where $\mb{E_ A}$ denotes asymptotic expectation, $\mb{X}$ is the design matrix containing fixed constants, and $\bbeta$ is a vector of parameters to be estimated.

PROC CATMOD provides two estimation methods:

The weighted least squares method minimizes the weighted residual sum of squares for the model. The weights are contained in the inverse covariance matrix of the functions $\mb{F}(\mb{p})$ . According to central limit theory, if the sample sizes within populations are sufficiently large, the elements of $\mb{F}$ and $\mb{b}$ (the estimate of $\bbeta$ ) are distributed approximately as multivariate normal. This allows the computation of statistics for testing the goodness of fit of the model and the significance of other sources of variation. For details of the theory, see Grizzle, Starmer, and Koch (1969) or Koch et al. (1977, Appendix 1). Weighted least squares estimation is available for all types of response functions.
The maximum likelihood method estimates the parameters of the linear model so as to maximize the value of the joint multinomial likelihood function of the responses. Maximum likelihood estimation is available only for the standard response functions, logits and generalized logits, which are used for logistic regression analysis and log-linear model analysis. Two methods of maximization are available: Newton-Raphson and iterative proportional fitting. For details of the theory, see Bishop, Fienberg, and Holland (1975).

Following parameter estimation, hypotheses about linear combinations of the parameters can be tested. For that purpose, PROC CATMOD computes generalized Wald (1943) statistics, which are approximately chi-square distributed if the sample sizes are sufficiently large and the null hypotheses are true.