The GENMOD Procedure

PROC GENMOD Statement

PROC GENMOD <options> ;

The PROC GENMOD statement invokes the GENMOD procedure. Table 40.1 summarizes the options available in the PROC GENMOD statement.

Table 40.1: PROC GENMOD Statement Options

Option

Description

DATA=

Specifies the input data set

DESCENDING

Sorts response variable in the reverse of the default order

EXACTONLY

Requests only the exact analyses

NAMELEN=

Specifies the length of effect names

ORDER=

Specifies the sort order of CLASS variable

PLOTS

Controls the plots produced through ODS Graphics

RORDER=

Specifies the sort order for the levels of the response variable


You can specify the following options.

DATA=SAS-data-set

specifies the SAS data set containing the data to be analyzed. If you omit the DATA= option, the procedure uses the most recently created SAS data set.

DESCENDING
DESCEND
DESC

specifies that the levels of the response variable for the ordinal multinomial model and the binomial model with single variable response syntax be sorted in the reverse of the default order. For example, if RORDER=FORMATTED (the default), the DESCENDING option causes the levels to be sorted from highest to lowest instead of from lowest to highest. If RORDER=FREQ, the DESCENDING option causes the levels to be sorted from lowest frequency count to highest instead of from highest to lowest.

EXACTONLY

requests only the exact analyses. The asymptotic analysis that PROC GENMOD usually performs is suppressed.

NAMELEN=n

specifies the length of effect names in tables and output data sets to be n characters long, where n is a value between 20 and 200 characters. The default length is 20 characters.

ORDER=DATA | FORMATTED | FREQ | INTERNAL

specifies the sort order for the levels of the classification variables (which are specified in the CLASS statement). The ORDER= option can be useful when you use the CONTRAST or ESTIMATE statement because it determines which parameters in the model correspond to each level in the data.

This option applies to the levels for all classification variables, except when you use the (default) ORDER=FORMATTED option with numeric classification variables that have no explicit format. With this option, the levels of such variables are ordered by their internal value.

The ORDER= option can take the following values:

Value of ORDER=

Levels Sorted By

DATA

Order of appearance in the input data set

FORMATTED

External formatted value, except for numeric variables with no explicit format, which are sorted by their unformatted (internal) value

FREQ

Descending frequency count; levels with the most observations come first in the order

INTERNAL

Unformatted value

By default, ORDER=FORMATTED. For ORDER=FORMATTED and ORDER=INTERNAL, the sort order is machine-dependent. For more information about sort order, see the chapter on the SORT procedure in the Base SAS Procedures Guide and the discussion of BY-group processing in SAS Language Reference: Concepts.

PLOTS <(global-plot-option)>= plot-request <(options)>
PLOTS <(global-plot-options)> <= (plot-request <(options)> <…plot-request <(options)>>)>

specifies plots to be created using ODS Graphics. Many of the observational statistics in the output data set can be plotted using this option. You are not required to create an output data set in order to produce a plot. When you specify only one plot request, you can omit the parentheses around the plot request. Here are some examples:

plots=all
plots=predicted
plots=(predicted reschi)
plots(unpack)=dfbeta

ODS Graphics must be enabled before plots can be requested. For example:

proc genmod plots=all;
   model y = x;
run;

For more information about enabling and disabling ODS Graphics, see the section Enabling and Disabling ODS Graphics in Chapter 21: Statistical Graphics Using ODS.

Any specified global plot options apply to all plots that are specified with plot requests. The following global plot options are available.

CLUSTERLABEL

displays formatted levels of the SUBJECT= effect instead of plot symbols. This option applies only to diagnostic statistics for models fit by GEEs that are plotted against cluster number, and provides a way to identify cluster level names with corresponding ordered cluster numbers.

UNPACK

displays multiple plots individually. The default is to display related multiple plots in a panel.

See the section OUTPUT Statement for definitions of the statistics specified with the plot requests. The plot requests include the following:

ALL

produces all available plots.

COOKSD
DOBS

plots the Cook’s distance statistic as a function of observation number.

DFBETA

plots the $\bbeta $ deletion statistic as a function of observation number for each regression parameter in the model.

DFBETAS

plots the standardized $\bbeta $ deletion statistic as a function of observation number for each regression parameter in the model.

LEVERAGE

plots the leverage as a function of observation number.

PREDICTED<(option)>

plots predicted values with confidence limits as a function of observation number. The PREDICTED plot request has the following option:

CLM

includes confidence limits in the predicted value plot.

PZERO

plots the zero inflation probability for zero-inflated Poisson and negative binomial models as a function of observation number.

RESCHI<(options)>

The RESCHI plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, Pearson residuals are plotted as a function of observation number.

RESDEV<(options)>

plots deviance residuals. The RESDEV plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, deviance residuals are plotted as a function of observation number.

RESLIK<(options)>

plots likelihood residuals. The RESLIK plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, likelihood residuals are plotted as a function of observation number.

RESRAW<(options)>

plots raw residuals. The RESRAW plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, raw residuals are plotted as a function of observation number.

STDRESCHI<(options)>

plots standardized Pearson residuals. The STDRESCHI plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, standardized Pearson residuals are plotted as a function of observation number.

STDRESDEV<(options)>

plots standardized deviance residuals. The STDRESDEV plot request has the following options:

INDEX

plots as a function of observation number.

XBETA

plots as a function of linear predictor.

If you do not specify an option, standardized deviance residuals are plotted as a function of observation number.

If you fit a model by using generalized estimating equations (GEEs), the following additional plot requests are available:

CLEVERAGE

plots the cluster leverage as a function of ordered cluster.

CLUSTERCOOKSD
DCLS

plots the cluster Cook’s distance statistic as a function of ordered cluster.

CLUSTERDFIT
MCLS

plots the studentized cluster Cook’s distance statistic as a function of ordered cluster.

DFBETAC

plots the cluster deletion statistic as a function of ordered cluster for each regression parameter in the model.

DFBETACS

plots the standardized cluster deletion statistic as a function of ordered cluster for each regression parameter in the model.

RORDER=keyword

specifies the sort order for the levels of the response variable. This order determines which intercept parameter in the model corresponds to each level in the data. If RORDER=FORMATTED for numeric variables for which you have supplied no explicit format, the levels are ordered by their internal values. The following table displays the valid keywords and describes how PROC GENMOD interprets them.

RORDER=keyword

Levels Sorted by

DATA

Order of appearance in the input data set

FORMATTED

External formatted value, except for numeric

 

variables with no explicit format, which are

 

sorted by their unformatted (internal) value

FREQ

Descending frequency count; levels with the

 

most observations come first in the order

INTERNAL

Unformatted value

By default, RORDER=FORMATTED. For RORDER=FORMATTED and RORDER=INTERNAL, the sort order is machine dependent. The DESCENDING option in the PROC GENMOD statement causes the response variable to be sorted in the reverse of the order displayed in the previous table. For more information about sort order, see the chapter on the SORT procedure in the Base SAS Procedures Guide.

The NOPRINT option, which suppresses displayed output in other SAS procedures, is not available in the PROC GENMOD statement. However, you can use the Output Delivery System (ODS) to suppress all displayed output, store all output on disk for further analysis, or create SAS data sets from selected output. You can suppress all displayed output with the statement ODS SELECT NONE; and turn displayed output back on with the statement ODS SELECT ALL;. See Table 40.12 and Table 40.13 for the names of output tables available from PROC GENMOD. For more information about ODS, see Chapter 20: Using the Output Delivery System.