OUTPUT <OUT=SAS-data-set> <keyword <(keyword-options )> <=name>> …<keyword <(keyword-options )> <=name>>;
The OUTPUT statement creates a new SAS data set to contain diagnostic measures that are calculated for the selected model. If you do not specify a keyword, then the only diagnostic included is the predicted response.
All the variables in the original data set are included by the new data set, along with variables created in the OUTPUT statement.
These new variables contain the values of a variety of statistics and diagnostic measures that are calculated for each observation
in the data set. If you specify a BY statement, then a variable _BY_
that indexes the BY groups is included. For each observation, the value of _BY_
is the index of the BY group to which this observation belongs.
If you have requested n-fold cross validation, then a variable _CVINDEX_
is included in the output data set. For each observation that is used for model training, the value of _CVINDEX_
is i if that observation is omitted in forming the ith subset of the training data. See the CVMETHOD=
for additional details. The value of _CVINDEX_
is 0 for all observations in the input data set that are not used for model training.
If you have partitioned the input data by using a PARTITION
statement, then a character variable _ROLE_
is included in the output data set. For each observation the value of _ROLE_
is as follows:
|
Observation Role |
---|---|
TEST |
Testing |
TRAIN |
Training |
VALIDATE |
Validation |
If you want to create a permanent SAS data set, you must specify a two-level name. For more information about permanent SAS data sets, see SAS Language Reference: Concepts.
Details about the specifications in the OUTPUT statement follow.