The SCORE statement creates a new SAS data set to contain predicted values and optionally residuals for data in a new data
set that you name. If you do not specify a DATA= data set, then the input data are scored. If you want to score multiple
data sets, you can specify multiple SCORE statements. If you want to create a SAS data set in a permanent library, you must
specify a two-level name. For more information about permanent libraries and SAS data sets, see
SAS Language Reference: Concepts.
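The following sketch illustrates scoring more than one data set and writing one result to a permanent library. The library path, the libref mylib, the data set names, and the variables y, x1, and x2 are hypothetical and are used only for illustration.

```sas
/* Hypothetical library path and data set names, for illustration only */
libname mylib '/path/to/permanent/library';

proc adaptivereg data=work.train;
   model y = x1 x2;
   /* No DATA= option: the input data set work.train is scored */
   score out=work.train_scored predicted;
   /* A second SCORE statement scores a different data set.          */
   /* The two-level name mylib.new_scored refers to a permanent library */
   score data=work.new out=mylib.new_scored predicted;
run;
```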
When you specify a BY statement, the DATA= data set must either contain all the BY variables sorted in the order of the BY
variables or contain none of the BY variables. If the DATA= data set contains all the BY variables, then the model that is
selected for a given BY group is used to score just the matching observations in that data set. If the DATA= data set contains
none of the BY variables, then the entire data set is scored for each BY group.
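As a minimal sketch of the first case, assume a hypothetical BY variable named region that is present in both the training data and the data to be scored. All data set and variable names here are assumptions made for illustration.

```sas
/* Sort both data sets by the BY variable (hypothetical names) */
proc sort data=work.train; by region; run;
proc sort data=work.new;   by region; run;

proc adaptivereg data=work.train;
   by region;
   model y = x1 x2;
   /* work.new contains region, so the model for each BY group */
   /* scores only the observations with the matching region    */
   score data=work.new out=work.new_scored predicted;
run;
```

If work.new contained no region variable at all, the whole data set would instead be scored once for each BY group.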
All observations in the DATA= data set are retained in the output data set. All the variables in the input data set are included
in the output data set, along with variables that contain predicted values and optionally residuals.
You can specify the following arguments in the SCORE statement:
-
DATA=SAS-data-set
-
names the data set to be scored. If you omit this option, then the input data set that is named in the DATA=
option in the PROC ADAPTIVEREG statement is scored.
-
keyword <=name>
-
specifies the statistics to include in the output data set and optionally names the new variables that contain the statistics. Specify one of the following keywords for each desired statistic, optionally followed by an equal sign and the name of a variable to contain the statistic.
If you specify keyword=name, the new variable that contains the requested statistic has the specified name. If you omit the optional =name after a keyword, then the new variable name is formed by using a prefix of one or more characters that identify the statistic, followed by
an underscore (_), followed by the dependent variable name.
You can specify the following keywords, which represent the statistics shown:
- PREDICTED | PRED | P
-
includes predicted values in the output data set. The prefix for the default name is Pred.
- RESIDUAL | RESID | R
-
includes residuals (which are calculated as ACTUAL – PREDICTED) in the output data set. The prefix for the default name is Resid.
-
OUT=SAS-data-set
-
specifies the name of the new output data set. By default, PROC ADAPTIVEREG uses the DATAn convention to name the new data set.
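The arguments above can be combined as in the following sketch. The data sets work.train and work.holdout and the variables y, x1, x2, and yhat are hypothetical. The sketch also assumes that work.holdout contains the response y, so that residuals can be computed.

```sas
proc adaptivereg data=work.train;
   model y = x1 x2;
   /* PREDICTED=yhat stores predicted values in a variable named yhat. */
   /* RESIDUAL has no =name, so under the default naming rule          */
   /* (prefix, underscore, dependent variable name) the residuals      */
   /* would be stored in Resid_y.                                      */
   score data=work.holdout out=work.holdout_scored
         predicted=yhat residual;
run;
```

If the OUT= option were omitted here, the output data set would be named according to the DATAn convention instead of work.holdout_scored.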