Measures of predictive accuracy of regression models quantify the extent to which covariates determine an individual outcome. Schemper and Henderson’s (2000) proposed predictive accuracy measure is defined as the difference between individual processes and the fitted survivor function.
For the ith individual (), let
and
be the left-truncation time, observed time, event indicator (1 for death and 0 for censored), and covariate vector, respectively.
If there is no delay entry, then
. Let
be m distinct event times with
deaths at
. The survival process
for the ith individual is
Let be the Kaplan-Meier estimate of the survivor function (assuming no covariates). Let
be the fitted survivor function with covariates
, and if you specify TIES=EFRON, then
is computed by the Efron method; otherwise, the Breslow estimate is used.
The predictive accuracy is defined as the difference between individual survival processes and the fitted survivor functions with (
)) or without (
) covariates between 0 and
, the largest observed time. For each death time
, define a mean absolute distance between the
and the
as
where . Let
be defined similarly to
, but with
replaced by
and
replaced by
. Let
be the Kaplan-Meier estimate of the censoring or potential follow-up distribution, and let
The overall estimator of the predictive accuracy with () and without (
) covariates are weighted averages of
and
, respectively, given by
The explained variation by the Cox regression is
Because the predictive accuracy measures and
are based on differences between individual survival processes and fitted survivor functions, a smaller value indicates a
better prediction. For this reason,
and
are also referred to as predictive inaccuracy measures.