The MI Procedure

TRANSFORM Statement

  • TRANSFORM transform (variables</ options>)<…transform (variables</ options>)> ;

The TRANSFORM statement lists the transformations and their associated variables to be transformed. The options are transformation options that provide additional information for the transformation.

The MI procedure assumes that the data are from a multivariate normal distribution when either the regression method or the MCMC method is used. When some variables in a data set are clearly non-normal, it is useful to transform these variables to conform to the multivariate normality assumption. With a TRANSFORM statement, variables are transformed before the imputation process, and these transformed variable values are displayed in all of the results. When you specify an OUT= option, the variable values are back-transformed to create the imputed data set.

The following transformations can be used in the TRANSFORM statement:

BOXCOX

specifies the Box-Cox transformation of variables. The variable Y is transformed to ${\frac{ (\mr{\Variable{Y}}+c)^{\lambda }-1}{\lambda }}$, where c is a constant such that each value of $\mr{\Variable{Y}}+c$ must be positive. If the specified constant ${\lambda }=0$, the logarithmic transformation is used.

EXP

specifies the exponential transformation of variables. The variable Y is transformed to $\mr{e}^{(\mr{\Variable{Y}}+c)}$, where c is a constant.

LOG

specifies the logarithmic transformation of variables. The variable Y is transformed to $\mr{log}(\mr{\Variable{Y}}+c)$, where c is a constant such that each value of $\mr{\Variable{Y}}+c$ must be positive.

LOGIT

specifies the logit transformation of variables. The variable Y is transformed to $\mr{log}( {\frac{\mr{\Variable{Y}}/c}{1-\mr{\Variable{Y}}/c}} )$, where the constant c>0 and the values of $\mr{\Variable{Y}}/c$ must be between 0 and 1.

POWER

specifies the power transformation of variables. The variable Y is transformed to $(\mr{\Variable{Y}}+c)^{\lambda }$, where c is a constant such that each value of $\mr{\Variable{Y}}+c$ must be positive and the constant ${\lambda } \neq 0$.

The following options provide the constant c and $\lambda $ values in the transformations.

C=number

specifies the c value in the transformation. The default is c = 1 for logit transformation and c = 0 for other transformations.

LAMBDA=number

specifies the ${\lambda }$ value in the power and Box-Cox transformations. You must specify the ${\lambda }$ value for these two transformations.

For example, the following statement requests that variables $\mr{log}(\Variable{y1})$, a logarithmic transformation for the variable y1, and $\sqrt {\Variable{y2}+1}$, a power transformation for the variable y2, be used in the imputation:

   transform log(y1) power(y2/c=1 lambda=.5);

If the MU0= option is used to specify a parameter value $\bmu _0$ for a transformed variable, the same transformation for the variable is also applied to its corresponding MU0= value in the t test. Otherwise, $\bmu _0=0$ is used for the transformed variable. See Example 63.10 for a usage of the TRANSFORM statement.