This section discusses various computational details.
Let the system be composed of equations and let the ith equation be expressed in this form:
where
is the vector of observations on the dependent variable
is the matrix of observations on the endogenous variables included in the equation
is the vector of parameters associated with
is the matrix of observations on the predetermined variables included in the equation
is the vector of parameters associated with
is a vector of errors
Let , where is the projection of onto the space spanned by the instruments matrix Z.
Let
be the vector of parameters associated with both the endogenous and exogenous variables.
The K-class of estimators (Theil, 1971) is defined by
where k is a user-defined value.
Let
and
The 2SLS estimator is defined as
Let and be the vectors obtained by stacking the vectors of dependent variables and parameters for all equations, and let and be the block diagonal matrices formed by and , respectively.
The SUR and ITSUR estimators are defined as
while the 3SLS and IT3SLS estimators are defined as
where is the identity matrix, and is an estimator of the cross-equation correlation matrix. For 3SLS, is obtained from the 2SLS estimation, while for SUR it is derived from the OLS estimation. For IT3SLS and ITSUR, it is obtained iteratively from the previous estimation step, until convergence.
The VARDEF= option in the PROC SYSLIN statement controls the denominator used in calculating the cross-equation covariance estimates and the parameter standard errors and covariances. The values of the VARDEF= option and the resulting denominator are as follows:
uses the number of nonmissing observations.
uses the number of nonmissing observations less the degrees of freedom in the model.
uses the sum of the observation weights given by the WEIGHTS statement.
uses the sum of the observation weights given by the WEIGHTS statement less the degrees of freedom in the model.
The VARDEF= option does not affect the model mean squared error, root mean squared error, or statistics. These statistics are always based on the error degrees of freedom, regardless of the VARDEF= option. The VARDEF= option also does not affect the dependent variable coefficient of variation (CV).
The REDUCED option in the PROC SYSLIN statement computes estimates of the reduced form coefficients. The REDUCED option requires that the equation system be square. If there are fewer models than endogenous variables, IDENTITY statements can be used to complete the equation system.
The reduced form coefficients are computed as follows. Represent the equation system, with all endogenous variables moved to the left-hand side of the equations and identities, as
Here B is the estimated coefficient matrix for the endogenous variables Y, and is the estimated coefficient matrix for the exogenous (or predetermined) variables X.
The system can be solved for Y as follows, provided B is square and nonsingular:
The reduced form coefficients are the matrix .
The SDIAG option in the PROC SYSLIN statement computes estimates by assuming uncorrelated errors across equations. As a result, when the SDIAG option is used, the 3SLS estimates are identical to 2SLS estimates, and the SUR estimates are the same as the OLS estimates.
The OVERID option in the MODEL statement can be used to test for overidentifying restrictions on parameters of each equation. The null hypothesis is that the predetermined variables that do not appear in any equation have zero coefficients. The alternative hypothesis is that at least one of the assumed zero coefficients is nonzero. The test is approximate and rejects the null hypothesis too frequently for small sample sizes.
The formula for the test is given as follows. Let be the i th equation. are the endogenous variables that appear as regressors in the i th equation, and are the instrumental variables that appear as regressors in the i th equation. Let be the number of variables in and .
Let . Let Z represent all instrumental variables, T be the total number of observations, and K be the total number of instrumental variables. Define as follows:
Then the test statistic
is distributed approximately as an F with and degrees of freedom. See Basmann (1960) for more information.
The ALPHA= option in the PROC SYSLIN and MODEL statements parameterizes Fuller’s modification to LIML. This modification is , where is the value of the ALPHA= option, is the LIML k value,n is the number of observations, and g is the number of predetermined variables. Fuller’s modification is not used unless the ALPHA= option is specified. See Fuller (1977) for more information.