To estimate the parameters in a linear model with mean function by maximum likelihood, you need to specify the distribution of the response vector
. In the linear model with a continuous response variable, it is commonly assumed that the response is normally distributed.
In that case, the estimation problem is completely defined by specifying the mean and variance of
in addition to the normality assumption. The model can be written as
, where the notation
indicates a multivariate normal distribution with mean vector
and variance matrix
. The log likelihood for
then can be written as
This function is maximized in when the sum of squares
is minimized. The maximum likelihood estimator of
is thus identical to the ordinary least squares estimator. To maximize
with respect to
, note that
Hence the MLE of is the estimator
This is a biased estimator of , with a bias that decreases with n.