Determining the Components of Variation

The standard Shewhart analysis assumes that sampling variation, also referred to as within-group variation, is the only source of variation. Writing $x_{ij}$ for the jth measurement within the ith subgroup, you can express the model for the conventional $\bar{X}$ and R chart as

\[  \begin{array}{rclp{3in}r} x_{ij} &  = &  \mu + \sigma _{W} \epsilon _{ij} & &  (1) \end{array}  \]

for $i = 1, 2, \ldots , k$ and $j = 1, 2, \ldots , n$. The random variables $\epsilon _{ij}$ are assumed to be independent with zero mean and unit variance, and $\sigma ^{2}_{W}$ is the within-subgroup variance. The parameter $\mu $ denotes the process mean.

In a process such as film manufacturing, this model is not adequate because there is additional variation due to changes in temperature, pressure, raw material, and other factors. A more appropriate model is

\[  \begin{array}{rclp{2.5in}r} x_{ij} &  = &  \mu + \sigma _{B} \omega _{i} + \sigma _{W} \epsilon _{ij} & &  (2) \end{array}  \]

where $\sigma ^{2}_{B}$ is the between-subgroup variance, the random variables $\omega _{i}$ are independent with zero mean and unit variance, and the random variables $\omega _{i}$ are independent of the random variables $\epsilon _{ij}$.[105]

To plot the subgroup averages $\bar{x}_{i.} \equiv \frac{1}{n} \sum ^{n}_{j=1} x_{ij}$ on a control chart, you need expressions for the expectation and variance of $\bar{x}_{i.}$. These are

\[  \begin{array}{rcl} E (\bar{x}_{i.} ) &  = &  \mu \\ \mbox{Var} (\bar{x}_{i.} ) &  = &  \sigma ^{2}_{B} + \frac{\sigma ^{2}_{W}}{n} \end{array}  \]

Thus, the central line should be located at $\hat{\mu }$, and $3\sigma $ limits should be located at

\[  \begin{array}{rp{3.1in}r} \hat{\mu } \pm 3 \sqrt { \widehat{\sigma ^{2}_{B}} + \frac{\widehat{\sigma ^{2}_{W}}}{n} } & &  (3) \end{array}  \]

where $\widehat{\sigma ^{2}_{B}}$ and $\widehat{\sigma ^{2}_{W}}$ denote estimates of the variance components. You can use a variety of SAS procedures for fitting linear models to estimate the variance components. The following statements show how this can be done with the MIXED procedure:

title;
proc mixed data=Film2;
   class Sample;
   model Testval = / s;
   random Sample;
   ods output solutionf=sf;
   ods output covparms=cp;
run;

The results are shown in Figure 17.201. Note that the parameter estimates are $\widehat{\sigma ^2_ B}=19.25$, $\widehat{\sigma ^2_ W}=39.68$, and $\hat{\mu }=88.90$.

Figure 17.201: Partial Output from the MIXED Procedure

The Mixed Procedure

Covariance Parameter Estimates
Cov Parm Estimate
Sample 19.2526
Residual 39.6825

Solution for Fixed Effects
Effect Estimate Standard Error DF t Value Pr > |t|
Intercept 88.8963 0.7250 55 122.61 <.0001


The following statements merge the output data sets from the MIXED procedure into a SAS data set named Newlim that contains the appropriately derived control limit parameters for average test value:

data cp;
   set cp sf;
   keep Estimate;
run;

proc transpose data=cp out=Newlim;
run;

data Newlim (keep=_lclx_ _mean_ _uclx_);
   set Newlim;
   _limitn_ = 4;
   _mean_   = col3;
   _stddev_ = sqrt(4*col1 + col2);
   _lclx_   = _mean_ - 3*_stddev_ / sqrt(_limitn_);
   _uclx_   = _mean_ + 3*_stddev_ / sqrt(_limitn_);
   output;
run;

Here, the variable _LIMITN_ is assigned the value of n, the variable _MEAN_ is assigned the value of $\hat{\mu }$, and the variable _STDDEV_ is assigned the value of

\[  \hat{\sigma }\! _{\mbox{\scriptsize adj}} \equiv \sqrt { 4 \widehat{\sigma ^{2}_{B}} + \widehat{\sigma ^{2}_{W}} }  \]

The $3\sigma $ limits (_LCLX_ and _UCLX_) are computed according to (3) using $\hat{\sigma }\! _{\mbox{\scriptsize adj}}$. The data set Newlim contains the mean and $3\sigma $ limits for the average test value.

The following statements compute appropriate control limits for the $\bar{X}$ and R charts, which are shown in Figure 17.202. First, the data set Newlim2 is created by merging the data set RLimits, which contains the original R chart limits computed in Preliminary Examination of Variation, with Newlim, which saved the appropriate $\bar{X}$ chart limits. The original R chart limits are valid because the range in the ith subgroup is $R_ i = \sigma _ W ( \max _ j \epsilon _{ij} - \min _ j \epsilon _{ij} )$, which is the same for models (1) and (2). The LIMITS= option specifies the data set Newlim2 as the source of the control limits for Figure 17.202.

data Newlim2;
   merge Newlim RLimits (drop=_lclx_ _mean_ _uclx_);
run;

title 'Control Chart with Adjusted Limits';
symbol h = 2.0 pct;
proc shewhart data=Film2 limits=Newlim2;
   xrchart Testval*Sample / npanelpos = 60;
   label Testval='Average Test Value';
run;

The control limits for the $\bar{X}$ chart in Figure 17.202 are $\hat{\mu } \pm \frac{3}{\sqrt {n}} \hat{\sigma }\! _{\mbox{\scriptsize adj}}$. This chart correctly indicates that the variation in the process is due to common causes.

Figure 17.202: $\bar{X}$ and R Chart with Derived Control Limits

X and R Chart with Derived Control Limits


You can use a similar set of statements to display the derived control limits in Newlim on an $\bar{X}$ and R chart for the original data (including outliers), as shown in Figure 17.203.

Figure 17.203: $\bar{X}$ and R Chart with Derived Control Limits for Raw Data

X and R Chart with Derived Control Limits for Raw Data


A simple alternative to the chart in Figure 17.202 is an individual measurements chart for the subgroup means. The advantage of the variance components approach is that it yields separate estimates of the components due to lane and sample, as well as a number of hypothesis tests (these require assumptions of normality). In applying this method, however, you should be careful to use data that represent the process in a state of statistical control.



[105] This notation is used in Chapter 3 of Wetherill and Brown (1991), which discusses this issue.