PROC FASTCLUS
<MAXCLUSTERS=n> <RADIUS=t> <options> ;
The PROC FASTCLUS statement invokes the FASTCLUS procedure. You must specify the MAXCLUSTERS= option or RADIUS= option or both in the PROC FASTCLUS statement.
You can specify the following options in the PROC FASTCLUS statement. Table 36.1 summarizes the options available in the PROC FASTCLUS statement.
Table 36.1: PROC FASTCLUS Statement Options
Option |
Description |
---|---|
Specify input and output data sets |
|
Specifies input data set |
|
Specifies input SAS data set previously created by the OUTSTAT= option |
|
Specifies input SAS data set for selecting initial cluster seeds |
|
Specifies divisor for variances |
|
Output Data Processing |
|
Specifies name for cluster membership variable in OUTSEED= and OUT= data sets |
|
Specifies label for cluster membership variable in OUTSEED= and OUT= data sets |
|
Specifies output SAS data set containing original data and cluster assignments |
|
Specifies writing to OUTSEED= data set on every iteration |
|
Specifies output SAS data set containing cluster centers |
|
Specifies output SAS data set containing statistics |
|
Initial Clusters |
|
Permits cluster to seeds to drift during initialization |
|
Specifies maximum number of clusters |
|
Specifies minimum distance for selecting new seeds |
|
Specifies seed to initializes pseudo-random number generator |
|
Specifies seed replacement method |
|
Clustering Methods |
|
Specifies convergence criterion |
|
Deletes cluster seeds with few observations |
|
Optimizes an criterion, where |
|
Specifies maximum number of iterations |
|
Prevents an observation from being assigned to a cluster if its distance to the nearest cluster seed is large |
|
Arcane Algorithmic Options |
|
Specifies number of bins used for computing medians for LEAST=1 |
|
Specifies criterion for updating the homotopy parameter |
|
Specifies initial value of the homotopy parameter |
|
Uses an iteratively reweighted least squares method instead of the modified Ekblom-Newton method for 1 < p < 2 |
|
Missing Values |
|
Imputes missing values after final cluster assignment |
|
Excludes observations with missing values |
|
Control Displayed Output |
|
Displays distances between cluster centers |
|
Displays cluster assignments for all observations |
|
Suppresses displayed output |
|
Suppresses display of large matrices |
|
Suppresses display of all results except for the cluster summary |
|
Suppresses warning in output |
The following list provides details on these options. The list is in alphabetical order.