A dissimilarity measure is called an ultrametric if it satisfies the following conditions:
for all x
for all x, y
for all x, y
for all x, y, and z
Any hierarchical clustering method induces a dissimilarity measure on the observations—say, . Let be the cluster with the fewest members that contains both and . Assume was formed by joining and . Then define .
If the fusion of and reduces the number of clusters from g to , then define . Johnson (1967) shows that if
then is an ultrametric. A method that always satisfies this condition is said to be a monotonic or ultrametric clustering method. All methods implemented in PROC CLUSTER except CENTROID, EML, and MEDIAN are ultrametric (Milligan, 1979; Batagelj, 1981).