Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

PJ Rousseeuw - Journal of computational and applied mathematics, 1987 - Elsevier
Journal of computational and applied mathematics, 1987Elsevier
A new graphical display is proposed for partitioning techniques. Each cluster is represented
by a so-called silhouette, which is based on the comparison of its tightness and separation.
This silhouette shows which objects lie well within their cluster, and which ones are merely
somewhere in between clusters. The entire clustering is displayed by combining the
silhouettes into a single plot, allowing an appreciation of the relative quality of the clusters
and an overview of the data configuration. The average silhouette width provides an …
Abstract
A new graphical display is proposed for partitioning techniques. Each cluster is represented by a so-called silhouette, which is based on the comparison of its tightness and separation. This silhouette shows which objects lie well within their cluster, and which ones are merely somewhere in between clusters. The entire clustering is displayed by combining the silhouettes into a single plot, allowing an appreciation of the relative quality of the clusters and an overview of the data configuration. The average silhouette width provides an evaluation of clustering validity, and might be used to select an ‘appropriate’ number of clusters.
Elsevier