Asymptotic properties of univariate sample k-means clusters |
| |
Authors: | M Anthony Wong |
| |
Institution: | (1) Sloan School of Management, Massachusetts Institute of Technology, 02139 Cambridge, MA |
| |
Abstract: | A random sample of sizeN is divided intok clusters that minimize the within clusters sum of squares locally. Some large sample properties of this k-means clustering method (ask approaches withN) are obtained. In one dimension, it is established that the sample k-means clusters are such that the within-cluster sums of squares are asymptotically equal, and that the sizes of the cluster intervals are inversely proportional to the one-third power of the underlying density at the midpoints of the intervals. The difficulty involved in generalizing the results to the multivariate case is mentioned.This research was supported in part by the National Science Foundation under Grant MCS75-08374. The author would like to thank John Hartigan and David Pollard for helpful discussions and comments. |
| |
Keywords: | K-means clusters Within-clusters sum of squares Cluster lengths Non-standard asymptotics |
本文献已被 SpringerLink 等数据库收录! |