On some significance tests in cluster analysis |
| |
Authors: | Bock H.H. |
| |
Affiliation: | (1) Institut für Statistik und Wirtschaftsmathematik, Technical University Aachen, Wüllnerstr. 3, D-5100 Aachen, West Germany |
| |
Abstract: | We investigate the properties of several significance tests for distinguishing between the hypothesis H of a homogeneous population and an alternative A involving clustering or heterogeneity, with emphasis on the case of multidimensional observations x_1, ..., x_n ∈ ℝ^p. Four types of test statistics are considered: the (s-th) largest gap between observations, their mean distance (or similarity), the minimum within-cluster sum of squares resulting from a k-means algorithm, and the resulting maximum F statistic. The asymptotic distributions under H are given for n → ∞, and the asymptotic power of the tests is derived for neighboring alternatives. |
| |
Keywords: | Significance test; Homogeneity; Heterogeneity; Gap test; Minimum within-cluster sum of squares; Maximum F statistics; Asymptotic normal distribution |
|