首页 | 本学科首页   官方微博 | 高级检索  
     


Solving Non-Uniqueness in Agglomerative Hierarchical Clustering Using Multidendrograms
Authors:Alberto Fernández  Sergio Gómez
Affiliation:1. Universitat Rovira i Virgili, Tarragona, Spain
2. Departament d’Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili, Campus Sescelades, Avinguda dels Pa?sos Catalans 26, E–43007, Tarragona, Spain
Abstract:In agglomerative hierarchical clustering, pair-group methods suffer from a problem of non-uniqueness when two or more distances between different clusters coincide during the amalgamation process. The traditional approach for solving this drawback has been to take any arbitrary criterion in order to break ties between distances, which results in different hierarchical classifications depending on the criterion followed. In this article we propose a variable-group algorithm that consists in grouping more than two clusters at the same time when ties occur. We give a tree representation for the results of the algorithm, which we call a multidendrogram, as well as a generalization of the Lance andWilliams’ formula which enables the implementation of the algorithm in a recursive way. The authors thank A. Arenas for discussion and helpful comments. This work was partially supported by DGES of the Spanish Government Project No. FIS2006–13321–C02–02 and by a grant of Universitat Rovira i Virgili.
Keywords:Agglomerative methods  Cluster analysis  Hierarchical classification  Lance and Williams’   formula  Ties in proximity
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号