首页 | 本学科首页   官方微博 | 高级检索  
     


Weighting and selection of variables for cluster analysis
Authors:R. Gnanadesikan  J. R. Kettenring  S. L. Tsao
Affiliation:(1) Department of Statistics, Rutgers University, 08903 New Brunswick, NJ;(2) Bellcore, 445 South Street, 07960 Morristown, NJ;(3) AT&T Bell Laboratories, 101 Crawford's Corner Road, 07733 Holmdel, NJ
Abstract:One of the thorniest aspects of cluster analysis continues to be the weighting and selection of variables. This paper reports on the performance of nine methods on eight ldquoleading caserdquo simulated and real sets of data. The results demonstrate shortcomings of weighting based on the standard deviation or range as well as other more complex schemes in the literature. Weighting schemes based upon carefully chosen estimates of within-cluster and between-cluster variability are generally more effective. These estimates do not require knowledge of the cluster structure. Additional research is essential: worry-free approaches do not yet exist.
Keywords:Clustering  Variable selection  Feature selection  Variable weighting  Variable importance  Pattern recognition  Discriminant analysis
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号