首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Dealing with Distances and Transformations for Fuzzy C-Means Clustering of Compositional Data
Authors:Javier Palarea-Albaladejo  Josep Antoni Martín-Fernández  Jesús A Soto
Institution:1. Biomathematics and Statistics Scotland, JCMB, The King??s Buildings, Edinburgh, EH9 3JZ, UK
2. Universitat de Girona, Girona, Spain
3. Universidad Cat??lica San Antonio, Murcia, Spain
Abstract:Clustering techniques are based upon a dissimilarity or distance measure between objects and clusters. This paper focuses on the simplex space, whose elements??compositions??are subject to non-negativity and constant-sum constraints. Any data analysis involving compositions should fulfill two main principles: scale invariance and subcompositional coherence. Among fuzzy clustering methods, the FCM algorithm is broadly applied in a variety of fields, but it is not well-behaved when dealing with compositions. Here, the adequacy of different dissimilarities in the simplex, together with the behavior of the common log-ratio transformations, is discussed in the basis of compositional principles. As a result, a well-founded strategy for FCM clustering of compositions is suggested. Theoretical findings are accompanied by numerical evidence, and a detailed account of our proposal is provided. Finally, a case study is illustrated using a nutritional data set known in the clustering literature.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号