
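Gaussian mixture clustering based on three-way decision
Cite this article: WAN Renxia, WANG Daqing, MIAO Duoqian. Gaussian mixture clustering based on three-way decision[J]. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2021, 33(5): 806-815.
Authors: WAN Renxia  WANG Daqing  MIAO Duoqian
Affiliations: School of Mathematics and Information Science, North Minzu University, Yinchuan 750021; Department of Computer Science and Technology, Tongji University, Shanghai 201804
Funding: National Natural Science Foundation of China (61662001); Fundamental Research Funds for the Central Universities (FWNX04); Natural Science Foundation of Ningxia (2021AAC03203)
Abstract: Gaussian mixture model (GMM) clustering carries a high risk of misjudgment when membership is ambiguous, that is, when a sample's probabilities of belonging to several clusters are close. To address this, the idea of three-way decision is integrated into the GMM and a Gaussian mixture clustering algorithm based on three-way decision is proposed. The new algorithm computes the posterior probability that each data object belongs to each cluster and uses it as the decision evaluation function to determine the positive region and the boundary region of the clustering result. Because the new algorithm handles boundary objects more cautiously than ordinary Gaussian mixture clustering, it avoids the risk incurred by directly deciding that an object does or does not belong to a cluster, and thereby effectively reduces the cost of misjudgment. Experiments further show that the proposed algorithm not only inherits the characteristics of Gaussian mixture clustering and achieves good clustering performance, but also clusters non-spherical data well.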
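
The decision evaluation function mentioned in the abstract can be written down as a short worked formulation. The posterior below is the standard GMM responsibility; the two-threshold assignment rule and the symbols $\alpha$, $\beta$, $\mathrm{POS}$ and $\mathrm{BND}$ are illustrative assumptions, since the abstract does not state the exact rule used in the paper.

```latex
% Posterior probability (responsibility) of cluster k for object x_i
% under a K-component GMM with weights \pi_k, means \mu_k, covariances \Sigma_k:
\gamma_{ik} = \frac{\pi_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)}
                   {\sum_{j=1}^{K} \pi_j \, \mathcal{N}(x_i \mid \mu_j, \Sigma_j)}

% Assumed three-way rule with thresholds 0 \le \beta < \alpha \le 1:
\text{if } \max_k \gamma_{ik} \ge \alpha:\quad
    x_i \in \mathrm{POS}(C_{k^*}), \qquad k^* = \arg\max_k \gamma_{ik}
\text{otherwise}:\quad
    x_i \in \mathrm{BND}(C_k) \ \text{ for every } k \text{ with } \gamma_{ik} \ge \beta
```

Under this assumed rule, an object with two nearly equal posteriors is deferred to the boundary regions of both clusters instead of being forced into one of them.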

Keywords: three-way decision  Gaussian mixture model  clustering  posterior probability  boundary region
Received: 2021/5/20
Revised: 2021/6/24

Gaussian mixture clustering based on three-way decision
WAN Renxia, WANG Daqing, MIAO Duoqian. Gaussian mixture clustering based on three-way decision[J]. Journal of Chongqing University of Posts and Telecommunications, 2021, 33(5): 806-815.
Authors:WAN Renxia  WANG Daqing  MIAO Duoqian
Institution:College of Mathematics and Information Science, North Minzu University, Yinchuan 750021; College of Computer Science and Technology, Tongji University, Shanghai 201804
Abstract:When membership is ambiguous, i.e., a data object belongs to several clusters with similar probabilities, clustering with a Gaussian mixture model (GMM) carries a high risk of misjudgment. In this paper, the idea of three-way decision is integrated into the GMM, and a Gaussian mixture clustering algorithm based on three-way decision is proposed. The new algorithm computes the posterior probability that each data object belongs to each cluster and uses it as the decision evaluation function to determine the positive and boundary regions of the clustering result. Boundary objects are handled with a more prudent strategy than in ordinary Gaussian mixture clustering, which avoids the risk of directly deciding whether a data object belongs to a given cluster and effectively reduces the cost of misjudgment. The experimental results show that the proposed algorithm not only inherits the characteristics of the GMM and clusters well, but also performs well on non-spherical data clusters.
Keywords:three-way decision  Gaussian mixture model  clustering  posterior probability  boundary region
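
As a rough illustration of the idea in the abstract, the following NumPy sketch computes GMM posteriors for given parameters and splits objects into positive and boundary regions. It is not the authors' algorithm: the function names, the thresholds `alpha` and `beta`, and the assignment rule are assumptions made for illustration, and the mixture parameters are fixed by hand instead of being fitted with EM.

```python
import numpy as np

def gaussian_pdf(X, mean, cov):
    """Multivariate normal density evaluated at each row of X."""
    d = X.shape[1]
    diff = X - mean
    inv = np.linalg.inv(cov)
    norm = np.sqrt((2 * np.pi) ** d * np.linalg.det(cov))
    # Quadratic form diff_i^T inv diff_i for every row i.
    expo = -0.5 * np.einsum("ij,jk,ik->i", diff, inv, diff)
    return np.exp(expo) / norm

def posteriors(X, weights, means, covs):
    """Posterior probability of each mixture component for each row of X."""
    dens = np.column_stack(
        [w * gaussian_pdf(X, m, c) for w, m, c in zip(weights, means, covs)]
    )
    return dens / dens.sum(axis=1, keepdims=True)

def three_way_assign(post, alpha=0.7, beta=0.3):
    """Hypothetical three-way rule (thresholds are assumptions).

    Returns two boolean masks of shape (n_samples, n_clusters):
    positive[i, k] is True when object i is confidently in cluster k,
    boundary[i, k] is True when object i is deferred to cluster k's boundary.
    """
    n, k = post.shape
    positive = np.zeros((n, k), dtype=bool)
    boundary = np.zeros((n, k), dtype=bool)
    best = post.argmax(axis=1)
    confident = post.max(axis=1) >= alpha
    # Confident objects go to the positive region of their best cluster.
    positive[np.where(confident)[0], best[confident]] = True
    # Remaining objects join the boundary of every cluster with posterior >= beta.
    boundary[~confident] = post[~confident] >= beta
    return positive, boundary

# Two equally weighted unit-covariance components; the third point is equidistant.
weights = [0.5, 0.5]
means = [np.array([0.0, 0.0]), np.array([4.0, 0.0])]
covs = [np.eye(2), np.eye(2)]
X = np.array([[0.0, 0.0], [4.0, 0.0], [2.0, 0.0]])

post = posteriors(X, weights, means, covs)
positive, boundary = three_way_assign(post)
```

Here the first two points land in the positive regions of their nearest components, while the midpoint, whose posteriors are equal, is placed in the boundary regions of both. With more than two clusters, a `beta` above 1/K can leave an ambiguous object in no boundary region, so the thresholds would need to be chosen with that in mind.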
This article is indexed in Wanfang Data and other databases.