首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于Seeds集和成对约束的主动半监督聚类算法
引用本文:陈志雨,王慧君,胡明,刘 钢.一种基于Seeds集和成对约束的主动半监督聚类算法[J].吉林大学学报(理学版),2017,55(3):664-672.
作者姓名:陈志雨  王慧君  胡明  刘 钢
作者单位:1. 长春工业大学 计算机科学与工程学院, 长春 130012; 2. 长春工程学院 校长办公室, 长春 130012
摘    要:针对半监督聚类算法中监督信息使用不充分,监督信息中信息含有量低的问题,提出一种结合主动学习的半监督聚类算法.首先结合使用数据的类别标记和成对约束信息,指导Kmeans聚类过程,设计出一种基于Seeds集和成对约束的半监督聚类算法SC-Kmeans;其次将主动学习算法引入到SC-Kmeans中,以尽量小的代价选取信息含有量更高的监督信息,提高SC-Kmeans算法的聚类精度;最后在UCI标准数据集上进行仿真实验.实验结果表明,该算法取得了较好的聚类效果,有效提高了聚类准确率.

关 键 词:Seeds集    主动学习    成对约束  半监督聚类    Kmeans算法  
收稿时间:2016-09-09

An Active Semi-supervised Clustering AlgorithmBased onSeeds Set and Pairwise Constraints
CHEN Zhiyu,WANG Huijun,HU Ming,LIU Gang.An Active Semi-supervised Clustering AlgorithmBased onSeeds Set and Pairwise Constraints[J].Journal of Jilin University: Sci Ed,2017,55(3):664-672.
Authors:CHEN Zhiyu  WANG Huijun  HU Ming  LIU Gang
Institution:1. College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China;\=2. Office of Principal, Jilin Vocational and Technical Institute Communications, Changchun 130012, China
Abstract:Aiming at the problem that the supervised information was not sufficient and the information content of supervision information was lowin semi-supervised clustering algorithm, we proposed a semi\|supervised clustering algorithm based on active learning. Firstly, we designed a semi\|supervised clustering algorithm based on Seeds set and pairwise constraints (SC\|Kmeans) to guide the clustering process of the Kmeans algorithm by using the labeled data a nd pairwise constraints. Secondly, we introduced the active learning algorithm into SC\|Kmeans, in order to select a higher amount of supervision information with a small cost and improve the clustering accuracy of SC\|Kmeans algorithm. Finally, the simulation experiments were performed on machine learning repository (UCI) standard data sets. The experimental results show that the proposed algorithm can achieve better clustering effect, and effectively improve the clustering accuracy.
Keywords:semi-supervised clustering  pairwise constraint  Kmeans algorithm  active learning  Seeds set
本文献已被 CNKI 等数据库收录!
点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号