首页 | 本学科首页   官方微博 | 高级检索  
     检索      

结合邻域信息和标记相关性的在线多标记流特征选择算法
引用本文:包丰浩,林耀进,李育林,毛煜.结合邻域信息和标记相关性的在线多标记流特征选择算法[J].重庆邮电大学学报(自然科学版),2023,35(1):79-89.
作者姓名:包丰浩  林耀进  李育林  毛煜
作者单位:闽南师范大学 计算机学院,福建 漳州 363000;闽南师范大学 数据科学与智能应用福建省高等学校重点实验室,福建 漳州 363000
基金项目:国家自然科学基金面上项目(61672272);福建省自然科学基金重点项目(2021J02049)
摘    要:现有大多数多标记流特征选择算法在进行特征选择时,往往忽略标记间的相关性,易导致算法预测精度的下降。为解决这一问题,提出一种结合邻域信息和标记相关性的在线多标记流特征选择算法;定义自适应邻域关系解决邻域粗糙集的粒度选择问题,将其推广到多标记学习中;利用互信息计算标记间的相关性得到标记权重;通过邻域粗糙集和标记权重评估特征和标记间的相关性,并设计特征在线重要度分析、在线相关性分析和在线冗余度分析3种指标,以实现在线评价动态候选特征。在7组多标记数据集以及5个评价指标上的实验结果表明,所提算法综合性能较优。

关 键 词:流特征  特征选择  邻域粗糙集  标记相关性  多标记学习
收稿时间:2022/7/28 0:00:00
修稿时间:2022/12/5 0:00:00

Online multi-label streaming feature selection algorithm via combining neighborhood information and label correlation
BAO Fenghao,LIN Yaojin,LI Yulin,MAO Yu.Online multi-label streaming feature selection algorithm via combining neighborhood information and label correlation[J].Journal of Chongqing University of Posts and Telecommunications,2023,35(1):79-89.
Authors:BAO Fenghao  LIN Yaojin  LI Yulin  MAO Yu
Institution:School of Computer Science and Engineering, Minnan Normal University, Zhangzhou 363000, P. R. China;Key Laboratory of Data Science and Intelligence Application, Minnan Normal University, Zhangzhou 363000, P. R. China
Abstract:Most of the existing multi-label streaming feature selection algorithms tend to ignore the correlation between labels, which easily leads to the decline of prediction accuracy. To address this problem, an online multi-label streaming feature selection algorithm via combining neighborhood information and label correlation is proposed. Firstly, the adaptive neighborhood relationship is defined to solve the problem of granularity selection of neighborhood rough set, and then it is extended to multi-label learning. Secondly, the mutual information is used to calculate the correlation between labels to obtain the label weight. Finally, the neighborhood rough set and label weights are integrated to evaluate the correlation between features and labels. And three metrics, i.e., feature online importance analysis, online relevance analysis and online redundancy analysis, are designed to evaluate the online candidate features. Experiment results on 7 multi-label datasets and 5 evaluation metrics show that the comprehensive performance of the proposed algorithm is effective.
Keywords:streaming feature  feature selection  neighborhood rough set  label correlation  multi-label learning
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号