
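An Improved Cost-Sensitive Random Forest Algorithm
Cite this article: Yang Jieming, Gao Cong, Qu Zhaoyang, Kan Zhongfeng, Gao Ye, Chang Cheng. An Improved Cost-Sensitive Random Forest Algorithm[J]. Science Technology and Engineering, 2018, 18(6).
Authors: Yang Jieming  Gao Cong  Qu Zhaoyang  Kan Zhongfeng  Gao Ye  Chang Cheng
Affiliations: Northeast Electric Power University; School of Information Engineering, Northeast Electric Power University; Northeast Electric Power University; State Grid Jilin Electric Power Supply Company; State Grid Jilin Electric Power Supply Company; State Grid Jilin Electric Power Supply Company
Funding: National Natural Science Foundation of China (General Program, Key Program, Major Program); Jilin Province Science and Technology Plan Project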
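Abstract: When classifying imbalanced data, random forests tend to favor the majority class and neglect the minority class. Cost-sensitive learning can be applied when training the classifier, but in traditional cost-sensitive random forest algorithms the cost function does not account for the actual distribution of the sample set or for feature weights, and the voting stage does not account for performance differences among the base classifiers. This paper proposes an improved cost-sensitive random forest algorithm, ICSRF. The algorithm first constructs a cost function from the actual distribution of the imbalanced data set and introduces the weight distance into that cost function; it then applies weighted voting according to the performance of each base classifier, improving classification accuracy. Experimental results show that ICSRF effectively improves classification performance on the minority class and handles imbalanced data well.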

Keywords: cost-sensitive  random forest  imbalanced data  weight distance
Received: 2017/7/22
Revised: 2017/9/19

An Improved Cost-sensitive Algorithm Based on Random Forest
Yang Jieming, Gao Cong, Qu Zhaoyang, Kan Zhongfeng, Gao Ye and Chang Cheng. An Improved Cost-sensitive Algorithm Based on Random Forest[J]. Science Technology and Engineering, 2018, 18(6).
Authors: Yang Jieming  Gao Cong  Qu Zhaoyang  Kan Zhongfeng  Gao Ye and Chang Cheng
Institution: Northeast Electric Power University; School of Information Engineering, Northeast Electric Power University; Northeast Electric Power University; Jilin Electric Power Supply Company; Jilin Electric Power Supply Company; Jilin Electric Power Supply Company
Abstract: On imbalanced data, random forest tends to favor the majority classes over the minority classes. Cost-sensitive methods can be combined with random forest to address this imbalance problem. However, the traditional cost-sensitive algorithm based on random forest does not consider the actual distribution of the data set or the feature weights, and in the voting stage of the random forest it does not consider the performance differences among base classifiers. This paper proposes ICSRF, an improved cost-sensitive algorithm based on random forest, which constructs a cost function from the actual distribution of the imbalanced data set and introduces the weight distance into it, then applies weighted voting according to the performance of each base classifier to improve classification accuracy. The experimental results show that ICSRF achieves a higher accuracy rate and effectively improves the recognition rate of the minority classes.
Keywords:cost-sensitive  random forest  imbalanced data  weight distance
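
The two ideas named in the abstract, a cost derived from the class distribution and a vote weighted by base-classifier performance, can be illustrated with a minimal Python sketch. It is not the ICSRF algorithm: the abstract gives no formulas, so the inverse-frequency cost below is a common stand-in that omits the weight distance term, and the names `class_costs`, `weighted_vote`, and `tree_weights` are hypothetical. The per-tree weights are assumed to come from some validation score of each tree.

```python
from collections import Counter, defaultdict


def class_costs(labels):
    """Inverse-frequency misclassification cost per class.

    Assumed form, not the paper's cost function: a class holding
    exactly 1/k of the samples gets cost 1.0, rarer classes get more.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {c: n_samples / (n_classes * m) for c, m in counts.items()}


def weighted_vote(predictions, tree_weights, costs=None):
    """Combine one predicted label per tree into a single label.

    Each tree contributes tree_weights[i] to the class it predicted,
    optionally scaled by that class's cost. Ties go to the class
    that was seen first.
    """
    scores = defaultdict(float)
    for label, weight in zip(predictions, tree_weights):
        scale = costs[label] if costs is not None else 1.0
        scores[label] += weight * scale
    return max(scores, key=scores.get)
```

With 90 majority and 10 minority samples, `class_costs` assigns the minority class a cost nine times that of the majority class, so a single well-performing tree that votes for the minority class can outweigh two trees that vote for the majority class once the costs are passed to `weighted_vote`.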