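A New Spam Feature Selection Algorithm Based on Information Gain
Cite this article: LI Meng, LIU Yuanning. A New Spam Feature Selection Algorithm Based on Information Gain[J]. Journal of Jilin University (Science Edition), 2017, 55(2): 379-382.
Authors: LI Meng, LIU Yuanning
Affiliation: College of Computer Science and Technology, Jilin University, Changchun 130012, China
Abstract: Starting from the traditional information gain feature selection algorithm, this paper introduces the concepts of intra-class dispersity and inter-class concentration and combines them with traditional information gain. This addresses the performance degradation that information gain suffers because it ignores how feature terms are distributed, and it improves the efficiency of the information gain algorithm. The improved feature selection algorithm is used in spam filtering experiments and compared with traditional feature selection algorithms under different classifiers. The experimental results show that the improved algorithm performs better.

Keywords: information gain; spam; intra-class dispersity; feature selection; inter-class concentration
Received: 2016-06-24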

A New Spam Feature Selection Algorithm Based on Information Gain
LI Meng, LIU Yuanning. A New Spam Feature Selection Algorithm Based on Information Gain[J]. Journal of Jilin University: Sci Ed, 2017, 55(2): 379-382.
Authors:LI Meng  LIU Yuanning
Institution:College of Computer Science and Technology, Jilin University, Changchun 130012, China
Abstract:Building on the traditional information gain feature selection algorithm, the concepts of intra-class dispersity and inter-class concentration are proposed. Combined with the traditional information gain algorithm, they address the performance degradation caused by ignoring the distribution of feature terms and improve the efficiency of the information gain algorithm. The improved feature selection algorithm was applied to spam filtering experiments and compared with traditional feature selection algorithms under different classifiers. The experimental results show that the improved feature selection algorithm performs better.
Keywords:information gain  spam  intra-class dispersity  feature selection  inter-class concentration
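
For orientation, the traditional information gain baseline that the abstract refers to can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not define the intra-class dispersity or inter-class concentration terms, so they are left out here. The function names (`entropy`, `information_gain`, `select_features`), the representation of each email as a set of tokens, and the toy data are all assumptions made for this example.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(docs, labels, term):
    """Information gain of the binary feature 'term occurs in the document'.

    docs   -- list of token sets, one per email (assumed representation)
    labels -- list of class labels aligned with docs, e.g. "spam" / "ham"
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    total = len(labels)
    # Expected entropy of the class after observing presence/absence of the term.
    conditional = (
        (len(with_term) / total) * entropy(with_term)
        + (len(without_term) / total) * entropy(without_term)
    )
    return entropy(labels) - conditional


def select_features(docs, labels, k):
    """Return the k terms with the highest information gain.

    Ties are broken alphabetically so the result is deterministic.
    """
    vocabulary = set().union(*docs)
    scores = {t: information_gain(docs, labels, t) for t in vocabulary}
    return sorted(vocabulary, key=lambda t: (-scores[t], t))[:k]


if __name__ == "__main__":
    # Hypothetical toy corpus: two spam and two legitimate ("ham") emails.
    docs = [
        {"win", "prize"},
        {"win", "money"},
        {"meeting", "notes"},
        {"meeting", "money"},
    ]
    labels = ["spam", "spam", "ham", "ham"]
    for term in select_features(docs, labels, 2):
        print(term, information_gain(docs, labels, term))
```

In this toy corpus, "money" appears once in each class and therefore scores zero. Plain information gain depends only on document-level presence counts per class, so it does not look at how a term is spread within a class or concentrated across classes. That is the limitation the abstract says the proposed dispersity and concentration measures are meant to address.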
This article is indexed in CNKI and other databases.