首页 | 本学科首页   官方微博 | 高级检索  
     检索      

垃圾邮件过滤中特征选择方法研究
引用本文:王军,史科,王辉.垃圾邮件过滤中特征选择方法研究[J].合肥工业大学学报(自然科学版),2009,32(12).
作者姓名:王军  史科  王辉
作者单位:1. 合肥工业大学,信息与网络中心,安徽,合肥,230009
2. 安徽大学,计算机科学与技术学院,安徽,合肥,230039
摘    要:文章对垃圾邮件过滤中的特征选择问题进行了研究,引入"词共现模型"考虑词语之间的语义联系信息,和传统的信息增益特征选择方法结合表示邮件,采用神经网络方法对邮件进行分类得到垃圾邮件过滤器.实验表明,文章提出的将词共现对和信息增益结合的特征选择方法能够提高垃圾邮件过滤的精确度.

关 键 词:垃圾邮件过滤  信息增益  词共现模型  神经网络  交叉覆盖算法

Research on the feature selection method for spam filtering
WANG Jun,SHI Ke,WANG Hui.Research on the feature selection method for spam filtering[J].Journal of Hefei University of Technology(Natural Science),2009,32(12).
Authors:WANG Jun  SHI Ke  WANG Hui
Abstract:Feature selection for spam filtering is researched in this paper.The word co-occurrence model is introduced to analyze the semantic relation between phrases.Features representing ernails are selected by word co-occurrence and information gain.The neural network is used to classify emails and construct the spam filter.The experiments show that the precision of spam filtering is increased by feature selection which combines word co-occurrence and information gain.
Keywords:spam filtering  information gain  word co-occurrence model  neural network  crossover algorithm
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号