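Intrusion detection on imbalanced dataset
Cite this article: DU Hong-le, ZHANG Yan, ZHANG Lin. Intrusion detection on imbalanced dataset[J]. Journal of Shandong University (Natural Science), 2016, 51(11): 50-57.
Authors: DU Hong-le  ZHANG Yan  ZHANG Lin
Institution: School of Mathematics and Computer Application, Shangluo University, Shangluo 726000, Shaanxi, China
Funding: Natural Science Basic Research Plan of Shaanxi Province (2015JM6347); Science and Technology Program of the Shaanxi Provincial Department of Education (15JK1218); Science and Technology Research Project of Shangluo University (15sky010)
Abstract: In the transductive support vector machine (TSVM), samples mislabeled during an iteration propagate their errors: they lower labeling accuracy in the next iteration, so errors accumulate and ultimately shift the final classification hyperplane. On imbalanced datasets, the traditional support vector machine (SVM) has a high classification error rate, so the samples TSVM labels in each iteration are not accurate. To address this, the paper proposes a transductive learning algorithm for imbalanced datasets. The algorithm dynamically computes the penalty factor of each class from the relationship among the density distributions of each class's support vectors, which improves labeling accuracy in each iteration. While retaining the progressive labeling and dynamic adjustment rules, it reduces the shift of the classification hyperplane. Finally, simulation experiments on the KDD CUP99 dataset show that the algorithm improves the classification performance of TSVM on imbalanced data and lowers the false alarm rate and the missed detection rate.

Keywords: support vector machine  semi-supervised learning  transductive learning  intrusion detection  imbalanced dataset
Received: 2015-09-21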

Intrusion detection on imbalanced dataset
DU Hong-le, ZHANG Yan, ZHANG Lin. Intrusion detection on imbalanced dataset[J]. Journal of Shandong University (Natural Science), 2016, 51(11): 50-57.
Authors: DU Hong-le  ZHANG Yan  ZHANG Lin
Institution: School of Mathematics and Computer Application, Shangluo University, Shangluo 726000, Shaanxi, China
Abstract: In the transductive support vector machine (TSVM), sample labeling errors propagate through the iterative process: they reduce labeling accuracy in the next iteration, so mistakes accumulate and eventually shift the classification hyperplane. On imbalanced datasets, the traditional support vector machine (SVM) has a higher classification error rate, which lowers TSVM's labeling accuracy in each iteration. This paper therefore proposes a TSVM algorithm for imbalanced datasets. It dynamically calculates the penalty factor of each class from the relationship among the class sample densities, improving labeling accuracy in each iteration. The algorithm retains TSVM's rules of progressive labeling and dynamic adjustment, and reduces the offset of the classification hyperplane. Finally, experimental results on the KDD CUP99 dataset show that the algorithm improves classification performance on imbalanced datasets, especially for minority-class samples.
Keywords: support vector machine  transductive learning  semi-supervised learning  imbalanced dataset  intrusion detection
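
The idea of giving each class its own penalty factor, and recomputing it as labels are added, can be illustrated with a small sketch. The abstract does not state the density-based formula, so the code below substitutes a simpler inverse-frequency heuristic (the same rule scikit-learn applies for `class_weight="balanced"`); it is not the authors' method. The names `class_penalties` and `base_c`, the class labels, and the sample counts are all hypothetical.

```python
from collections import Counter


def class_penalties(labels, base_c=1.0):
    """Per-class penalty factors from class frequencies.

    Stand-in heuristic (NOT the density-based rule from the paper):
        C_k = base_c * n_total / (n_classes * n_k)
    so a rarer class gets a larger penalty for misclassification.
    """
    counts = Counter(labels)
    total = len(labels)
    n_classes = len(counts)
    return {cls: base_c * total / (n_classes * n) for cls, n in counts.items()}


# Hypothetical imbalanced labeled set: 90 normal connections, 10 attacks.
labels = ["normal"] * 90 + ["attack"] * 10
penalties = class_penalties(labels)

# In a transductive loop, newly labeled samples change the class counts,
# so the penalties would be recomputed after every labeling round.
labels_next = labels + ["attack"] * 10  # assumed batch of newly labeled samples
penalties_next = class_penalties(labels_next)

print(penalties)
print(penalties_next)
```

With these assumed counts the minority class starts with a penalty ten times larger than the majority class, and the gap narrows once more minority samples are labeled. A density-based rule like the one the abstract describes would replace the frequency term with a quantity computed from the support vectors of each class.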
This article is indexed in CNKI and other databases.