不均衡数据集下的入侵检测 Intrusion detection on imbalanced dataset期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

不均衡数据集下的入侵检测

引用本文：	杜红乐,张燕,张林.不均衡数据集下的入侵检测[J].山东大学学报(理学版),2016,51(11):50-57.

作者姓名：	杜红乐张燕张林

作者单位：	商洛学院数学与计算机应用学院, 陕西商洛 726000

基金项目：	陕西省自然科学基础研究计划资助项目(2015JM6347);陕西省教育厅科技计划项目(15JK1218);商洛学院科学与技术研究项目(15sky010)

摘要：	在直推式支持向量机(transductive support vector machine, TSVM)中,迭代过程中样本标注错误会导致错误传递,影响下一次迭代中样本标注准确度,使得错误不断地被积累,造成最终分类超平面的偏移。在不均衡数据集下,传统支持向量机(support vector machine, SVM)对样本分类的错误率较高,导致TSVM在每次迭代中标注样本准确度不高。针对此,本文提出一种不均衡数据集下的直推式学习算法,该算法依据各类支持向量的密度分布关系动态计算各类的惩罚因子,提高每次迭代中样本标注的准确度,算法在继承渐进赋值和动态调整规则的基础上,减少分类超平面的偏移。最后,在KDD CUP99数据集上的仿真实验结果表明该算法能够提高TSVM在不均衡数据下的分类性能,降低误警率和漏报率。
关键词：	支持向量机半监督学习直推式学习入侵检测不均衡数据集
收稿时间：	2015-09-21
Intrusion detection on imbalanced dataset

DU Hong-le,ZHANG Yan,ZHANG Lin.Intrusion detection on imbalanced dataset[J].Journal of Shandong University,2016,51(11):50-57.

Authors:	DU Hong-le ZHANG Yan ZHANG Lin

Institution:	School of Mathematics and Computer Application, Shangluo University, Shangluo 726000, Shaanxi, China

Abstract:	In transductive support vector machine, sample labeling error will result in error propagation in the iterative process. It affects the accuracy of sample labeling in the next iteration and makes mistakes constantly being accumulated. Eventually leading to classification hyperplane offset. Under imbalanced dataset, there is higher classification error rate of traditional SVM that causes the labeling error rate in each iterative for TSVM. Therefore, the algorithm of TSVM for imbalanced dataset is proposed in this paper. We dynamic calculates the penalty factor of every class according to the relationship of sample density of every class to improve the accuracy of labeling sample in each iterative. The algorithm inherits its rules of progressive labeling and dynamic adjusting, and reduces the offset of the classification hyperplane. Finally, experiment results with KDD CUP99 dataset show the algorithm can improve the classification performance at imbalanced dataset, especially for the minority class samples.

Keywords:	support vector machine transductive learning semi-supervised learning imbalanced dataset intrusion detection
本文献已被 CNKI 等数据库收录！
	点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
	点击此处可从《山东大学学报(理学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏