首页 | 本学科首页   官方微博 | 高级检索  
     检索      

剔除支持向量回归中异常数据算法
引用本文:曾绍滑,魏延,唐远炎.剔除支持向量回归中异常数据算法[J].重庆大学学报(自然科学版),2012,35(12):120-132.
作者姓名:曾绍滑  魏延  唐远炎
作者单位:重庆大学 计算机学院,重庆 400044;重庆师范大学 模式分析与信息处理研究所,重庆 401331;重庆师范大学 模式分析与信息处理研究所,重庆 401331;重庆大学 计算机学院,重庆 400044
基金项目:重庆市教委科学技术研究项目(KJ110632);重庆市自然科学基金资助项目(CSTC2011JJA4008)
摘    要:定义了回归问题中异常数据及其不满足回归映射关系差异程度的度量,分析了回归问题中理论映射模式与回归估计模式关系,提出并证明了回归问题中逐个剔除异常数据,建立回归估计模式逐步逼近理论模式的逐步逼近定理,并构建了以逐步逼近定理为理论依据的剔除支持向量回归中异常数据算法,理论分析了算法的收敛性和有效性。然后,引入逐步搜索算法改进剔除异常数据算法以解决大规模样本的支持向量回归中异常数据剔除问题,理论分析显示改进算法也是收敛的和有效的。最后,应用给定已知函数生成样本和UCI机器学习数据库样本数据仿真实验,结果显示算法是有效的和鲁棒的。

关 键 词:支持向量回归  异常数据  剔除异常数据算法  仿真

Algorithm of removing outliers in SVR
ZENG Shaohu,WEI Yan and TANG Yuanyan.Algorithm of removing outliers in SVR[J].Journal of Chongqing University(Natural Science Edition),2012,35(12):120-132.
Authors:ZENG Shaohu  WEI Yan and TANG Yuanyan
Institution:College of Computer Science, Chongqing University, Chongqing 400044, China; Institute of Pattern Analysis & Information Processing, Chongqing Normal University, Chongqing 401331, China;Institute of Pattern Analysis & Information Processing, Chongqing Normal University, Chongqing 401331, China;College of Computer Science, Chongqing University, Chongqing 400044, China
Abstract:The outlier and the measurement that an outlier does not fit the theoretical model in the regression problems are defined. The relationship between the theoretical model and the regression model in the regression problem is analyzed. An approximate theorem is proposed and verified by deleting outlier one by one to construct SVR to approximate the theoretical model. An algorithm of detecting outliers in the SVR problems is constructed based on the approximate theorem. The theoretical analysis of the convergence and effectiveness of the proposed algorithm is given. Then, the step-by-step search algorithm is introduced to improve the outlier removing algorithm to remove outliers in SVR with large-scale samples. The theoretical analysis shows that the improved algorithm is convergent and effective. Finally, the samples produced by two test functions and the samples in UCI data set are used for simulation, and the results show that the proposed algorithm is effective and robust.
Keywords:SVR(support vector regression)  algorithm  algorithm of detecting outliers  simulation
点击此处可从《重庆大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号