首页 | 本学科首页   官方微博 | 高级检索  
     检索      

SVM网页分类中一种新的特征提取方法
引用本文:孙明柱,魏海平,顿绍坤,王居柱.SVM网页分类中一种新的特征提取方法[J].科学技术与工程,2011,11(6).
作者姓名:孙明柱  魏海平  顿绍坤  王居柱
作者单位:辽宁石油化工大学计算机与通信工程学院,抚顺,113001
摘    要:随着互联网的迅速发展,对网页正确分类显得越来越重要。网页分类的一个难点就是特征空间的维数比较大,支持向量机(SVM)分类方法显示出比其它分类方法更好的性能,但是训练样本时却花费了比其它算法更多的时间。本文提出了一种基于选择最确信的词来预测一个文本的类别的特征提取方法,通过中文文本实验,结果表明在不降低分类准确性的前提下,缩短了训练时间。

关 键 词:特征提取  Web分类  支持向量机
收稿时间:12/9/2010 4:32:43 PM
修稿时间:12/9/2010 4:32:43 PM

A New Feature Selection Method in SVM Web Page Classification
sunmingzhu,weihaiping,dunshaokun and wangjuzhu.A New Feature Selection Method in SVM Web Page Classification[J].Science Technology and Engineering,2011,11(6).
Authors:sunmingzhu  weihaiping  dunshaokun and wangjuzhu
Institution:SUN Ming-zhu,WEI Hai-ping,DUN Shao-kun,WANG Ju-zhu(School of Computer & Communication Engineering,Liaoning Shihua University,Fushun 113001,P.R.China)
Abstract:With the rapid development of Internet, the need of correctly Web Page Classification is becoming more and more critical. The major problem in Web page Classification is the high dimensionality of feature space. The Support Vector Machine classifier is shown to perform better than other Web page Classification algorithms. However, the time taken for training a Support Vector Machine model is more than other algorithms. A feature selection method based on the most certainly keyword to predict the category of a web Page was proposed in this paper. Through the experimental of Chinese text, the results show that this method reduces the training time, while maintaining the accuracy of Web Page Classification.
Keywords:Feature Selection  Web Page Classification  Support Vector Machine
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号