首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于关联规则的中文文本分类算法的改进
引用本文:张玉芳,杨柯,熊忠阳.基于关联规则的中文文本分类算法的改进[J].郑州大学学报(理学版),2007,39(2):114-117.
作者姓名:张玉芳  杨柯  熊忠阳
作者单位:重庆大学计算机学院,重庆,400030
基金项目:重庆市科委自然科学基金
摘    要:随着中文电子刊物和Web文档数量的飞速增加,中文文本自动分类工作变得日益重要.将文档视为事务,将关键词视为项,文本预处理时提出特征权重阈值,用构造的分类器对未知文档分类时,采用了CDD(Class Differen-tiate Degree)改进算法,对基于关联规则挖掘的中文文本自动分类方法进行了改进.实验结果表明,该算法能较快地获得可理解的规则并且具有较好的宏平均和微平均值.

关 键 词:关联规则挖掘  中文文本  文本自动分类算法
文章编号:1671-6841(2007)02-0114-04
修稿时间:12 20 2006 12:00AM

Improvement of Chinese Text Categorization Based on Associate Rules
ZHANG Yu-fang,YANG Ke,XIONG Zhong-yang.Improvement of Chinese Text Categorization Based on Associate Rules[J].Journal of Zhengzhou University:Natural Science Edition,2007,39(2):114-117.
Authors:ZHANG Yu-fang  YANG Ke  XIONG Zhong-yang
Institution:College of Computer Science, Chongqing Ke, XIONG Zhong-yang University, Chongqing 400030, China
Abstract:With the rapid expansion of Chinese electronic publication and web documents,the workof automatic Chinese text categorizationis i mportant increasingly.Anew method calledi mprovedautomatic Chinese text categorization based on associate ruels mining is proposed in the algo-rithm.Each documnet and keywordis represented as transaction anditem.Character thresholdisintroducedin the text being preprocessed.CDD(Class Differentiate Degree) i mproved algorithmis used when using the classifier to classify the unknown documents.Experi ments confirmthatthis algorithmgets the understandable rules of classifer faster and better in terms of the averagepromising recall and precision rate.
Keywords:associate rules mining  Chinese documents  text automatic classified algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号