首页 | 本学科首页   官方微博 | 高级检索  
     检索      

差错网络的文本分类反馈校正方法
引用本文:卢玲,杨武,刘恒洋.差错网络的文本分类反馈校正方法[J].重庆邮电大学学报(自然科学版),2014,26(6):790-795.
作者姓名:卢玲  杨武  刘恒洋
作者单位:重庆理工大学 计算机科学与工程学院,重庆400054;重庆理工大学 计算机科学与工程学院,重庆400054;重庆理工大学 计算机科学与工程学院,重庆400054
基金项目:重庆市自然科学基金(CSTC2011jjA40002)
摘    要:中文新闻信息分类的类别数量大,难以一次性获取均衡的分类性能。针对这一问题,提出了一种基于差错网络的文本分类反馈校正方法。首先对文本进行一次分类,^根据分类结果生成有向差错网络,得到标注类别与真实类别的候选映射规则。然后计算差错网络的类别关联度参数,再对候选映射规则进行筛选,得到标注类别与真实类别的映射规则。最后根据映射规则进行二次分类,实现分类反馈校正。实验表明,差错网络清晰地反映了类 别的相关度。通过映射关系进行反馈校正,比普通文本分类的F值提高了6.2%。在NLP&CC2014评测中,基于差错网络的方法平均正确率达到73% ,证明了该方法的有效性。

关 键 词:差错网络  分类反馈校正  映射规则
收稿时间:2014/7/20 0:00:00
修稿时间:2014/10/28 0:00:00

Feedback correction method of text classification based on error network
LU Ling,YANG Wu and LIU Hengyang.Feedback correction method of text classification based on error network[J].Journal of Chongqing University of Posts and Telecommunications,2014,26(6):790-795.
Authors:LU Ling  YANG Wu and LIU Hengyang
Institution:College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400050, P.R. China;College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400050, P.R. China;College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400050, P.R. China
Abstract:In study of Chinese news texts classification, it''s hard to achieve a balanced performance because the number of categories. In order to solve the problem, this paper puts forward a feedback correction method of text classification based on error network. First, we performed the first level classification, generated the error network and extracted the candidate mapping rules set between annotation and real categories. Then we calculated the references of type relationship, extracted the mapping rules set between annotation and real categories by screening the candidate mapping rules. At last, we executed the second classification based on mapping rules set and implemented the feedback correction of classification. Experiments showed that the error network reflected the relationship between annotation categories and real categories clearly. It also showed that the method based on feedback correction improved the F measure about 6. 3% than traditional method. The average precision achieved at 73% in NLP&CC 2014 by implementing the method, which proved the effectiveness of the method.
Keywords:
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号