

Recognition of Chinese Organization Name Using Co-training
Authors:KE Xiao  LI Shao-zi  CHEN Jin-xiu
Institution:Department of Cognitive Science, Fujian Key Laboratory of the Brain-like Intelligent Systems, Xiamen University, Xiamen 361005, China
Abstract:Recognizing Chinese organization names is an important and difficult task in natural language processing. To reduce the amount of tagged corpus required and to make use of untagged corpus, we combine Co-training with support vector machines (SVM) and conditional random fields (CRF) to improve recognition results. Following the principles that views should be uncorrelated and compatible, we construct classifiers from different views within SVM alone, within CRF alone, and across a combination of the two models. We also modify a heuristic algorithm for selecting untagged samples to reduce time complexity. Experimental results show that, with the same amount of tagged data, Co-training achieves an F-measure 10% higher than SVM or CRF alone; to reach the same F-measure, Co-training requires up to 70% less tagged data.
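The Co-training loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the SVM and CRF models are replaced by a toy nearest-centroid classifier, the two "views" are small hand-made feature vectors, and the names `CentroidClassifier`, `co_train`, `rounds`, and `per_round` are hypothetical. Selection here is a plain top-confidence pick, not the modified heuristic selection algorithm mentioned in the abstract. It only shows the general idea: one classifier per view, each labeling the untagged samples it is most confident about, with those samples added to the shared tagged set.

```python
import math

class CentroidClassifier:
    """Toy stand-in for SVM/CRF (assumption): nearest class centroid on one view."""

    def fit(self, X, y):
        self.centroids = {}
        for label in set(y):
            rows = [x for x, t in zip(X, y) if t == label]
            # Mean of each feature column for this class.
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict_with_confidence(self, x):
        # Needs at least two classes; confidence is the relative distance margin.
        dists = sorted((math.dist(x, c), label) for label, c in self.centroids.items())
        (d0, best), (d1, _) = dists[0], dists[1]
        return best, (d1 - d0) / (d1 + d0 + 1e-12)


def co_train(labeled, labels, unlabeled, rounds=5, per_round=1):
    """Each sample is a pair (view_a_features, view_b_features)."""
    labeled, labels, pool = list(labeled), list(labels), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        clf_a = CentroidClassifier().fit([s[0] for s in labeled], labels)
        clf_b = CentroidClassifier().fit([s[1] for s in labeled], labels)
        picks = {}  # pool index -> (confidence, label)
        for clf, view in ((clf_a, 0), (clf_b, 1)):
            scored = []
            for i, s in enumerate(pool):
                label, conf = clf.predict_with_confidence(s[view])
                scored.append((conf, i, label))
            scored.sort(reverse=True)
            # Each view contributes its most confident untagged samples.
            for conf, i, label in scored[:per_round]:
                if i not in picks or conf > picks[i][0]:
                    picks[i] = (conf, label)
        for i, (_, label) in picks.items():
            labeled.append(pool[i])
            labels.append(label)
        pool = [s for i, s in enumerate(pool) if i not in picks]
    # Retrain both view classifiers on the enlarged tagged set.
    clf_a = CentroidClassifier().fit([s[0] for s in labeled], labels)
    clf_b = CentroidClassifier().fit([s[1] for s in labeled], labels)
    return clf_a, clf_b, labeled, labels


# Hypothetical toy data: label "ORG" for organization-name tokens, "O" otherwise.
seed = [((0.0, 0.1), (0.0,)), ((1.0, 0.9), (1.0,))]
seed_labels = ["O", "ORG"]
untagged = [((0.1, 0.0), (0.2,)), ((0.9, 1.0), (0.8,)),
            ((0.2, 0.2), (0.1,)), ((0.8, 0.7), (0.9,))]
clf_a, clf_b, all_samples, all_labels = co_train(seed, seed_labels, untagged)
```

In a setting closer to the paper, the two classifiers would be SVM and CRF models (or two feature views of one model), and the stopping rule and number of samples added per round would need tuning on held-out data.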
Keywords:Co-training  named entity recognition  conditional random fields (CRF)  support vector machines (SVM)
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.
