首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种高性能英文词性标注器的设计与实现
引用本文:吕琳,周世斌,刘玉树.一种高性能英文词性标注器的设计与实现[J].北京理工大学学报,2005,25(10):876-879.
作者姓名:吕琳  周世斌  刘玉树
作者单位:北京理工大学,信息科学技术学院计算机科学工程系,北京,100081
摘    要:针对统计和规则方法各自的优点和局限,提出运用Viterbi和FTBL(fast transformation-based learning)算法相级联的算法,实现一种英文自动词性标注器.该级联方法以FTBL算法为整体算法,在它的规则学习和最终标注两个阶段,均以Viterbi算法作为其初始化过程.实验结果表明此算法优于其中任何一种单独的算法,达到了98%的高准确率,验证了自然语言处理中统计与规则并举的主流设计思想.

关 键 词:词性标注器  Viterbi  FTBL  隐马尔可夫模型  高性能  英文  词性标注器  设计思想  Language  English  Part  of  Speech  Realization  统计与规则  然语言处理  验证  准确率  结果  实验  初始化过程  规则学习  整体算法  规则方法  自动  fast
文章编号:1001-0645(2005)10-0876-04
收稿时间:11 26 2004 12:00AM
修稿时间:2004年11月26日

Design and Realization of a High-Performance Part of Speech Tagger for the English Language
LU Lin,ZHOU Shi-bin and LIU Yu-shu.Design and Realization of a High-Performance Part of Speech Tagger for the English Language[J].Journal of Beijing Institute of Technology(Natural Science Edition),2005,25(10):876-879.
Authors:LU Lin  ZHOU Shi-bin and LIU Yu-shu
Institution:Department of Computer Seienee and Engineering,Sehool of Information Seienee and Teehnology, Beijing Institute of Teehnology, Beijing 100081, China
Abstract:In view of the respective strength and weakness of the statistical method and rule governed method, a kind of English part of speech tagger based on a cascade of Viterbi and FTBL algorithms is proposed. Its design idea and realization process are discussed. This cascade method views the FTBL algorithm as its whole algorithm, applying Viterbi algorithm to its initialization process during its two phases rule learning and final tagging. The results show that this method excels either of the separate algorithms, achieving high accuracy of over 98%, and validates the mainstream design idea as a combination of statistical and rule governed methods in natural language processing.
Keywords:part of speech tagging  Viterbi  FTBL  hidden Markov modeling  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《北京理工大学学报》浏览原始摘要信息
点击此处可从《北京理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号