基于语言特性的中文领域术语抽取算法 An Algorithm of Chinese Domain Term Extraction Based on Language Feature期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于语言特性的中文领域术语抽取算法

引用本文：	傅继彬,樊孝忠,毛金涛,余正涛.基于语言特性的中文领域术语抽取算法[J].北京理工大学学报,2010,30(3):307-310.

作者姓名：	傅继彬樊孝忠毛金涛余正涛

作者单位：	河南财经学院,计算机与信息工程学院,河南,郑州,450002;北京理工大学,计算机学院,北京,100081;北京理工大学,计算机学院,北京,100081;昆明理工大学,信息工程与自动化学院,云南,昆明,650051

基金项目：	国家自然科学基金资助项目(60863011);;国家教育部高等学校博士学科点专项科研基金资助课题(20050007023)

摘要：	提出一种基于语言特性的中文领域术语自动抽取算法.集成领域耦合性、领域相关性和领域一致性3种语言特性建立统计模型进行中文领域术语的自动抽取.提出基于困惑度衰减比率的自动评价方法,使用该评价方法对术语抽取算法进行了比较评估.实验结果表明,该算法与基于互信息和似然度的方法相比,在准确率和召回率方面都有较大提高.
关键词：	术语抽取领域耦合性领域相关性领域一致性
收稿时间：	2009/2/27 0:00:00
An Algorithm of Chinese Domain Term Extraction Based on Language Feature

FU Ji-bin,FAN Xiao-zhong,MAO Jin-tao and YU Zheng-tao.An Algorithm of Chinese Domain Term Extraction Based on Language Feature[J].Journal of Beijing Institute of Technology(Natural Science Edition),2010,30(3):307-310.

Authors:	FU Ji-bin FAN Xiao-zhong MAO Jin-tao and YU Zheng-tao

Institution:	1.College of Computer and Information Engineering;Henan University of Finance and Economics;Zhengzhou;He'nan 450002;China;2.School of Computer Science and Technology;Beijing Institute of Technology;Beijing 100081;3.School of Information Engineering and Automation;Kunming University of Science and Technology;Kunming;Yunnan 650051;China

Abstract:	An algorithm for Chinese domain term extraction based on language feature is proposed.Domain terms in Chinese have three features: domain cohesiveness,domain relevancy and domain consensus.The algorithm to extract domain term integrates three statistical models which compute domain cohesiveness,domain relevancy and domain consensus respectively.Experimental results show that the algorithm has higher precision and recall than the method based on mutual information and log-likelihood.An automatic evaluation m...

Keywords:	term extraction domain cohesiveness domain relevancy domain consensus
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《北京理工大学学报》浏览原始摘要信息
	点击此处可从《北京理工大学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏