
Chinese names identification based on mutual information
Cite this article: HUANG De-gen, MA Yu-xia, YANG Yuan-sheng. Chinese names identification based on mutual information[J]. Journal of Dalian University of Technology, 2004, 44(5): 744-748.
Authors: HUANG De-gen  MA Yu-xia  YANG Yuan-sheng
Affiliation: Department of Computer Science and Engineering, Dalian University of Technology, Dalian 116024, Liaoning, China (all three authors)
Funding: Supported by the National Natural Science Foundation of China (60373095).
Abstract: A method based on mutual information for identifying Chinese names is proposed and implemented. The method exploits the degree of association between a name and its context, as well as the association among the characters or words within the name, and introduces mutual information to describe both quantitatively. The concepts of context mutual information (con-mutual information) and inner-mutual information are defined, and dynamic evaluation functions are built on them. Open tests on real data sets show that the method effectively improves Chinese name identification while maintaining high precision and recall.

Keywords: Chinese names identification  mutual information  con-mutual information  inner-mutual information
Article ID: 1000-8608(2004)05-0744-05
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
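
The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of quantifying association between adjacent characters with standard pointwise mutual information, PMI(x, y) = log2(P(x, y) / (P(x) P(y))). The function name `pmi_table` and the toy corpus are hypothetical, and this is not the authors' con-mutual or inner-mutual information definition or their dynamic evaluation function.

```python
import math
from collections import Counter


def pmi_table(sequences):
    """Pointwise mutual information for every adjacent character pair.

    Assumption: probabilities are plain relative frequencies over the
    given sequences; the paper's actual estimators are not stated in
    the abstract.
    """
    unigrams = Counter()
    bigrams = Counter()
    for seq in sequences:
        unigrams.update(seq)                # count single characters
        bigrams.update(zip(seq, seq[1:]))   # count adjacent pairs
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    table = {}
    for (x, y), count in bigrams.items():
        p_xy = count / n_bi
        p_x = unigrams[x] / n_uni
        p_y = unigrams[y] / n_uni
        table[(x, y)] = math.log2(p_xy / (p_x * p_y))
    return table


# Hypothetical toy corpus: two mentions of one name, one of another.
corpus = ["王小明说", "王小明来", "李小红说"]
scores = pmi_table(corpus)
```

A pair that co-occurs more often than its individual frequencies would predict receives a positive score. A comparable score between a candidate name and its neighbouring words could serve as a rough stand-in for context association, though the abstract does not state how the paper combines the two.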