首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于CRF和多元规则的层次化句法分析
引用本文:杨陈菊,孙俊,皮乾东,邵玉斌,龙华.基于CRF和多元规则的层次化句法分析[J].吉林大学学报(理学版),2021,58(6):1452-1460.
作者姓名:杨陈菊  孙俊  皮乾东  邵玉斌  龙华
作者单位:昆明理工大学 信息工程与自动化学院, 昆明 650504
摘    要:针对句法分析中细粒度和粗粒度组块识别模型的冲突问题, 为解决句法分析中词语搭配规则多、减少搭配优先级变动的影响, 提出一种结合条件随机场(CRF)和多元规则的层次化句法分析模型. 先利用CRF算法识别细粒度语句的组块标记序列, 然后结合统计和多元规则识别粗粒度组块, 在识别出的组块中层层引入不同优先级的二元、三元规则. 该模型实现了同时进行细粒度和粗粒度组块的识别, 可更好地服务于句法分析. 在Chinese TreeBank8.0(CTB8.0)语料上采用5-折交叉验证, 结果表明, 相比于仅使用二元、 三元规则及使用CRF+二元规则的句法分析, 该模型的正确率分别约提高12%,3%,5%, 验证了该模型有效性和稳定性.

关 键 词:层次句法分析    条件随机场    多元规则    组块识别  

Hierarchical Parsing Based on CRF and Multiple Rules
YANG Chenju,SUN Jun,PI Qiandong,SHAO Yubin,LONG Hua.Hierarchical Parsing Based on CRF and Multiple Rules[J].Journal of Jilin University: Sci Ed,2021,58(6):1452-1460.
Authors:YANG Chenju  SUN Jun  PI Qiandong  SHAO Yubin  LONG Hua
Institution:School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China
Abstract:Aiming at the problem of the conflict between fine-grained and coarse-grained chunk recognition models in parsing, in order to solve the problem of multiple collocation rules in parsing and reduce the influence of collocation priority changes, we proposed a hierarchical parsing model which combined conditional random field (CRF) with multiple rules. First, CRF algorithm was used to identify the chunk tag sequence of the fine-grained sentence, and then the coarse-grained chunks were identified by combining statistics and multiple rules, and binary and ternary rules of different priorities were introduced into the identified chunks. The model realized the identification of fine-grained and coarse-grained chunks at the same time, which could better serve parsing. On the Chinese TreeBank8.0 corpus, the 5-fold cross-validation method was used for experimental verification. The results show that it is compared with the parsing using only binary and ternary rules, as well as the use of binary rules and CRF, the accuracy of the model is improved by nearly 12%,3%,5%, respectively, which verifies the effectiveness and stability of the model.
Keywords:hierarchical parsing  conditional random field  multiple rules  chunk recognition  
点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号