首页 | 本学科首页   官方微博 | 高级检索  
     

基于双向编码器表示模型和注意力机制的食品安全命名实体识别
引用本文:姜同强,王岚熙. 基于双向编码器表示模型和注意力机制的食品安全命名实体识别[J]. 科学技术与工程, 2021, 21(3): 1103-1108. DOI: 10.3969/j.issn.1671-1815.2021.03.039
作者姓名:姜同强  王岚熙
作者单位:北京工商大学计算机与信息工程学院,北京 100048;农产品质量安全追溯技术及应用国家工程实验室,北京100048
基金项目:国家重点研发计划项目“食品污染物风险分级评价框架体系与智能研判预警模型研究”(项目编号:2019YFC1606401)
摘    要:针对于目前传统的命名实体识别模型在食品案件纠纷裁判文书领域的准确率不足的问题,在双向长短时记忆网络的基础上提出一种基于双向编码器表示模型(bidirectional encoder representations from transformers,Bert)和注意力机制的命名实体识别模型.模型通过Bert层进行字向量...

关 键 词:命名实体识别  字向量  裁判文书  双向长短时记忆网络  条件随机场
收稿时间:2020-05-15
修稿时间:2021-01-14

Food safety named entity recognition based on Bert and attention mechanism
Jiang Tongqiang,Wang Lanxi. Food safety named entity recognition based on Bert and attention mechanism[J]. Science Technology and Engineering, 2021, 21(3): 1103-1108. DOI: 10.3969/j.issn.1671-1815.2021.03.039
Authors:Jiang Tongqiang  Wang Lanxi
Affiliation:School of Computer and Information Engineering,Beijing Technology and Business University; National Engineering Laboratory for Agricultural Product Quality and Safety Traceability Technology and Application
Abstract:Aiming at the problem of insufficient accuracy of the current traditional named entity recognition model in the field of food dispute dispute judgment documents, a named entity recognition model based on Bert and attention mechanism is proposed based on the two-way long-term and short-term memory network. The model pre-trains the word vectors through the Bert layer, generates word vectors based on the contextual semantics, the word vector sequence is input to the BiLSTM layer and the Attention layer to extract semantic features, and then the CRF layer predicts and outputs the optimal label sequence of the word, and finally obtains a food case dispute judgment The entity in the instrument. Experiments show that the accuracy rate and F1 value of the model in the legal documents of food disputes have reached 92.56% and 90.25%, respectively, and the accuracy rate has been improved by 6.76% compared with the BiLSTM-CRF model that is currently most used. The Bert-BiLSTM-Attention-CRF model can effectively overcome the ambiguity of missing words in the traditional named entity recognition model through pre-training of word vectors and fully integrate the context semantics, which improves the recognition of named entities in the field of food dispute dispute judgment documents. Accuracy.
Keywords:named entity recognition   word vector   referee documents  bi-directionalSlong short-term memory  conditional random field
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号