首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于对抗迁移的复合材料检测领域命名实体识别
引用本文:李洋,蔡红珍,邢林林,苏展鹏.基于对抗迁移的复合材料检测领域命名实体识别[J].科学技术与工程,2022,22(30):13370-13377.
作者姓名:李洋  蔡红珍  邢林林  苏展鹏
作者单位:山东理工大学农业工程与食品科学学院;山东理工大学计算机科学与技术学院
基金项目:本课题受国家重点研发计划项目(编号:2018YFB1403302)资助。
摘    要:命名实体识别(Named Entity Recognition,NER)可整合复合材料检测领域相关数据精准提取关键实体信息,促进产业信息化,为行业发展提供技术支撑。针对复合材料检测领域专业名词过多及边界混淆等问题,提出了一种基于对抗训练(Adversarial training)和BERT(Bidirectional Encoder Representations from Transformers)嵌入相结合的领域命名实体识别模型(BERT-AdBC)。首先,复合材料检测领域数据规模较小,BERT嵌入增强了领域迁移能力,通过融合字向量获取充分的语义表示;其次,领域语句繁杂冗长,引入自注意力机制结合双向长短期记忆网络(Bi-LSTM)模型增强了上下文之间语义关系的获取;最后,对抗训练利用分词任务与实体识别任务的共享信息解决了边界混淆问题。实验结果表明,本文所提出的BERT-AdBC模型对复合材料检测领域实体识别的效果要优于传统模型,综合评价指标F1值最高提升6.48%。

关 键 词:复合材料    深度学习  ?  BERT  ?  对抗训练  ?  命名实体识别
收稿时间:2022/1/18 0:00:00
修稿时间:2022/8/7 0:00:00

Named entity recognition in the field of composite materials testing based on BERT and Adversarial Training
Li Yang,Cai Hongzhen,Xing Linlin,Su Zhanpeng.Named entity recognition in the field of composite materials testing based on BERT and Adversarial Training[J].Science Technology and Engineering,2022,22(30):13370-13377.
Authors:Li Yang  Cai Hongzhen  Xing Linlin  Su Zhanpeng
Institution:School of Agricultural Engineering and Food Science, Shandong University of Technology
Abstract:Named Entity Recognition(NER) could integrate the source data and extract the crucial entities in the field of composite materials testing field, and provide technical support for the development of the area. In order to solve the problem of excessive terminology and boundary confusion in the field of composite materials testing, a NER method based on Adversarial training and BERT(Bidirectional Encoder Representations from Transformers) was proposed, and then a BERT-AdBC model was designed. Firstly, the data set in the field of inspection testing was pretty small, so the representation of semantic relations was enhanced by the BERT embedding. And the transferability of model parameters was improved effectively by the transformer structure. Then the retrieval of semantic relationships in contexts was strengthened by the Bi-LSTM-CRF model combined with self-attention. Finally, the influence of boundary confusion on the NER task and CWS(Chinese Words Segmentation) task was reduced by adversarial training. The experimental results showed that the performance of the proposed model was better than traditional models, and the maximum increase of F1 value is 6.48%.
Keywords:Composite materials  deep learning  BERT  adversarial training  named entity recognition
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号