首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于BLSTM-CRF模型的安全漏洞领域命名实体识别
引用本文:张若彬,刘嘉勇,何祥.基于BLSTM-CRF模型的安全漏洞领域命名实体识别[J].四川大学学报(自然科学版),2019,56(3):469-475.
作者姓名:张若彬  刘嘉勇  何祥
作者单位:四川大学电子信息学院,四川大学网络空间安全学院,四川大学电子信息学院
基金项目:2017年国家重点研发计划网络空间安全重点专项“软件与系统漏洞分析与发现技术”(2017YFB0802900)
摘    要:非结构化文本资源提供了大量与漏洞相关的信息,传统的特定领域实体识别依赖特征模板和领域知识来识别相关实体,其识别性能很大程度上依赖于人工选取的特征函数质量.如何利用机器挖掘文本隐含的特征,而不需要人工详细地制定领域术语的特征表达是一项具有挑战性的任务.该文针对安全漏洞领域,提出一种双向长短期记忆网络BLSTM与条件随机场CRF相结合的安全漏洞领域实体识别模型,并使用基于词典的方法对结果进行校正,F值可达到85%以上.实验表明,该方法在提高实体识别的准确率和召回率的同时,能够显著地降低人工选取特征的工作量.

关 键 词:安全漏洞  实体识别  双向长短期记忆网络  条件随机场
收稿时间:2018/12/4 0:00:00
修稿时间:2018/12/21 0:00:00

Named Entity Recognition for Vulnerabilities Based on BLSTM-CRF Model
Zhang Ruo-bin,Liu Jia-yong and He Xiang.Named Entity Recognition for Vulnerabilities Based on BLSTM-CRF Model[J].Journal of Sichuan University (Natural Science Edition),2019,56(3):469-475.
Authors:Zhang Ruo-bin  Liu Jia-yong and He Xiang
Institution:College of Electronics and Information Engineering, Sichuan University,College of Cybersecurity, Sichuan University,College of Electronics and Information Engineering, Sichuan University
Abstract:Unstructured text resources provide a large amount of information related to vulnerability. Traditional domain-specific entity recognition relies on feature templates and domain knowledge to recognize related entities. The recognition performance depends largely on the quality of manually selected feature functions. It is a challenging task to mine the features implied by the text automatically, rather than manually formulate the characterization of the domain terminology. In this paper, a BLSTM and CRF security vulnerability domain entity recognition model (BLSTM-CRF model) is proposed and a dictionary is used to correct the results generated by the model. The F value can reach 85%. Experiments show that this method can significantly reduce the workload of manually selecting features while improving the precision and recall
Keywords:Cyber Vulnerabilities  Named entity recognition  Bidirectional Long Short-Term Memory  Conditional Random Field
本文献已被 CNKI 等数据库收录!
点击此处可从《四川大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《四川大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号