首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于RoBERTa与改进局部离群因子算法的专利新颖性测量
引用本文:廖列法,姚秀,李奎.基于RoBERTa与改进局部离群因子算法的专利新颖性测量[J].科学技术与工程,2023,23(17):7420-7427.
作者姓名:廖列法  姚秀  李奎
作者单位:江西理工大学
基金项目:国家自然科学基金项目(71462018,71761018)
摘    要:现有的专利新颖性测量方法需要依赖特定的领域知识以及专家的介入,性能差且耗时,为此,提出了一种不依赖特定领域知识及专家的全自动化系统的识别新颖性专利的方法。首先利用RoBERTa表示专利向量,以解决需要依赖技术领域的知识来表示专利的多义词问题,其次利用数据点的密度分布并结合信息熵改进局部离群因子算法(LOF)来确定离群点个数及数据点集,提高离群点的检测精度,结合RoBERT与改进的LOF在数值尺度上度量专利的新颖性。实验验证表明,所提方法测量的专利新颖性的得分与现有文献中的相关专利指标显著相关,并且识别出的新颖性专利具有更高的技术影响。

关 键 词:专利新颖性  RoBERTa  信息熵  局部离群因子算法  离群点检测
收稿时间:2022/9/13 0:00:00
修稿时间:2023/4/7 0:00:00

Patented novelty measurement based on RoBERTa and improved local outlier algorithm
Liao Lief,Yao Xiu,Li Kui.Patented novelty measurement based on RoBERTa and improved local outlier algorithm[J].Science Technology and Engineering,2023,23(17):7420-7427.
Authors:Liao Lief  Yao Xiu  Li Kui
Institution:Jiangxi University of Science and Technology
Abstract:Existing patented novelty measurement methods need to rely on specific domain knowledge and expert intervention, poor performance and time-consuming, so a method for identifying novelty patents was proposed in a fully automated system that does not rely on specific domain knowledge and experts. Firstly, RoBERTa was used to represent the patent vector to solve the polysemy problem that needs to rely on knowledge in the technical field to represent the patent, and secondly, the density distribution of data points and the local outlier factor algorithm (LOF) were improved by using the density distribution of data points and combined with information entropy to determine the number of outliers and the set of data points, improve the detection accuracy of outliers, and combine RoBERT and improved LOF to measure the novelty of the patent on a numerical scale. Experimental verification shows that the patent novelty score measured by the proposed method is significantly correlated with the relevant patent indicators in the existing literature, and the identified novelty patents have higher technical impact.
Keywords:patent novelty  RoBERTa  Information entropy  local outlier factor  outlier detection
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号