首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于随机森林算法的函数缺陷定位
引用本文:李倩倩,牟永敏,赵晓永.基于随机森林算法的函数缺陷定位[J].科学技术与工程,2020,20(32):13278-13284.
作者姓名:李倩倩  牟永敏  赵晓永
作者单位:北京信息科技大学网络文化与数字传播北京市重点实验室,北京100101;北京信息科技大学信息管理学院, 北京100192
基金项目:北京市自然科学基金资助项目(Z160002);网络文化与数字传播北京市重点实验室开放课题资助(5221935409)
摘    要:缺陷定位是软件调试过程中的重要阶段,通过挖掘程序执行过程中的动态信息与执行结果之间的关系,可以有效定位缺陷位置。由此提出了一种基于随机森林算法的函数缺陷定位方法(Function Defect Location based on Random Forest,简称FDLRF)。其具体思想是:首先动态执行测试用例获取函数的动态调用图并生成DOT文件,解析该文件获取各个函数的轨迹信息,建立特征矩阵,同时利用合成少数类过采样技术(Synthetic Minority Over-sampling Technique,简称SMOTE)得到均衡样本,运用随机森林算法对数据进行训练,从而获得每个属性的贡献度信息,即函数缺陷概率。实验结果表明,该方法较传统算法在定位准确率有了一定程度的提升。

关 键 词:缺陷定位  函数轨迹信息  随机森林  合成少数类过采样技术(SMOTE)算法
收稿时间:2020/4/10 0:00:00
修稿时间:2020/7/29 0:00:00

Function defect location based on random forest algorithm
LI Qian-qian,MU Yong-min,ZHAO Xiaoyong.Function defect location based on random forest algorithm[J].Science Technology and Engineering,2020,20(32):13278-13284.
Authors:LI Qian-qian  MU Yong-min  ZHAO Xiaoyong
Institution:Beijing key laboratory of Internet Culture and Digital Dissemination Research,Beijing Information Science Technology University,Beijing key laboratory of Internet Culture and Digital Dissemination Research,Beijing Information Science Technology University,College of Information and Management,Beijing Information Science Technology University
Abstract:Defect locating is an important stage in the software debugging process. By mining the relationship between the dynamic information and the execution results during the execution of the program, the defect location can be effectively located.Therefore, a function defect location method based on random forest algorithm (FDLRF) is proposed. The specific idea is: first, dynamically execute the test case to obtain the dynamic call graph of the function and generate a DOT file, parse the file to obtain the trajectory information of each function, and establish a feature matrix;Secondly, use the synthetic minority over-sampling technique (SMOTE) to obtain balanced samples;Finally, the random forest algorithm is used to train the data to obtain the contribution information of each attribute, that is, the function defect probability.Experimental results show that the method has a certain degree of improvement in positioning accuracy compared with traditional algorithms.
Keywords:defect location  function trajectory information  random forest  SMOTE algorithm
本文献已被 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号