
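The influence of data contamination on sequential bifurcation and its counter measures
Cite this article: LIU Lijun, MA Yizhong, OUYANG Linhan, ZHANG Yanjing. The influence of data contamination on sequential bifurcation and its counter measures[J]. Systems Engineering - Theory & Practice, 2020, 40(5): 1281-1292. DOI: 10.12011/1000-6788-2019-0119-12
Authors: LIU Lijun  MA Yizhong  OUYANG Linhan  ZHANG Yanjing
Affiliation: 1. School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094; 2. College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106
Funding: National Natural Science Foundation of China (71931006, 71871119, 71702072, 11901299)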
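Abstract: Because of its efficiency, sequential bifurcation (SB) has been widely used in recent years for factor screening in simulation experiments. However, the traditional SB method struggles with factor screening when the data are contaminated. This paper therefore improves the traditional SB screening procedure with robust estimation, giving it good resistance to outliers and addressing factor screening under several data contamination scenarios. First, the data contamination scenarios that may arise in simulation experiments and the form the data take are analyzed, and, based on the basic principle of SB, the effect of each contamination scenario on the screening results is quantified. Second, robust location and scale statistics are used to improve the significance test in traditional SB, so that the screening results are not affected by data contamination. Finally, simulation experiments verify that the improved SB method has better resistance to outliers and that it also remains applicable when the data are not contaminated.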

Keywords: data contamination  simulation experiments  factor screening  sequential bifurcation
Received: 2019-01-23

The influence of data contamination on sequential bifurcation and its counter measures
LIU Lijun,MA Yizhong,OUYANG Linhan,ZHANG Yanjing. The influence of data contamination on sequential bifurcation and its counter measures[J]. Systems Engineering —Theory & Practice, 2020, 40(5): 1281-1292. DOI: 10.12011/1000-6788-2019-0119-12
Authors:LIU Lijun  MA Yizhong  OUYANG Linhan  ZHANG Yanjing
Affiliation:1. School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China;2. College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
Abstract: Sequential bifurcation (SB) is an efficient factor screening method that is widely used in simulation experiments to identify important factors with as few runs as possible. However, SB assumes normally distributed responses and shows limitations when the data are contaminated. This paper therefore modifies the classic sequential bifurcation procedure so that it can handle contaminated responses as well as normal ones. First, several types of data contamination are introduced, and their influence on factor screening results is analyzed. Then, sequential bifurcation is modified by using robust estimators in the hypothesis tests. Finally, Monte Carlo simulation shows that the modified sequential bifurcation method remains efficient and effective when the data are normally distributed and is robust when the data are contaminated.
Keywords:data contamination  simulation experiments  factor screening  sequential bifurcation  
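
As a rough illustration of why a robust location and scale estimator can help a significance test under contamination, the sketch below compares a classical mean/standard-deviation statistic with a median/MAD counterpart. The abstract does not say which robust estimators the paper uses, so the median and the scaled median absolute deviation (MAD) are assumptions here, as are the function names and the replicate values, which are hypothetical. This is not the authors' procedure.

```python
import math
import statistics


def classical_t(sample):
    """t-like statistic: sample mean divided by its standard error."""
    n = len(sample)
    return statistics.mean(sample) / (statistics.stdev(sample) / math.sqrt(n))


def robust_t(sample):
    """Same form, with the median and scaled MAD replacing mean and SD.

    Assumption: median/MAD stand in for the paper's robust estimators.
    """
    n = len(sample)
    med = statistics.median(sample)
    mad = statistics.median([abs(x - med) for x in sample])
    scale = 1.4826 * mad  # makes MAD consistent with SD for normal data
    return med / (scale / math.sqrt(n))


# Hypothetical replicated estimates of one group effect (true effect near 5).
clean = [4.8, 5.1, 5.3, 4.9, 5.0, 5.2, 4.7, 5.0]
# Same replicates with the last one replaced by a single outlier.
contaminated = clean[:-1] + [-30.0]

for name, data in [("clean", clean), ("contaminated", contaminated)]:
    print(name, round(classical_t(data), 2), round(robust_t(data), 2))
```

On the clean replicates both statistics are large. With one outlier, the classical statistic collapses because the outlier shifts the mean and inflates the standard deviation, while the median/MAD version stays large, so an important effect would still be flagged.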