首页 | 本学科首页   官方微博 | 高级检索  
     

基于Bayesian方法的参数估计和异常值检测
引用本文:尚华,冯牧,张贝贝. 基于Bayesian方法的参数估计和异常值检测[J]. 重庆邮电大学学报(自然科学版), 2016, 28(1): 138-142. DOI: 10.3979/j.issn.1673-825X.2016.01.021
作者姓名:尚华  冯牧  张贝贝
作者单位:1. 首都经济贸易大学 统计学院 北京 100070;2. 中国科学技术大学 管理学院,安徽 合肥,230000
基金项目:国家自然科学基金(11426159);首都经济贸易大学研究生科技创新项目(12013120061)
摘    要:异常值检测是当前数据分析研究中的一个重要研究领域。模型中的异常值会直接影响建模、参数的估计、预测等问题。基于模型的异常值检测,传统的做法是先对模型参数进行估计,再进行异常值检测。而异常值的存在会影响参数估计,从而导致下一步异常值检测的不可靠;反之异常值检测也会影响参数估计。针对这些不足之处,提出了基于 Bayesian 方法的参数估计和异常值检测,此方法可以将参数估计和异常值检测同时实现,具体做法是在线性回归模型中引入识别变量,基于 Gibbs 抽样算法,给出识别变量后验概率的计算方法,通过比较这些识别变量的后验概率进行异常值定位,同时给出参数的估算方法。通过大量的模拟实验,结果表明,与传统方法相比,提出的方法对异常值更灵敏。

关 键 词:线性回归  识别变量  参数估计  异常值  Bayesian 方法  Gibbs 抽样
收稿时间:2015-03-12
修稿时间:2015-10-12

Parameter estimation and outliers detection based on Bayesian method
SHANG Hu,FENG Mu and ZHANG Beibei. Parameter estimation and outliers detection based on Bayesian method[J]. Journal of Chongqing University of Posts and Telecommunications, 2016, 28(1): 138-142. DOI: 10.3979/j.issn.1673-825X.2016.01.021
Authors:SHANG Hu  FENG Mu  ZHANG Beibei
Affiliation:College of Statistics,Capital University of Economics and Business,Beijing,100070,P.R.China,College of Management,University of Science and Technology of China,Hefei,230000,P.R.China and College of Statistics,Capital University of Economics and Business,Beijing,100070,P.R.China
Abstract:Outliers detection is an important research field in the current data analysis.Outliers in the data will affect the modeling,estimating parameters,forecasting and other issues directly.The conventional methods of outliers detection based on the model are to estimate the model parameters firstly,and then detect the abnormal value.The presence of outliers affects the parameter estimation,which results the in unreliability of outlier detection consequently;On the contrary,the presence of outliers will affect the parameter estimation.In this paper,we propose a new outliers detecting method based on Bayesian method,which can estimate parameters and detect outliers simultaneously.This method is introducing classification variables into linear regression model.Using Gibbs sampling a procedure for computing the posterior probabilities of classification variables and obtaining the estimation of parameters is designed.The outliers can be detected by comparing the posterior probabilities of these classification variables.A large number of simulation experiments illustrate that the proposed method is more sensitive to outliers compared with traditional methods
Keywords:linear regression  classification variables  parameter estimation  outlier  Bayesian method  Gibbs sampling
本文献已被 万方数据 等数据库收录!
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号