首页 | 本学科首页   官方微博 | 高级检索  
     检索      

成分数据中基于MCLasso的修正EM算法
作者单位:;1.山西大学数学科学学院
摘    要:针对成分数据中含有近似零值,对其作对数比变换后就会出现无穷值,从而影响对数据的进一步分析.提出了一个新的修正EM算法来处理成分数据中的近似零值问题,针对EM算法的缺点对其进行一些改进,即:对EM算法的E步用Monte Carlo方法改进,对EM算法的M步用Lasso算法进行改进.对新的方法进行实证分析,并与基于线性回归的修正EM算法、基于均值插补法和Bootstrap的修正EM算法进行比较研究,验证了该方法的有效性.

关 键 词:成分数据  缺失数据  EM算法  Monte  Carlo方法  Lasso方法

A new modified EM algorithm based on MCLasso in compositional data
Institution:,School of Mathematics Sciences,Shanxi University
Abstract:The log-ratio transformation is a common method of pretreatment in compositional data analysis,however,the data sets often contain the rounded zeros in practice,which will be infinite when using the log-ratio transformation.We can be regarded as a special kind of missing values,thus affecting the further analysis of the compositional data.In this paper,we present a new modified EM algorithm(MCLasso) to dealing with rounded zeros in compositional data.Some improvements are executed for the shortcomings of EM algorithm,namely:the E step of EM algorithm will be improved by the Monte Carlo algorithm,and the M step of EM algorithm will be improved by the Lasso algorithm.We conduct empirical analysis for the new method based on MCLasso,and the modified EM algorithm based on linear regression,based on mean imputation and Bootstrap were compared to verify the validity of the method.
Keywords:compositional data  missing values  EM algorithm  Monte Carlo  Lasso
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号