首页 | 本学科首页   官方微博 | 高级检索  
     


Analysis of the Impact of Sample Size,Attribute Variance and Within-Sample Choice Distribution on the Estimation Accuracy of Multinomial Logit Models Using Simulated Data
Authors:Minhui Zeng  Ming Zhong  John Douglas Hunt
Affiliation:1.Engineering Research Center for Transportation safety of MOE,Wuhan University of Technology,Wuhan,China;2.School of Traffic and Transportation Engineering,Changsha University of Science & Technology,Changsha,China;3.National Engineering Research Center for Water Transportation Safety,Wuhan,China;4.Department of Civil and Environmental Engineering,University of Waterloo,Ontario,Canada;5.Department of Civil Engineering,University of Calgary,Alberta,Canada
Abstract:Literature review indicates that sample size, attribute variance and within-sample choice distribution of alternatives are important considerations in the estimation of multinomial logit (MNL) models, but their impacts on the estimation accuracy have not been systematically studied. Therefore, the objective of this paper is to provide an empirical examination to the above issues through a set of simulated discrete choice preference and rank ordered preference datasets. In this paper, the utility coefficients, alternative specific constants (ASCs), and the mean and standard deviation of the four attributes for a set of seven hypothetical alternatives are specified as a priori. Then, synthetic datasets, with varying sample size, attribute variance and within-sample choice distribution are simulated. Based on these datasets, the utility coefficients and ASCs of the specified MNLs are re-estimated and compared with the original values specified as the priori. It is found that (1) the estimation accuracy of utility parameters increases as the sample size increases; (2) the utility coefficients can be re-estimated with reasonable accuracy, but the estimates of the ASCs are confronted with much larger errors; (3) as the variances of the alternative attributes increase, the estimation accuracy improves significantly; and (4) as the distribution of chosen choices becomes more balanced across alternatives within sample datasets, the hit-ratio decreases. The results indicate that (a) under a similar setting presented in this paper, a large sample consisting of a few thousand observations (3000–4000) may be needed in order to provide reasonable estimates for utility coefficients, particularly for ASCs; (b) a larger, but realistic attribute space is preferred in the stated preference survey design; and (c) choice datasets with unbalanced “chosen” choice frequency distribution is preferred, in order to better capture the elasticity between the “perceived utility” associated with alternative’s attributes.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号