Missing Data Imputation for Traffic Flow Based on Improved Local Least Squares |
| |
Authors: | Gang Chang Yi Zhang Danya Yao Tsinghua National Laboratory for Information Science Technology |
| |
Affiliation: | , Department of Automation, Tsinghua University, Beijing 100084, China |
| |
Abstract: | Complete and reliable field traffic data is vital for the planning, design, and operation of urban traffic management systems. However, traffic data is often very incomplete in many traffic information systems, which hinders effective use of the data. Methods are needed for imputing missing traffic data to minimize the effect of incomplete data on the utilization. This paper presents an improved Local Least Squares (LLS) approach to impute the incomplete data. The LLS is an improved version of the K Nearest Neighbor (KNN) method. First, the missing traffic data is replaced by a row average of the known values. Then, the vector angle and Euclidean distance are used to select the nearest neighbors. Finally, a regression step is used to get weights of the nearest neighbors and the imputation results. Traffic flow volume collected in Beijing was analyzed to compare this approach with the Bayesian Principle Component Analysis (BPCA) imputation approach. Tests show that this approach provides slightly better performance than BPCA imputation to impute missing traffic data. |
| |
Keywords: | Local Least Squares (LLS) vector angle missing value imputation traffic flow |
本文献已被 CNKI 等数据库收录! |
|