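An Online Q-Learning Model for a Single Intersection Minimizing the Average Queue Length Difference
Cite this article: ZHANG Shu, WEI Qin-ping. An Online Q-Learning Model for a Single Intersection Minimizing the Average Queue Length Difference[J]. Journal of Yueyang Normal University, 2013(4): 22-25.
Authors: ZHANG Shu, WEI Qin-ping
Affiliation: School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha 410004
Funding: Key Project of the Natural Science Foundation of Hunan Province (12JJ2025); Key Project of the Changsha Science and Technology Bureau (K1106004-11)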
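Abstract: An online Q-learning model is established with minimization of the average queue length difference as the optimization objective. Because control performance indices are insensitive to changes between adjacent signal timing plans, the reward function is reconstructed with the average queue length difference as its basic unit, with the aim of widening the gap between the Q values of different actions and improving the model's convergence speed and robustness. An online simulation platform integrating Excel VBA, Vissim, and Matlab is built and used as the computing environment for the model. GPS data are used to calibrate the vehicle acceleration and deceleration curves in Vissim. The results indicate that taking the average queue length difference as the optimization objective can optimize the time and space resources of the whole intersection, and that the online Q-learning model established here converges quickly, is robust, and achieves the optimization objective through learning.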

Keywords: traffic control; timing optimization; queue length; online Q-learning

On-line Q Learning Model for Minimizing Average Queue Length Difference
ZHANG Shu, WEI Qin-ping. On-line Q Learning Model for Minimizing Average Queue Length Difference[J]. Journal of Yueyang Normal University, 2013(4): 22-25.
Authors:ZHANG Shu  WEI Qin-ping
Institution:(School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha 410004, China)
Abstract: To adapt to the randomness of traffic flow, the paper builds an on-line Q learning model that minimizes the average queue length difference. Because the performance index varies little between adjacent signal timing plans, the paper proposes a way of constructing the reward function that widens the gap between the Q values of different actions, improving robustness and convergence speed. The paper integrates VBA, Vissim, and Matlab into a simulation platform. The on-line Q learning model is applied to signal timing optimization at a single two-phase intersection, where it can optimize the intersection's time and space resources.
Keywords:traffic control  timing optimization  queue length  on-line Q learning
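
The learning loop summarized in the abstract can be illustrated with a minimal tabular Q-learning sketch in Python. This is not the authors' implementation: the abstract does not give the reward formula, state definition, or learning parameters, so the action set, cycle length, `unit` scaling, and the `toy_avg_queues` function (a crude stand-in for the Vissim simulation) are all assumptions made for illustration only.

```python
import random

ACTIONS = (-2, 0, 2)        # assumed change (s) in phase-1 green time per cycle
CYCLE = 60                  # assumed fixed cycle length (s), two phases, no lost time
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1   # assumed learning parameters


def reward(avg_queue_1, avg_queue_2, unit=1.0):
    """Negative absolute difference of the two phases' average queue lengths,
    in multiples of `unit` (an assumed scaling, not the paper's formula)."""
    return -abs(avg_queue_1 - avg_queue_2) / unit


def q_update(q, state, action, r, next_state):
    """One standard tabular Q-learning update; unseen entries default to 0."""
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + ALPHA * (r + GAMMA * best_next - old)
    return q[(state, action)]


def choose_action(q, state, rng, epsilon=EPSILON):
    """Epsilon-greedy choice among the green-time adjustments."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


def toy_avg_queues(green_1, demand_1=0.3, demand_2=0.2):
    """Toy stand-in for the simulator: each phase's average queue
    grows in proportion to its red time (hypothetical model)."""
    return demand_1 * (CYCLE - green_1), demand_2 * green_1


def train(cycles=5000, seed=0):
    """Learn online, one signal cycle at a time; the state is the current green time."""
    rng = random.Random(seed)
    q, green = {}, 30
    for _ in range(cycles):
        action = choose_action(q, green, rng)
        next_green = min(50, max(10, green + action))   # keep green within 10-50 s
        r = reward(*toy_avg_queues(next_green))
        q_update(q, green, action, r, next_green)
        green = next_green
    return q, green
```

Calling `train()` returns the learned Q table and the final green time. Choosing a smaller `unit` enlarges the reward differences between neighbouring timing plans, which is one simple way to mimic the abstract's idea of widening the gap between the Q values of different actions.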
This article is indexed in VIP (维普) and other databases.