On-line Q Learning Model for Minimizing Average Queue Length Difference



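Cite this article:	ZHANG Shu, WEI Qin-ping. On-line Q Learning Model for Minimizing Average Queue Length Difference[J]. Journal of Yueyang Normal University, 2013(4): 22-25.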

Authors:	ZHANG Shu, WEI Qin-ping

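Institution:	School of Traffic and Transportation Engineering, Changsha University of Science and Technology, Changsha 410004, China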

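Funding:	Key Project of the Natural Science Foundation of Hunan Province (12JJ2025); Key Project of the Changsha Science and Technology Bureau (K1106004-11)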

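Abstract:	An on-line Q learning model is established whose optimization objective is to minimize the average queue length difference. Because control performance indices are insensitive to differences between neighboring signal timing plans, the reward function is reconstructed with the average queue length difference as its basic unit, which widens the gaps between the Q values of different actions and improves the model's convergence speed and robustness. Excel VBA, Vissim, and Matlab are integrated into an on-line simulation platform that serves as the computing environment for the model. GPS data are used to calibrate the vehicle acceleration and deceleration curves in Vissim. The results show that taking the average queue length difference as the optimization objective optimizes the time and space resources of the whole intersection, and that the proposed on-line Q learning model converges quickly, is robust, and achieves the optimization objective through learning.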
Keywords:	traffic control; timing optimization; queue length; on-line Q learning


Abstract:	To adapt to the randomness of traffic flow, the paper builds an on-line Q learning model that minimizes the average queue length difference. Because the performance index varies little between adjacent signal timing plans, the paper proposes a way of constructing the reward function that widens the gap between different actions, improving robustness and computation speed. The paper integrates VBA, Vissim, and Matlab to build a simulation platform. The on-line Q learning model is used to optimize the signal timing of a single two-phase intersection, and it can optimize the time and space resources of the intersection.

Keywords:	traffic control; timing optimization; queue length; on-line Q learning
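
The abstract does not give the model's state space, action set, or exact reward formula, so the following is only a minimal illustrative sketch of the general idea: tabular Q learning on a two-phase intersection where the reward penalizes the difference between the two phases' average queue lengths. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy queue simulator (a stand-in for the Vissim/VBA/Matlab platform), the constants `CYCLE`, `GREENS`, `ARRIVAL`, `SAT_FLOW`, and `MAX_QUEUE`, the state bucketing in `bucket`, and the reward `-|q1 - q2|`.

```python
import random

# All constants below are hypothetical values chosen for illustration.
CYCLE = 60                      # cycle length in seconds
GREENS = [20, 25, 30, 35, 40]   # candidate green times for phase 1 (the actions)
ARRIVAL = (0.30, 0.20)          # per-second arrival probability for phases 1 and 2
SAT_FLOW = 0.6                  # vehicles discharged per second of green
MAX_QUEUE = 60                  # cap that keeps the toy queues bounded


def simulate_cycle(queues, green1, rng):
    """Toy stand-in for the traffic simulator: run one signal cycle.

    Returns the end-of-cycle queues and a rough per-phase average queue.
    """
    greens = (green1, CYCLE - green1)
    new_queues, avg_queues = [], []
    for q, lam, g in zip(queues, ARRIVAL, greens):
        arrivals = sum(1 for _ in range(CYCLE) if rng.random() < lam)
        served = min(q + arrivals, int(SAT_FLOW * g))
        end_q = min(q + arrivals - served, MAX_QUEUE)
        new_queues.append(end_q)
        avg_queues.append((q + end_q) / 2)
    return new_queues, avg_queues


def bucket(diff):
    """Discretize the average queue length difference into states -2..2."""
    return max(-2, min(2, round(diff / 5)))


def reward(avg_queues):
    """Assumed reward: negative absolute average queue length difference."""
    return -abs(avg_queues[0] - avg_queues[1])


def train(cycles=5000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Epsilon-greedy tabular Q learning, updated once per signal cycle."""
    rng = random.Random(seed)
    actions = range(len(GREENS))
    q_table = {(s, a): 0.0 for s in range(-2, 3) for a in actions}
    queues, state = [0, 0], 0
    for _ in range(cycles):
        if rng.random() < epsilon:
            action = rng.randrange(len(GREENS))      # explore
        else:
            action = max(actions, key=lambda a: q_table[(state, a)])  # exploit
        queues, avg_queues = simulate_cycle(queues, GREENS[action], rng)
        next_state = bucket(avg_queues[0] - avg_queues[1])
        best_next = max(q_table[(next_state, a)] for a in actions)
        # Standard Q learning update toward the one-step TD target.
        td_target = reward(avg_queues) + gamma * best_next
        q_table[(state, action)] += alpha * (td_target - q_table[(state, action)])
        state = next_state
    return q_table
```

After training, a greedy green time for a given state `s` can be read off as `GREENS[max(range(len(GREENS)), key=lambda a: q_table[(s, a)])]`. Updating once per cycle mirrors the on-line setting described in the abstract, where the model learns while the simulation runs rather than from a pre-collected data set.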
This article is indexed in VIP (维普) and other databases.
