首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于强化学习算法的公交信号优先策略
引用本文:舒波,李大铭,赵新良.基于强化学习算法的公交信号优先策略[J].东北大学学报(自然科学版),2012,33(10):1513-1516.
作者姓名:舒波  李大铭  赵新良
作者单位:东北大学工商管理学院,辽宁沈阳,110819
基金项目:辽宁省教育厅人文社会科学基金资助项目(2009JD31)
摘    要:综合分析了影响城市公共交通系统运行的多种因素,提出了一种新型的基于强化学习算法的城市公交信号优先控制策略.该策略利用强化学习算法的试错-改进机制,根据不同交通环境下信号控制策略实施后反馈的结果,迭代优化路口的公交信号优先控制策略,从而使其具备了自学习的能力.基于Paramics的仿真实验表明,该算法能够在保障路口正常交通秩序的同时,显著提高公交车运行效率.

关 键 词:公交系统  交通信号控制  公交信号优先  强化学习  回报函数  

Transit Signal Priority Strategy Based on Reinforcement Learning Algorithm
SHU Bo,LI Da-ming,ZHAO Xin-liang.Transit Signal Priority Strategy Based on Reinforcement Learning Algorithm[J].Journal of Northeastern University(Natural Science),2012,33(10):1513-1516.
Authors:SHU Bo  LI Da-ming  ZHAO Xin-liang
Institution:(School of Business & Administration,Northeastern University,Shenyang 110819,China.)
Abstract:Factors affecting public transit system were synthetically analyzed. An innovative transit signal priority (TSP) strategy based on reinforcement learning algorithm was proposed. The trial and error mechanism of reinforcement learning were utilized, so the signal plans could be optimized iteratively by implementing them and estimating the rewards. The proposed idea made the TSP strategy have a capability of self-learning. Based on the software of Paramics, simulations were carried out. And the results demonstrated that the proposed TSP strategy could not only improve the efficiency of transit operation, but also reduce the impacts on general traffic at signalized intersections.
Keywords:transit system  traffic signal control  transit signal priority  reinforcement learning  reward function
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号