
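Game-Theoretic Coordinated Control of Traffic Signals at Two Intersections Based on the Q-Learning Algorithm
Cite this article: ZHAO Xiao-hua, LI Zhen-long, YU Quan, RONG Jian. Game-Theoretic Coordinated Control of Traffic Signals at Two Intersections Based on the Q-Learning Algorithm [J]. Journal of System Simulation, 2007, 19(18): 4253-4256.
Authors: ZHAO Xiao-hua  LI Zhen-long  YU Quan  RONG Jian
Affiliations: 1. Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100022
2. College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100022
Funding: Beijing Natural Science Foundation; Beijing Municipal Education Commission Science and Technology Innovation Platform Construction; Beijing University of Technology research and teaching reform project
Abstract: Q-learning is combined with game theory to solve the coordinated signal control problem for two adjacent intersections. Game theory is introduced on top of the basic Q-learning algorithm, and a payoff matrix is built with the Q-values as the payoff function. The coordination between two adjacent intersections is a two-player non-zero-sum cooperative game. Its bargaining solution is obtained by the Nash axiomatic method and serves as the basis for strategy selection in Q-learning, which achieves coordinated control of the two intersections. The algorithm is simulated with the Paramics traffic simulation software, and the results indicate that the method is effective.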

Keywords: game theory  Q-learning algorithm  Nash axiomatic method  coordinated signal control of two intersections
Article ID: 1004-731X(2007)18-4253-04
Received: 2006-07-14
Revised: 2007-07-13

Game Coordination Control of Two Intersections Based on Q-Learning Algorithm
ZHAO Xiao-hua, LI Zhen-long, YU Quan, RONG Jian. Game Coordination Control of Two Intersections Based on Q-Learning Algorithm [J]. Journal of System Simulation, 2007, 19(18): 4253-4256.
Authors: ZHAO Xiao-hua  LI Zhen-long  YU Quan  RONG Jian
Abstract: Coordinated traffic signal control for two adjacent intersections is studied by combining Q-learning with game theory. Game theory is introduced into the basic Q-learning algorithm, and the Q-values are used as the payoff function to build the payoff matrix. The coordination between two adjacent intersections is a two-player non-zero-sum cooperative game. Its bargaining solution is obtained by the Nash axiomatic method and is used as the basis for action selection in Q-learning. Coordinated control of the two intersections based on Q-learning and game theory is implemented and simulated with the Paramics traffic simulation software. The simulation results indicate that the method is effective.
Keywords:game theory  Q-Learning algorithm  Nash theorem  traffic signal coordination control for two adjacent intersections
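
The decision rule summarized in the abstract (Q-values as payoffs, a Nash bargaining solution as the action-selection rule) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the Q-table shape, and the values of `alpha` and `gamma` are hypothetical. The sketch also assumes that the disagreement point is each player's pure-strategy maximin value, that the search is restricted to pure joint actions rather than the full convex payoff set, and that the Q-update bootstraps on the joint action bargained in the next state.

```python
import numpy as np


def nash_bargaining_joint_action(q1, q2, d1=None, d2=None):
    """Return the pure joint action (a1, a2) that maximizes the Nash product.

    q1[a1, a2] and q2[a1, a2] are the two intersections' Q-values for one
    state, used as payoff matrices. Player 1 chooses the row, player 2 the
    column.
    """
    q1 = np.asarray(q1, dtype=float)
    q2 = np.asarray(q2, dtype=float)
    # Assumption: the disagreement point is each player's maximin value.
    if d1 is None:
        d1 = q1.min(axis=1).max()
    if d2 is None:
        d2 = q2.min(axis=0).max()
    gain1 = q1 - d1
    gain2 = q2 - d2
    # Only joint actions that leave neither player worse off are candidates.
    feasible = (gain1 >= 0) & (gain2 >= 0)
    if feasible.any():
        score = np.where(feasible, gain1 * gain2, -np.inf)
    else:
        # Fallback for user-supplied disagreement points: maximize the sum.
        score = q1 + q2
    a1, a2 = np.unravel_index(np.argmax(score), score.shape)
    return int(a1), int(a2)


def q_update(q, state, joint_action, reward, next_state, next_joint_action,
             alpha=0.1, gamma=0.9):
    """One Q-learning step on a table of shape (states, actions1, actions2)."""
    a1, a2 = joint_action
    n1, n2 = next_joint_action
    # Assumption: bootstrap on the joint action bargained in the next state.
    target = reward + gamma * q[next_state, n1, n2]
    q[state, a1, a2] += alpha * (target - q[state, a1, a2])
    return q


# Illustrative payoff matrices for a single state (made-up numbers).
example_q1 = [[4.0, 1.0], [2.0, 3.0]]
example_q2 = [[1.0, 2.0], [3.0, 3.0]]
example_choice = nash_bargaining_joint_action(example_q1, example_q2)  # (1, 1)
```

In the example both maximin values equal 2, so only the joint actions (1, 0) and (1, 1) leave neither intersection worse off, and (1, 1) has the larger Nash product.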
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.