Game Coordination Control of Two Intersections Based on Q-Learning Algorithm (基于Q学习算法的两交叉口信号灯博弈协调控制)



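Cite this article:	ZHAO Xiao-hua, LI Zhen-long, YU Quan, RONG Jian. Game Coordination Control of Two Intersections Based on Q-Learning Algorithm [J]. Journal of System Simulation (系统仿真学报), 2007, 19(18): 4253-4256.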

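Authors:	ZHAO Xiao-hua (赵晓华), LI Zhen-long (李振龙), YU Quan (于泉), RONG Jian (荣建)

Affiliations:	1. Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100022; 2. College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100022

Funding:	Beijing Natural Science Foundation; Beijing Municipal Education Commission Science and Technology Innovation Platform Construction; Beijing University of Technology research and teaching-reform project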

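Abstract:	Q-learning is combined with game theory to solve the signal coordination control problem for two adjacent intersections. Game theory is introduced on top of the basic Q-learning algorithm, and a payoff matrix is built with the Q-values as the payoff function. The coordination between two adjacent intersections is a two-player non-zero-sum cooperative game; its bargaining solution is obtained by the Nash axiomatic method and used as the basis for strategy selection in Q-learning, which achieves coordinated control of the two intersections. The algorithm was simulated with the Paramics traffic simulation software, and the results indicate that the method is effective.
Keywords:	game theory; Q-learning algorithm; Nash axiomatic method; signal coordination control for two intersections
Article ID:	1004-731X(2007)18-4253-04
Received:	2006-07-14
Revised:	2007-07-13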
Game Coordination Control of Two Intersections Based on Q-Learning Algorithm

ZHAO Xiao-hua, LI Zhen-long, YU Quan, RONG Jian. Game Coordination Control of Two Intersections Based on Q-Learning Algorithm [J]. Journal of System Simulation, 2007, 19(18): 4253-4256.

Authors:	ZHAO Xiao-hua, LI Zhen-long, YU Quan, RONG Jian

Abstract:	Traffic signal coordination control for two adjacent intersections is studied by combining Q-learning with game theory. Game theory is introduced into the basic Q-learning algorithm, and the Q-values serve as the payoff function from which the payoff matrix is built. Coordination between two adjacent intersections is treated as a two-player non-zero-sum cooperative game, whose bargaining solution is obtained by Nash's axiomatic method and used as the basis for strategy selection in Q-learning. The resulting coordination control scheme was simulated in the Paramics traffic simulation software, and the results indicate that the method is effective.

Keywords:	game theory; Q-Learning algorithm; Nash theorem; traffic signal coordination control for two adjacent intersections
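
The abstract gives the selection rule only in outline, so the following is a minimal sketch of that idea rather than the authors' implementation. It assumes each intersection keeps a payoff matrix of Q-values indexed by the joint action, takes each player's pure-strategy maximin value as the disagreement point, and searches only pure joint actions for the largest Nash product (u1 - d1)(u2 - d2). The function names, the disagreement point, the learning parameters, and the numbers in the usage lines are all illustrative assumptions, not values from the paper.

```python
def nash_bargaining_joint_action(q1, q2):
    """Return the pure joint action (i, j) with the largest Nash product.

    q1[i][j] and q2[i][j] are the payoffs (here: Q-values) of intersections
    1 and 2 when intersection 1 plays action i and intersection 2 plays j.
    Assumption: the disagreement point is each player's maximin payoff.
    """
    rows = range(len(q1))
    cols = range(len(q1[0]))
    # Player 1 chooses the row, so its guaranteed payoff is max over rows
    # of the worst column; player 2 chooses the column.
    d1 = max(min(q1[i][j] for j in cols) for i in rows)
    d2 = max(min(q2[i][j] for i in rows) for j in cols)

    best, best_product = None, float("-inf")
    for i in rows:
        for j in cols:
            g1, g2 = q1[i][j] - d1, q2[i][j] - d2
            # Only joint actions that leave neither player worse off than
            # the disagreement point are candidates. The pair of maximin
            # actions always qualifies, so a candidate always exists.
            if g1 >= 0 and g2 >= 0 and g1 * g2 > best_product:
                best, best_product = (i, j), g1 * g2
    return best


def q_update(q, s, joint, reward, s_next, next_joint, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step for a single intersection.

    Assumption: the next-state value is the Q-value at the bargaining
    joint action in s_next, instead of the usual max over actions.
    """
    a1, a2 = joint
    n1, n2 = next_joint
    target = reward + gamma * q[s_next][n1][n2]
    q[s][a1][a2] += alpha * (target - q[s][a1][a2])


# Illustrative payoff matrices for one state (made-up numbers).
q1_state = [[5, 1], [2, 3]]
q2_state = [[3, 2], [1, 4]]
print(nash_bargaining_joint_action(q1_state, q2_state))  # (0, 0)
```

In the usage lines the disagreement point is (2, 2); joint action (0, 0) yields gains of 3 and 1 (product 3), which beats (1, 1) with gains of 1 and 2 (product 2). A full Nash bargaining solution would also consider mixtures over joint actions; restricting the search to pure joint actions is a simplification made here to keep the sketch short.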
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.
