一种改进的深度确定性策略梯度网络交通信号控制系统 Improved deep deterministic policy gradient network traffic signal control system期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

一种改进的深度确定性策略梯度网络交通信号控制系统

引用本文：	刘利军,王州,余臻.一种改进的深度确定性策略梯度网络交通信号控制系统[J].四川大学学报(自然科学版),2021,58(4):043003-043003-7.

作者姓名：	刘利军王州余臻

作者单位：	厦门大学航空航天学院,厦门大学航空航天学院,厦门大学航空航天学院

基金项目：	国家自然科学基金项目(No.61304110)；广东省自然科学基金(No.2018A030313124)；深圳市基础研究面上项目(No.JCYJ20190809163009630)；上海市自然科学基金(No.18ZR1443200)

摘要：	交通信号系统控制着城市车辆运行秩序,其效率高低直接影响了社会经济的发展.以十字路口的交通信号控制系统为研究对象,基于深度确定性策略梯度网络DDPG提出了一种改进算法.结合交通环境的特点设计了特征增强和样本去重算法提高算法的性能.通过对实际交通系统运行情况进行调研,基于SUMO仿真环境搭建了交叉路口交通仿真平台.利用FEPG算法控制交通信号,实现了车辆的高效通行.实验结果表明,该算法能够有效地降低车辆等待时间,减少车辆的污染排放.
关键词：	交通信号控制强化学习样本去重 SUMO
收稿时间：	2020/3/26 0:00:00
修稿时间：	2020/5/8 0:00:00
Improved deep deterministic policy gradient network traffic signal control system

LIU Li-Jun,WANG Zhou and YU Zhen.Improved deep deterministic policy gradient network traffic signal control system[J].Journal of Sichuan University (Natural Science Edition),2021,58(4):043003-043003-7.

Authors:	LIU Li-Jun WANG Zhou and YU Zhen

Institution:	School of Aerospace Engineering,Xiamen University,School of Aerospace Engineering,Xiamen University,School of Aerospace Engineering,Xiamen University

Abstract:	The traffic signal system controls the order of urban vehicles, and its efficiency directly affects the development of the social economy. This paper takes the traffic signal control system at the intersection as the research object, an improved algorithm is introduced based on the deep deterministic policy gradient (DDPG), in which the feature enhancement and sample deduplication algorithms combined with the characteristics of the traffic environment are designed to improve the performance of the algorithm. By analyzing the operation of the actual traffic system, a traffic simulation platform for intersections is set up based on the SUMO simulation environment. The FEPG algorithm is used to control traffic signals to achieve efficient vehicle traffic. Experimental results show that the algorithm can effectively reduce vehicle waiting time and reduce vehicle emissions.

Keywords:	Traffic signal control Reinforcement learning Sample deduplication SUMO
本文献已被 CNKI 等数据库收录！
	点击此处可从《四川大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《四川大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏