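Improved Algorithm of Q-Learning for Trajectory Planning

Cite this article:	ZHAO Hui, LIU Yazhe. Improved Algorithm of Q-Learning for Trajectory Planning[J]. Journal of Jilin University (Information Science Edition), 2016, 34(5): 697-702.

Authors:	ZHAO Hui, LIU Yazhe

Affiliations:	College of Engineering, Bohai University, Jinzhou, Liaoning 121013; College of Computer Science and Information Technology, Daqing Normal University, Daqing, Heilongjiang 163318

Funding:	National Youth Fund project (61304053)

Abstract (translated from the Chinese):	To address the tendency of the Q-learning algorithm to become trapped in local optima, the traditional greedy strategy is improved and a segmented incremental search strategy is proposed. By dynamically adjusting its strategy parameters, the strategy lets the Q-learning algorithm move gradually through three stages during learning: exploration, learning, and exploitation. Applying this search strategy to Q-learning allows the improved algorithm to approach the global optimum more quickly. The improved algorithm is applied to manipulator trajectory planning, and the simulation results show that it stably guides the manipulator along the optimal trajectory to reach the target position quickly.
Keywords:	online learning; trajectory planning; manipulator; mathematical model; search strategy
Received:	2015-11-25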
Improved Algorithm of Q-Learning for Trajectory Planning

ZHAO Hui, LIU Yazhe. Improved Algorithm of Q-Learning for Trajectory Planning[J]. Journal of Jilin University: Information Sci Ed, 2016, 34(5): 697-702.

Authors:	ZHAO Hui, LIU Yazhe

Institution:	1. College of Engineering, Bohai University, Jinzhou 121013, China; 2. College of Computer Science and Information Technology, Daqing Normal University, Daqing 163318, China

Abstract:	To address the tendency of the Q-learning algorithm to converge to a local optimum, a segmented incremental search strategy based on the greedy strategy is proposed. By adjusting the parameters of this strategy, the improved Q-learning algorithm moves gradually through three stages: exploration, learning, and exploitation. With the new search strategy, it approaches the global optimum more rapidly than the traditional algorithm. Simulation results show that the manipulator, guided by the improved Q-learning algorithm, reaches the target position accurately and quickly.

Keywords:	online learning; trajectory planning; manipulator; mathematical model; search strategy
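
The abstract describes the search strategy only at a high level and does not give the schedule or its parameter values. As a rough illustration of the idea, the following Python sketch runs tabular Q-learning with an epsilon-greedy policy whose exploration rate follows a three-stage schedule (explore, learn, exploit). Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `staged_epsilon` and `q_learning_chain`, the stage boundaries, the epsilon values, the linear decay in the middle stage, and the toy one-dimensional chain that stands in for the manipulator environment.

```python
import random


def staged_epsilon(episode, total_episodes, explore_end=0.3, learn_end=0.7,
                   eps_high=0.9, eps_low=0.05):
    """Hypothetical three-stage exploration schedule (not the paper's)."""
    progress = episode / total_episodes
    if progress < explore_end:
        return eps_high  # exploration stage: mostly random actions
    if progress < learn_end:
        # learning stage: decay linearly from eps_high to eps_low
        frac = (progress - explore_end) / (learn_end - explore_end)
        return eps_high + frac * (eps_low - eps_high)
    return eps_low  # exploitation stage: mostly greedy actions


def q_learning_chain(n_states=6, episodes=300, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning on a toy chain; the goal is the right-most state."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # actions: 0 = left, 1 = right
    goal = n_states - 1
    for ep in range(episodes):
        eps = staged_epsilon(ep, episodes)
        s = 0
        for _ in range(100):  # cap on steps per episode
            if rng.random() < eps:
                a = rng.randrange(2)  # explore
            else:
                a = 0 if q[s][0] > q[s][1] else 1  # greedy; ties go right
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == goal
            reward = 1.0 if done else 0.0
            target = reward if done else reward + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])  # standard Q-learning update
            s = s_next
            if done:
                break
    return q


if __name__ == "__main__":
    table = q_learning_chain()
    policy = ["right" if row[1] > row[0] else "left" for row in table[:-1]]
    print(policy)
```

Keeping the schedule in its own function means the stage boundaries and rates can be changed without touching the update loop, which is one plausible way to realize the "dynamic adjustment of strategy parameters" the abstract mentions.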
This article is indexed in Wanfang Data and other databases.