首页 | 本学科首页   官方微博 | 高级检索  
     

基于强化学习的连续型机械臂自适应跟踪控制
引用本文:江达,蔡志勤,刘忠振,彭海军,吴志刚. 基于强化学习的连续型机械臂自适应跟踪控制[J]. 系统仿真学报, 2022, 34(10): 2264-2271. DOI: 10.16182/j.issn1004731x.joss.21-0632
作者姓名:江达  蔡志勤  刘忠振  彭海军  吴志刚
作者单位:1.大连理工大学,辽宁 大连 1160242.工业装备结构分析国家重点实验室,辽宁 大连 116024
基金项目:国家自然科学基金重大研究计划重点项目(91748203);国家自然科学基金优秀青年项目(11922203)
摘    要:针对空间主动碎片清除操作中连续型三臂节机器人系统跟踪问题,提出一种基于强化学习的自适应滑模控制算法。基于数据驱动的建模方法,采用BP神经网络对三臂节连续型机械臂进行建模,并作为预测模型指导强化学习实时调节所提出滑模控制器的控制参数,从而实现连续型机器人运动的实时跟踪控制。仿真结果表明:提出的数据驱动的预测模型对随机轨迹预测的相对误差保持在±1%以内,能够高精度地反映系统动态特性。对比固定参数的滑模控制器,提出的自适应控制器在保证系统达到控制目标的同时具有更低的超调量和更短的调节时间,表现出更好的控制效果。

关 键 词:空间连续型机器人  强化学习  预测控制  滑模控制  轨迹跟踪
收稿时间:2021-07-07

Reinforcement-learning-based Adaptive Tracking Control for a Space Continuum Robot Based on Reinforcement Learning
Da Jiang,Zhiqin Cai,Zhongzhen Liu,Haijun Peng,Zhigang Wu. Reinforcement-learning-based Adaptive Tracking Control for a Space Continuum Robot Based on Reinforcement Learning[J]. Journal of System Simulation, 2022, 34(10): 2264-2271. DOI: 10.16182/j.issn1004731x.joss.21-0632
Authors:Da Jiang  Zhiqin Cai  Zhongzhen Liu  Haijun Peng  Zhigang Wu
Affiliation:1.Dalian University of Technology, Dalian 116024, China2.State Key Laboratory of Structural Analysis for Industrial Equipment, Dalian 116024, China
Abstract:Aiming at the tracking control for three-arm space continuum robot in space active debris removal manipulation, an adaptive sliding mode control algorithm based on deep reinforcement learning is proposed. Through BP network, a data-driven dynamic model is developed as the predictive model to guide the reinforcement learning to adjust the sliding mode controller's parameters online, and finally realize a real-time tracking control. Simulation results show that the proposed data-driven predictive model can accurately predict the robot's dynamic characteristics with the relative error within ±1% to random trajectories. Compared with the fixed-parameter sliding mode controller, the proposed adaptive controller has a lower overshoot and shorter settling time and can achieve a better tracking performance.
Keywords:space continuum robot  reinforcement learning  predictive control  sliding mode control  trajectory tracking  
点击此处可从《系统仿真学报》浏览原始摘要信息
点击此处可从《系统仿真学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号