首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于模糊强化学习的双轮机器人姿态平衡控制
引用本文:闫安,陈章,董朝阳,何康辉.基于模糊强化学习的双轮机器人姿态平衡控制[J].系统工程与电子技术,2021,43(4):1036-1043.
作者姓名:闫安  陈章  董朝阳  何康辉
作者单位:1. 北京航空航天大学航空科学与工程学院, 北京 1001912. 清华大学自动化系, 北京 100084
基金项目:国家自然科学基金(61833016,61873295);航空人工智能专项基金(2018ZA51003)资助课题。
摘    要:针对单轨双轮机器人在静止情况下存在的固有静态不稳定问题, 提出一种基于模糊强化学习(简称为Fuzzy-Q)的控制方法。首先,运用拉格朗日法建立带控制力矩陀螺的系统动力学模型。然后, 在此基础上设计表格型强化学习算法, 实现机器人的稳定平衡控制。最后,针对算法存在的控制精度不高和控制器输出离散等问题, 采用模糊理论泛化动作空间, 改善控制精度, 并使控制输出连续。仿真实验表明, 相较于传统强化学习方法, 所提方法能够显著提高控制精度, 且可以有效抑制外界干扰力矩对系统的影响, 保证系统具有一定的抗干扰能力。

关 键 词:强化学习  模糊强化学习  模糊算法  控制力矩陀螺  单轨双轮机器人  
收稿时间:2020-06-13

Attitude balance control of two-wheeled robot based on fuzzy reinforcement learning
YAN An,CHEN Zhang,DONG Chaoyang,HE Kanghui.Attitude balance control of two-wheeled robot based on fuzzy reinforcement learning[J].System Engineering and Electronics,2021,43(4):1036-1043.
Authors:YAN An  CHEN Zhang  DONG Chaoyang  HE Kanghui
Institution:1. School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China2. Department of Automation, Tsinghua University, Beijing 100084, China
Abstract:In order to solve the inherent problem of static instability of monorail two-wheel robot under resting conditions,a control method of monorail two-wheel robot based on fuzzy reinforcement learning(Fuzzy-Q in short)is proposed.Firstly,the Lagrange method is used to establish the system dynamics model with control moment gyro.And then,on this basis,the tabular reinforcement learning algorithm is designed to realize the stable balance control of the robot.Finally,In order to solve the problems of low control accuracy and discretization of controller output,the fuzzy theory is used to generalize the action space,improve the control accuracy and make the control output continuous.The simulation results show that compared with the traditional reinforcement learning methods,the proposed Fuzzy-Q method can significantly improve the control accuracy,effectively inhibit the influence of external interference torque on the system,and ensure that the system has a great anti-interference capability.
Keywords:reinforcement learning  fuzzy reinforcement learning  fuzzy algorithm  control moment gyro  monorail two-wheeled robot
本文献已被 维普 等数据库收录!
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号