基于模糊强化学习的双轮机器人姿态平衡控制 Attitude balance control of two-wheeled robot based on fuzzy reinforcement learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于模糊强化学习的双轮机器人姿态平衡控制

引用本文：	闫安,陈章,董朝阳,何康辉.基于模糊强化学习的双轮机器人姿态平衡控制[J].系统工程与电子技术,2021,43(4):1036-1043.

作者姓名：	闫安陈章董朝阳何康辉

作者单位：	1. 北京航空航天大学航空科学与工程学院, 北京 1001912. 清华大学自动化系, 北京 100084

基金项目：	国家自然科学基金(61833016,61873295);航空人工智能专项基金(2018ZA51003)资助课题。

摘要：	针对单轨双轮机器人在静止情况下存在的固有静态不稳定问题, 提出一种基于模糊强化学习(简称为Fuzzy-Q)的控制方法。首先，运用拉格朗日法建立带控制力矩陀螺的系统动力学模型。然后, 在此基础上设计表格型强化学习算法, 实现机器人的稳定平衡控制。最后，针对算法存在的控制精度不高和控制器输出离散等问题, 采用模糊理论泛化动作空间, 改善控制精度, 并使控制输出连续。仿真实验表明, 相较于传统强化学习方法, 所提方法能够显著提高控制精度, 且可以有效抑制外界干扰力矩对系统的影响, 保证系统具有一定的抗干扰能力。
关键词：	强化学习模糊强化学习模糊算法控制力矩陀螺单轨双轮机器人
收稿时间：	2020-06-13
Attitude balance control of two-wheeled robot based on fuzzy reinforcement learning

YAN An,CHEN Zhang,DONG Chaoyang,HE Kanghui.Attitude balance control of two-wheeled robot based on fuzzy reinforcement learning[J].System Engineering and Electronics,2021,43(4):1036-1043.

Authors:	YAN An CHEN Zhang DONG Chaoyang HE Kanghui

Institution:	1. School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China2. Department of Automation, Tsinghua University, Beijing 100084, China

Abstract:	In order to solve the inherent problem of static instability of monorail two-wheel robot under resting conditions,a control method of monorail two-wheel robot based on fuzzy reinforcement learning(Fuzzy-Q in short)is proposed.Firstly,the Lagrange method is used to establish the system dynamics model with control moment gyro.And then,on this basis,the tabular reinforcement learning algorithm is designed to realize the stable balance control of the robot.Finally,In order to solve the problems of low control accuracy and discretization of controller output,the fuzzy theory is used to generalize the action space,improve the control accuracy and make the control output continuous.The simulation results show that compared with the traditional reinforcement learning methods,the proposed Fuzzy-Q method can significantly improve the control accuracy,effectively inhibit the influence of external interference torque on the system,and ensure that the system has a great anti-interference capability.

Keywords:	reinforcement learning fuzzy reinforcement learning fuzzy algorithm control moment gyro monorail two-wheeled robot
本文献已被维普等数据库收录！
	点击此处可从《系统工程与电子技术》浏览原始摘要信息
	点击此处可从《系统工程与电子技术》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏