Study of the Main Reinforcement Learning Algorithms (强化学习主要算法的研究) |
| |
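Cite this article: | LI Rui. Study of the Main Reinforcement Learning Algorithms [J]. Journal of Yuxi University (Natural Science Edition) (渝西学院学报(自然科学版)), 2004, 3(3): 22-25. |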
| |
Author: | LI Rui (李瑞) |
| |
Affiliation: | Department of Mathematics and Computer Science, Yuxi University (渝西学院), Yongchuan, Chongqing 402168 |
| |
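Abstract: | This paper introduces the reinforcement learning model, presents seven main reinforcement learning algorithms in turn and discusses their differences and relationships, and finally points out the open problems that remain in reinforcement learning algorithms.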
|
Keywords: | reinforcement learning; dynamic programming; Monte Carlo algorithm; temporal-difference algorithm |
Study of the Main Reinforcement Learning Algorithms |
| |
Authors: | LI Rui |
| |
Abstract: | The model of reinforcement learning is first introduced in this paper. Then seven main algorithms, including dynamic programming, the Monte Carlo method, temporal-difference learning, and Q-learning, are presented in turn, and their differences and relationships are pointed out. Finally, future research directions are proposed. |
| |
Keywords: | reinforcement learning; dynamic programming; Monte Carlo method; temporal difference
This article is indexed in CNKI, VIP (维普), and other databases.
|