首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于Markov对策和强化学习的多智能体协作研究
引用本文:李晓萌,杨煜普,许晓鸣.基于Markov对策和强化学习的多智能体协作研究[J].上海交通大学学报,2001,35(2):288-292.
作者姓名:李晓萌  杨煜普  许晓鸣
作者单位:上海交通大学自动化系,
基金项目:国家自然科学基金资助项目(3930070)
摘    要:MAS的协作机制研究,当前比较适用的研究框架是非零和Markov对策及基于Q-算法的强化学习。但实际上在这种框架下的Agent强调独立学习而不考虑其他Agent的行为,故MAS缺乏协作机制。并且,Q-算法要求Agent与环境的交互时具有完备的观察信息,这种情况过于理想化。文中针对以上两个不足,提出了在联合行动和不完备信息下的协调学习。理论分析和仿真实验表明,协调学习算法具有收敛性。

关 键 词:Markov对策  Q-学习算法  协调学习  多智能体系统  强化学习
文章编号:1006-2467(2001)02-0288-05
修稿时间:2000年5月22日

Research on Multiagent Cooperation with Markov Game and Reinforcement Learning
LI Xiao-meng,YANG Yu-Pu,XU Xiao-ming.Research on Multiagent Cooperation with Markov Game and Reinforcement Learning[J].Journal of Shanghai Jiaotong University,2001,35(2):288-292.
Authors:LI Xiao-meng  YANG Yu-Pu  XU Xiao-ming
Abstract:Non zero sum Markov game and reinforcement learning based on Q algorithm is a feasible frame for the research on the mechanism of multiagent system's cooperation. In fact, the independent learning is accentuated for agent regardless of other agents' actions under this frame. So, the mechanism of cooperation is deficient. And, it is over idealized that the perfect observed information is required when agents are interacting with environment. In the paper, cooperated learning under joined action and imperfect information was proposed for solving these two problems. Convergence of the improving algorithm was proved.
Keywords:Markov game  Q  learning algorithm  cooperated learning
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号