基于Markov对策和强化学习的多智能体协作研究 Research on Multiagent Cooperation with Markov Game and Reinforcement Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于Markov对策和强化学习的多智能体协作研究

引用本文：	李晓萌,杨煜普,许晓鸣.基于Markov对策和强化学习的多智能体协作研究[J].上海交通大学学报,2001,35(2):288-292.

作者姓名：	李晓萌杨煜普许晓鸣

作者单位：	上海交通大学自动化系,

基金项目：	国家自然科学基金资助项目(3930070)

摘要：	MAS的协作机制研究，当前比较适用的研究框架是非零和Markov对策及基于Q－算法的强化学习。但实际上在这种框架下的Agent强调独立学习而不考虑其他Agent的行为，故MAS缺乏协作机制。并且，Q－算法要求Agent与环境的交互时具有完备的观察信息，这种情况过于理想化。文中针对以上两个不足，提出了在联合行动和不完备信息下的协调学习。理论分析和仿真实验表明，协调学习算法具有收敛性。
关键词：	Markov对策 Q－学习算法协调学习多智能体系统强化学习
文章编号：	1006-2467(2001)02-0288-05
修稿时间：	2000年5月22日
Research on Multiagent Cooperation with Markov Game and Reinforcement Learning

LI Xiao-meng,YANG Yu-Pu,XU Xiao-ming.Research on Multiagent Cooperation with Markov Game and Reinforcement Learning[J].Journal of Shanghai Jiaotong University,2001,35(2):288-292.

Authors:	LI Xiao-meng YANG Yu-Pu XU Xiao-ming

Abstract:	Non zero sum Markov game and reinforcement learning based on Q algorithm is a feasible frame for the research on the mechanism of multiagent system's cooperation. In fact, the independent learning is accentuated for agent regardless of other agents' actions under this frame. So, the mechanism of cooperation is deficient. And, it is over idealized that the perfect observed information is required when agents are interacting with environment. In the paper, cooperated learning under joined action and imperfect information was proposed for solving these two problems. Convergence of the improving algorithm was proved.

Keywords:	Markov game Q learning algorithm cooperated learning
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏