多Agent MDPs中并行Rollout学习算法 Parallel rollout algorithms for multi-agent MDPs期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

多Agent MDPs中并行Rollout学习算法

引用本文：	李豹.多Agent MDPs中并行Rollout学习算法[J].安徽工程科技学院学报,2014(2):75-78.

作者姓名：	李豹

作者单位：	中国人民银行芜湖市中心支行,安徽芜湖241000

摘要：	文章在rollout算法基础上研究了在多Agent MDPs的学习问题.利用神经元动态规划逼近方法来降低其空间复杂度,从而减少算法"维数灾".由于Rollout算法具有很强的内在并行性,文中还分析了并行求解方法.通过多级仓库库存控制的仿真试验,验证了Rollout算法在多Agent学习中的有效性.
关键词：	rollout算法神经元动态规划多Agent学习性能势并行算法
Parallel rollout algorithms for multi-agent MDPs

LI Bao.Parallel rollout algorithms for multi-agent MDPs[J].Journal of Anhui University of Technology and Science,2014(2):75-78.

Authors:	LI Bao

Institution:	LI Bao (Wuhu Cental Sub-Branch of the People＇s Bank of China,Wuhu 241000,China)

Abstract:	The paper researches Rollout algorithms （RA） for multi-Agent Markov decision processes （MDPs） in the framework of performance potentials theory. Neuro-dynamic programming （NDP） is used to reduce ＂curse of dimensionality＂ of algorithms, Since to rolout algorithms has a very strong intrinsic parallelism,the parallelization method of RA is employed to reduce the time of running algorithms. Finally,an example of multi-level inventory control by using RA under the supply chain environment is provided. The result shows that rollout algorithms are confirmed to be valid in multi-Agent learning.

Keywords:	rollout algorithms neuro-dynamic programming multi agent learning performance potentialsparallel algorithms
本文献已被 CNKI 维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏