首页 | 本学科首页   官方微博 | 高级检索  
     

基于GA-RL的进化博弈求解主从博弈结构的供应链协调问题
引用本文:赵晗萍,蒋家东,冯允成. 基于GA-RL的进化博弈求解主从博弈结构的供应链协调问题[J]. 系统工程理论与实践, 2010, 30(4): 667-672. DOI: 10.12011/1000-6788(2010)4-667
作者姓名:赵晗萍  蒋家东  冯允成
作者单位:1. 北京师范大学,减灾与应急管理研究院,北京,100875;环境演变与自然灾害教育部重点实验室北京,100875
2. 北京航空航天大学,经济管理学院,北京,100083;中国航空综合技术研究所,北京,100028
3. 北京航空航天大学,经济管理学院,北京,100083
基金项目:国家科技支撑计划,教育部博士点新教师基金,国家自然科学基金 
摘    要:供应链协调问题多数基于主从博弈结构建模,但如果研究对象是相对复杂的供应链结构.理论求解主从博弈问题就变得困难.因此从求解一对一的供应链协调问题开始,针对主从博弈问题的特点,利用个体学习的进化博弈仿真手段,设计了经销商利用经验分布的预期随机需求的信念更新模式与最优反应的决策模式,为生产商分别设计了基于强化学习的信念更新模式与基于遗传算法搜索策略空间的决策模式,并将两者有机结合,取得了博弈问题的均衡解并且验证该解与理论求解结果一致,为进一步求解复杂问题提供了新的途径.

关 键 词:供应链协调  进化博弈论  强化学习~(RL)  遗传算法  

Coordinating supply chain of Stackelberg game model based on evolutionary game with GA-RL
ZHAO Han-ping,JIANG Jiadong,FENG Yun-cheng. Coordinating supply chain of Stackelberg game model based on evolutionary game with GA-RL[J]. Systems Engineering —Theory & Practice, 2010, 30(4): 667-672. DOI: 10.12011/1000-6788(2010)4-667
Authors:ZHAO Han-ping  JIANG Jiadong  FENG Yun-cheng
Affiliation:ZHAO Han-ping~(1,2),JIANG Jia-dong~(3,4),FENG Yun-cheng~3 (1.Academy of Disaster Reduction , Emergency Management,Beijing Normal University,Beijing 100875,China,2.Key Laboratory of Environmental Change , Natural Disaster,Ministry of Education,3.School of Economics , Management,Beijing University of Aeronautics , Astronautics,Beijing 100083,4.China Aero Poly-technology Establishment,Beijing 100028,China)
Abstract:Problems of coordinating supply chain are based on Stackelberg game model,but if research object is complex supply chain,it is difficult to find equilibrium of Stackelberg game,so evolutionary game theory was introduced.According to characteristics of leaders and followers in Stackelberg game model,learning mechanism is designed for each player respectively.An algorithm of reinforcement learning combined with genetic searching is proposed for leaders(manufacturers),and a learning model of best-reply is desi...
Keywords:supply chain coordination  evolutionary game theory  reinforcement learning (RL)  genetic algorithm (GA)
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《系统工程理论与实践》浏览原始摘要信息
点击此处可从《系统工程理论与实践》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号