首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于神经网络增强学习算法的工艺任务分配方法
引用本文:苏莹莹,王宛山,王建荣,唐亮.基于神经网络增强学习算法的工艺任务分配方法[J].东北大学学报(自然科学版),2009,30(2):279-282.
作者姓名:苏莹莹  王宛山  王建荣  唐亮
作者单位:东北大学机械工程与自动化学院,辽宁,沈阳,110004
基金项目:教育酃高等学校博士学科点专项科研基金 
摘    要:在任务分配问题中,如果Markov决策过程模型的状态-动作空间很大就会出现"维数灾难".针对这一问题,提出一种基于BP神经网络的增强学习策略.利用BP神经网络良好的泛化能力,存储和逼近增强学习中状态-动作对的Q值,设计了基于Q学习的最优行为选择策略和Q学习的BP神经网络模型与算法.将所提方法应用于工艺任务分配问题,经过Matlab软件仿真实验,结果证实了该方法具有良好的性能和行为逼近能力.该方法进一步提高了增强学习理论在任务分配问题中的应用价值.

关 键 词:任务分配  工艺设计  增强学习  Q学习  神经网络  

Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network
SU Ying-ying,WANG Wan-shan,WANG Jian-rong,TANG Liang.Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network[J].Journal of Northeastern University(Natural Science),2009,30(2):279-282.
Authors:SU Ying-ying  WANG Wan-shan  WANG Jian-rong  TANG Liang
Institution:SU Ying-ying,WANG Wan-shan,WANG Jian-rong,TANG Liang (School of Mechanical Engineering & Automation,Northeastern University,Shenyang 110004,China.)
Abstract:Aiming at the curse of dimensionality caused by prodigiousness of state-action space for Markov decision-making process model,a kind of Q learning method based on neural network was proposed.The Q value of a state-action pair during reinforcement learning was approached and stored by means of the high generalizability of BP neural network,then the optimal strategy based on Q learning for selection of action and a BP neural network model and algorithm for Q learning were designed.The algorithm proposed was a...
Keywords:task allocation  process planning  reinforcement learning  Q learning  neural network  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号