基于神经网络增强学习算法的工艺任务分配方法 Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于神经网络增强学习算法的工艺任务分配方法

引用本文：	苏莹莹,王宛山,王建荣,唐亮.基于神经网络增强学习算法的工艺任务分配方法[J].东北大学学报(自然科学版),2009,30(2):279-282.

作者姓名：	苏莹莹王宛山王建荣唐亮

作者单位：	东北大学机械工程与自动化学院,辽宁,沈阳,110004

基金项目：	教育酃高等学校博士学科点专项科研基金

摘要：	在任务分配问题中,如果Markov决策过程模型的状态-动作空间很大就会出现"维数灾难".针对这一问题,提出一种基于BP神经网络的增强学习策略.利用BP神经网络良好的泛化能力,存储和逼近增强学习中状态-动作对的Q值,设计了基于Q学习的最优行为选择策略和Q学习的BP神经网络模型与算法.将所提方法应用于工艺任务分配问题,经过Matlab软件仿真实验,结果证实了该方法具有良好的性能和行为逼近能力.该方法进一步提高了增强学习理论在任务分配问题中的应用价值.
关键词：	任务分配工艺设计增强学习 Q学习神经网络
Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network

SU Ying-ying,WANG Wan-shan,WANG Jian-rong,TANG Liang.Research on Task Allocation of Process Planning Based on Reinforcement Learning and Neural Network[J].Journal of Northeastern University(Natural Science),2009,30(2):279-282.

Authors:	SU Ying-ying WANG Wan-shan WANG Jian-rong TANG Liang

Institution:	SU Ying-ying,WANG Wan-shan,WANG Jian-rong,TANG Liang (School of Mechanical Engineering & Automation,Northeastern University,Shenyang 110004,China.)

Abstract:	Aiming at the curse of dimensionality caused by prodigiousness of state-action space for Markov decision-making process model,a kind of Q learning method based on neural network was proposed.The Q value of a state-action pair during reinforcement learning was approached and stored by means of the high generalizability of BP neural network,then the optimal strategy based on Q learning for selection of action and a BP neural network model and algorithm for Q learning were designed.The algorithm proposed was a...

Keywords:	task allocation process planning reinforcement learning Q learning neural network
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏