基于性能预测的遗传强化学习动态调度方法 Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于性能预测的遗传强化学习动态调度方法

引用本文：	魏英姿,谷侃锋.基于性能预测的遗传强化学习动态调度方法[J].系统仿真学报,2010(12).

作者姓名：	魏英姿谷侃锋

作者单位：	1. 沈阳理工大学信息科学与工程学院,沈阳110159; 2. 中国科学院沈阳自动化研究所沈阳现代装备研究设计中心,沈阳110016;

摘要：	针对作业车间动态调度问题,在模式驱动调度的框架下,提出遗传强化学习动态调度方法。首先,采用优先规则编码的染色体表达问题的解,将染色体分割成基因模式作为分阶段调度算法的状态模式;其次,设计性能预测变量,构建启发式立即回报函数,引导和加快遗传强化学习算法的搜索进程;再次,设置遗传算子、强化学习及其相关参数以实现搜索过程＂开采＂与＂探索＂之间的平衡;最后,仿真实验结果验证了遗传强化学习调度方法的有效性。 Abstract： In the framework of pattern driven scheduling,a genetic reinforcement learning （GRL） approach to schedule the job in the dynamical job-shop was proposed.First,the chromosome was coded by preference rules-based representation for the problem.The chromosome was divided into gene schema as state patterns for the multi-phase scheduling system.Secondly,a performance predictive variable to construct instant reward function was designed which was used to guide the learning system to progress rapidly.Thirdly,genetic operators,RL and controlling parameters carried out the search strategy for the balance of ＂exploration＂ and ＂exploitation＂.Finally,the simulation results verify the efficiency of GRL scheduling approach.
关键词：	强化学习遗传算法预测生产周期作业车间动态调度
Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction

WEI Ying-zi,GU Kan-feng.Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction[J].Journal of System Simulation,2010(12).

Authors:	WEI Ying-zi GU Kan-feng

Abstract:	In the framework of pattern driven scheduling,a genetic reinforcement learning （GRL） approach to schedule the job in the dynamical job-shop was proposed.First,the chromosome was coded by preference rules-based representation for the problem.The chromosome was divided into gene schema as state patterns for the multi-phase scheduling system.Secondly,a performance predictive variable to construct instant reward function was designed which was used to guide the learning system to progress rapidly.Thirdly,genetic operators,RL and controlling parameters carried out the search strategy for the balance of ＂exploration＂ and ＂exploitation＂.Finally,the simulation results verify the efficiency of GRL scheduling approach.

Keywords:	reinforcement learning genetic algorithm prediction makespan dynamic job-shop scheduling
本文献已被万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏