首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于性能预测的遗传强化学习动态调度方法
引用本文:魏英姿,谷侃锋.基于性能预测的遗传强化学习动态调度方法[J].系统仿真学报,2010(12).
作者姓名:魏英姿  谷侃锋
作者单位:1. 沈阳理工大学信息科学与工程学院,沈阳110159;
2. 中国科学院沈阳自动化研究所沈阳现代装备研究设计中心,沈阳110016;
摘    要:针对作业车间动态调度问题,在模式驱动调度的框架下,提出遗传强化学习动态调度方法。首先,采用优先规则编码的染色体表达问题的解,将染色体分割成基因模式作为分阶段调度算法的状态模式;其次,设计性能预测变量,构建启发式立即回报函数,引导和加快遗传强化学习算法的搜索进程;再次,设置遗传算子、强化学习及其相关参数以实现搜索过程"开采"与"探索"之间的平衡;最后,仿真实验结果验证了遗传强化学习调度方法的有效性。
Abstract:
In the framework of pattern driven scheduling,a genetic reinforcement learning (GRL) approach to schedule the job in the dynamical job-shop was proposed.First,the chromosome was coded by preference rules-based representation for the problem.The chromosome was divided into gene schema as state patterns for the multi-phase scheduling system.Secondly,a performance predictive variable to construct instant reward function was designed which was used to guide the learning system to progress rapidly.Thirdly,genetic operators,RL and controlling parameters carried out the search strategy for the balance of "exploration" and "exploitation".Finally,the simulation results verify the efficiency of GRL scheduling approach.

关 键 词:强化学习  遗传算法  预测  生产周期  作业车间动态调度

Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction
WEI Ying-zi,GU Kan-feng.Genetic Reinforcement Learning Approach to Dynamic Scheduling Based on Performance Prediction[J].Journal of System Simulation,2010(12).
Authors:WEI Ying-zi  GU Kan-feng
Abstract:In the framework of pattern driven scheduling,a genetic reinforcement learning (GRL) approach to schedule the job in the dynamical job-shop was proposed.First,the chromosome was coded by preference rules-based representation for the problem.The chromosome was divided into gene schema as state patterns for the multi-phase scheduling system.Secondly,a performance predictive variable to construct instant reward function was designed which was used to guide the learning system to progress rapidly.Thirdly,genetic operators,RL and controlling parameters carried out the search strategy for the balance of "exploration" and "exploitation".Finally,the simulation results verify the efficiency of GRL scheduling approach.
Keywords:reinforcement learning  genetic algorithm  prediction  makespan  dynamic job-shop scheduling
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号