基于策略迭代和遗传算法的 SMDP鲁棒控制策略求解 Solution of the robust control policy for SMDPs based on the genetic algorithm and policy iteration期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于策略迭代和遗传算法的 SMDP鲁棒控制策略求解

引用本文：	程燕,唐昊,马学森.基于策略迭代和遗传算法的 SMDP鲁棒控制策略求解[J].合肥工业大学学报(自然科学版),2007,30(11):1404-1407.

作者姓名：	程燕唐昊马学森

作者单位：	合肥工业大学,计算机与信息学院,安徽,合肥,230009

基金项目：	国家自然科学基金 , 安徽省自然科学基金 , 合肥工业大学校科研和教改项目

摘要：	半马尔可夫决策过程(SMDP)描述的一类受控半Markov系统,其模型参数在实际中常常不确定或不可知,可能导致随机过程的性能函数和系统参数(即嵌入链转移概率和状态逗留时间分布)皆不确定。该文针对参数不相关的情况,给出求解鲁棒控制策略的迭代算法,并在迭代过程中引入遗传算法,以提高全局优化能力。数值例子表明,基于遗传算法的策略迭代应用于鲁棒决策问题中具有较好的优化效果。
关键词：	半马尔可夫决策过程性能势鲁棒控制遗传算法
文章编号：	1003-5060(2007)11-1404-04
修稿时间：	2006年11月13
Solution of the robust control policy for SMDPs based on the genetic algorithm and policy iteration

CHENG Yan,TANG Hao,MA Xue-sen.Solution of the robust control policy for SMDPs based on the genetic algorithm and policy iteration[J].Journal of Hefei University of Technology(Natural Science),2007,30(11):1404-1407.

Authors:	CHENG Yan TANG Hao MA Xue-sen

Abstract:	For a class of controlled semi-Markov systems,which are formulated as semi-Markov decision processes(SMDPs),some parameters are usually indeterminate or unknown,and the performance function or the system parameters,i.e.,the transition probabilities of the embedded chains and the sojourn time distribution of states,may be uncertain.For the case of independent parameters,a policy iteration is provided to derive the robust control policy,and the genetic algorithm is applied in order to improve the optimization result.The numerical example shows that the genetic algorithm-based policy iteration works well for robust decision problems.

Keywords:	semi-Markov decision process performance potential robust control genetic algorithm
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏