不确定SMDP基于全局优化的鲁棒决策问题 Robust Decision Problems Based on Global Optimization for Uncertain SMDPs期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

不确定SMDP基于全局优化的鲁棒决策问题

引用本文：	刘春,唐昊,程文娟. 不确定SMDP基于全局优化的鲁棒决策问题[J]. 系统仿真学报, 2005, 17(11): 2704-2707

作者姓名：	刘春唐昊程文娟

作者单位：	合肥工业大学计算机与信息学院,安徽,合肥,230009

基金项目：	国家自然科学基金项目（60404009）、安徽省自然科学基金项目（050420303）和合肥工业大学中青年科技创新群体计划资助.

摘要：	考虑半马尔可夫决策过程（SMDP）在一些系统参数不确定，且性能函数依赖于这些参数时的鲁棒决策问题。这些参数的不确定性不仅导致等价无穷小生成子的不确定性，也导致性能函数的不确定性。论文针对相关参数的情况，分别采用不同的全局优化算法，即填充函数法和模拟退火算法，进行鲁棒控制策略求解。仿真实例说明，全局优化方法的使用保证了平均准则和折扣准则下的计算结果之间当折扣因子趋近于零时的极限关系成立。
关键词：	半马尔可夫决策过程性能势鲁棒决策全局优化
文章编号：	1004-731X（2005）11-2704-04
收稿时间：	2004-09-23
修稿时间：	2005-05-17
Robust Decision Problems Based on Global Optimization for Uncertain SMDPs

LIU Chun,TANG Hao,CHENG Wen-juan. Robust Decision Problems Based on Global Optimization for Uncertain SMDPs[J]. Journal of System Simulation, 2005, 17(11): 2704-2707

Authors:	LIU Chun TANG Hao CHENG Wen-juan

Affiliation:	School of Computer and Information, Hefei University of Technology, Heifei 230009, China

Abstract:	The robust decision problems for a class of semi-Markov decision processes (SMCPs) are presented with some uncertain system parameters on which the performance function is dependent. The uncertainty of these parameters will lead to the uncertainty of equivalent infinitesimal generator and also the performance function. Different global optimization methods, such as simulated annealing and filled function approaches, were adopted respectively to derive the optimal robust control policy under the case of dependent parameters. With the application of these global optimization techniques, the obtained simulation results for a numerical example show that an average-cost problem is the limitation of a discounted problem as the discount factor goes to zero.

Keywords:	semi-markov decision processes performance potential robust decision global optimization.
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏