MDP平均模型的强最优性 Strong Optimality for MDP Average Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

MDP平均模型的强最优性

引用本文：	郭先平.MDP平均模型的强最优性[J].湖南师范大学自然科学学报,1996,19(1):21-24.

作者姓名：	郭先平

作者单位：	湖南师范大学数学系

摘要：	考虑的是任意状态空间，任意行动空间ＭＤＰ平均模型的四个平均准则，在Ｏ．Ｈ．Ｌｅｒｍａ的遍历性条件下，利用稳定性定理和可测选择理论简明地证明了存在平稳策略关于此模型的四个平均准则同时是最优的，从而扩充并加强了Ｏ．Ｈ．Ｌｅｒｍａ（１９８９）的主要结果。
关键词：	马氏决策规划平均目标强最优遍历性平稳策略
Strong Optimality for MDP Average Model

Guo Xianping.Strong Optimality for MDP Average Model[J].Journal of Natural Science of Hunan Normal University,1996,19(1):21-24.

Authors:	Guo Xianping

Abstract:	In this paper, we consider four average criteria of MDP with arbitrary state space and action space. Using the theory of measurable selection and the stability theorem,we prove that there exists a stationary policy which is optimal for the four average criteria at the same time under Lermas ergodicity conditions

Keywords:	markor decision progranming (MDP) average creterion strong optimality ergodicity stationary policies
本文献已被 CNKI 维普等数据库收录！