首页 | 本学科首页   官方微博 | 高级检索  
     

连续时间折扣模型最优策略的结构
引用本文:林元烈. 连续时间折扣模型最优策略的结构[J]. 清华大学学报(自然科学版), 1985, 0(3)
作者姓名:林元烈
作者单位:应用数学系
摘    要:本文研究了连续时间马氏决策规划折扣模型在(c)上最优策略的若干重要性质和它的结构。由于引进了映像及,使证明大为简化。特别是证明了:一随机平稳策略,它在(c)上是最优的充要条件是它可表为若干个决定性平稳最优策略的凸组合。

关 键 词:最优策略  马氏决策  连续时间折扣模型

Structure of Optimal Policy for Continuous Time Discounted Markov Decision Model
Lin Yuanlie. Structure of Optimal Policy for Continuous Time Discounted Markov Decision Model[J]. Journal of Tsinghua University(Science and Technology), 1985, 0(3)
Authors:Lin Yuanlie
Affiliation:Department of Applied Mathematics
Abstract:Certain important properties of an optimal policy in m (c) for a continuous time discounted Markov decision model are studied. The proof is much simplified since mappings and are used. It is shown that the randomized stationary policy is an optimal policy in m (c) if and only if it is convex combination of some deterministic stationary optimal policies.
Keywords:optimal policy   Markov decision   continuous time discounted model.  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号