连续时间折扣模型最优策略的结构 Structure of Optimal Policy for Continuous Time Discounted Markov Decision Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

连续时间折扣模型最优策略的结构

引用本文：	林元烈. 连续时间折扣模型最优策略的结构[J]. 清华大学学报(自然科学版), 1985, 0(3)

作者姓名：	林元烈

作者单位：	应用数学系

摘要：	本文研究了连续时间马氏决策规划折扣模型在（ｃ）上最优策略的若干重要性质和它的结构。由于引进了映像及，使证明大为简化。特别是证明了：一随机平稳策略，它在（ｃ）上是最优的充要条件是它可表为若干个决定性平稳最优策略的凸组合。
关键词：	最优策略马氏决策连续时间折扣模型
Structure of Optimal Policy for Continuous Time Discounted Markov Decision Model

Lin Yuanlie. Structure of Optimal Policy for Continuous Time Discounted Markov Decision Model[J]. Journal of Tsinghua University(Science and Technology), 1985, 0(3)

Authors:	Lin Yuanlie

Affiliation:	Department of Applied Mathematics

Abstract:	Certain important properties of an optimal policy in m (c) for a continuous time discounted Markov decision model are studied. The proof is much simplified since mappings and are used. It is shown that the randomized stationary policy is an optimal policy in m (c) if and only if it is convex combination of some deterministic stationary optimal policies.

Keywords:	optimal policy Markov decision continuous time discounted model.
本文献已被 CNKI 等数据库收录！