向量值有限平均MDP Averaged Einite Vactor Value Markov Decision Programming期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

向量值有限平均MDP

引用本文：	贾让成.向量值有限平均MDP[J].西北师范大学学报,1994,30(3):16-19.

作者姓名：	贾让成

作者单位：	西北师范大学数学系

基金项目：	甘肃省教委自然科学基金

摘要：	讨论了向量值离散时间平均准则下的有限马氏决策模型；在采取确定性平稳策略时所得马氏决策过程为遍历的假设下，证明了存在一个至多在Ｋ－１个状态是随机的平稳最优策略，并给出了其线性规划算法。同时证明了存在强最优策略的充要条件是其存在强确定性平稳最优策略。
关键词：	向量值平均准则马氏决策过程
Averaged Einite Vactor Value Markov Decision Programming

Jia Rangcheng.Averaged Einite Vactor Value Markov Decision Programming[J].Journal of Northwest Normal University Natural Science (Bimonthly),1994,30(3):16-19.

Authors:	Jia Rangcheng

Institution:	Department of Matheniatics

Abstract:	The vactor value Markov decision model is considered.It is assu med that the state andactionapaces are finite and the law of motion is unchain.i.e.every pure policy gives rise to a Merkov chainwith one recurrent class.It is proved that therc exists an optirnal stationary policy with a degree of ran-domization no more than K,A linear program pred1icing the optimal policy is presented.

Keywords:	finite Markov decision model optimal policy vactor value average criterion
本文献已被 CNKI 维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏