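Vector-Valued Finite Average MDP
Cite this article: Jia Rangcheng. Vector-valued finite average MDP [J]. Journal of Northwest Normal University, 1994, 30(3): 16-19.
Author: Jia Rangcheng
Affiliation: Department of Mathematics, Northwest Normal University
Funding: Natural Science Foundation of the Gansu Provincial Education Commission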
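Abstract: This paper discusses finite Markov decision models with vector-valued rewards under the discrete-time average criterion. Under the assumption that the Markov decision process obtained from any deterministic stationary policy is ergodic, it is proved that there exists an optimal stationary policy that is randomized in at most K-1 states, and a linear programming algorithm for it is given. It is also proved that a strongly optimal policy exists if and only if a strongly optimal deterministic stationary policy exists.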

Keywords: vector-valued; average criterion; Markov decision process
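
To make the average criterion concrete, the following minimal sketch evaluates the long-run average reward vector of a stationary (possibly randomized) policy in a small finite MDP. It is not taken from the paper: the function name `average_reward_vector`, the two-state data `P` and `R`, and the two example policies are all hypothetical, and the induced chain is assumed to be ergodic, as in the abstract.

```python
def average_reward_vector(P, R, policy, iters=2000):
    """Average reward vector of a stationary policy (illustrative sketch).

    P[s][a][t]   : probability of moving from state s to t under action a
    R[s][a][k]   : k-th reward component for taking action a in state s
    policy[s][a] : probability of choosing action a in state s
    Assumes the chain induced by the policy is ergodic.
    """
    n_states = len(P)
    n_criteria = len(R[0][0])
    # Transition matrix and expected reward vector induced by the policy.
    P_pi, r_pi = [], []
    for s in range(n_states):
        actions = range(len(P[s]))
        P_pi.append([sum(policy[s][a] * P[s][a][t] for a in actions)
                     for t in range(n_states)])
        r_pi.append([sum(policy[s][a] * R[s][a][k] for a in actions)
                     for k in range(n_criteria)])
    # Stationary distribution via a "lazy" power iteration; mixing with the
    # current iterate keeps the same fixed point and avoids periodicity issues.
    mu = [1.0 / n_states] * n_states
    for _ in range(iters):
        step = [sum(mu[s] * P_pi[s][t] for s in range(n_states))
                for t in range(n_states)]
        mu = [0.5 * m + 0.5 * q for m, q in zip(mu, step)]
    # Average reward per criterion = stationary distribution times rewards.
    return [sum(mu[s] * r_pi[s][k] for s in range(n_states))
            for k in range(n_criteria)]


# Hypothetical example: 2 states, 2 actions, K = 2 reward components.
P = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.1, 0.9]],
]
R = [
    [[1.0, 0.0], [0.0, 2.0]],
    [[2.0, 1.0], [0.0, 3.0]],
]
pure = [[1.0, 0.0], [1.0, 0.0]]    # deterministic stationary policy
mixed = [[0.5, 0.5], [1.0, 0.0]]   # randomized in one state only

g_pure = average_reward_vector(P, R, pure)
g_mixed = average_reward_vector(P, R, mixed)
print(g_pure, g_mixed)
```

For this toy data the pure policy yields approximately (7/6, 1/6) and the policy randomized in state 0 yields approximately (23/19, 1), showing how randomizing in a single state trades one reward component against the other.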

Averaged Finite Vector Value Markov Decision Programming
Jia Rangcheng. Averaged Finite Vector Value Markov Decision Programming[J]. Journal of Northwest Normal University Natural Science (Bimonthly), 1994, 30(3): 16-19.
Authors: Jia Rangcheng
Institution: Department of Mathematics
Abstract: The vector-valued Markov decision model is considered. It is assumed that the state and action spaces are finite and that the law of motion is unichain, i.e., every pure policy gives rise to a Markov chain with one recurrent class. It is proved that there exists an optimal stationary policy with a degree of randomization no more than K. A linear program producing the optimal policy is presented.
Keywords: finite Markov decision model; optimal policy; vector value; average criterion
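
The abstract does not reproduce the linear program itself. As a hedged illustration only, the block below writes out the standard textbook linear program for a unichain average-reward MDP, extended to K criteria by maximizing one component subject to lower bounds on the others. The symbols x(s,a), p(j|s,a), r_k(s,a), and the bounds c_k are assumptions introduced here, and this formulation is not necessarily the one used in the paper.

```latex
\begin{aligned}
\max_{x}\quad & \sum_{s \in S}\sum_{a \in A(s)} r_1(s,a)\, x(s,a) \\
\text{s.t.}\quad
& \sum_{a \in A(j)} x(j,a) - \sum_{s \in S}\sum_{a \in A(s)} p(j \mid s,a)\, x(s,a) = 0,
  && j \in S, \\
& \sum_{s \in S}\sum_{a \in A(s)} x(s,a) = 1, \\
& \sum_{s \in S}\sum_{a \in A(s)} r_k(s,a)\, x(s,a) \ge c_k,
  && k = 2,\dots,K, \\
& x(s,a) \ge 0, && s \in S,\ a \in A(s).
\end{aligned}
```

In this formulation a stationary policy is recovered as \pi(a \mid s) = x(s,a) / \sum_{a'} x(s,a'). A basic optimal solution has at most |S| + K - 1 positive entries, and under the ergodicity assumption every state carries at least one of them, which is consistent with a policy that randomizes in at most K-1 states.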
This article is indexed in CNKI, VIP, and other databases.