依赖于历史的折扣半马氏决策规划 Semi-Markov Decision Process with Discount Factors Depend on History期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

依赖于历史的折扣半马氏决策规划

引用本文：	张道智.依赖于历史的折扣半马氏决策规划[J].清华大学学报(自然科学版),1989(3).

作者姓名：	张道智

作者单位：	应用数学系

摘要：	研究无界报酬折扣半马氏决策规划问题．证明了：策略π·＝（π１·，π２·，…πｎ·，π·ｎ＋１，…）是最优策略，则π１·（∞）及（π１·，π２·，…，πｎ·）（∞）对同一折扣因子函数也是最优的，对任给的整数ｎ≥１，在一定的条件下，πｎ·（∞）也是最优的；证明了若最优策略存在，必存在最优平稳策略；证明了ε最优平稳策略的存在性。
关键词：	折扣因子函数最优策略最优平稳策略
Semi-Markov Decision Process with Discount Factors Depend on History

Zhang Daozhi.Semi-Markov Decision Process with Discount Factors Depend on History[J].Journal of Tsinghua University(Science and Technology),1989(3).

Authors:	Zhang Daozhi

Institution:	Department of Applied Mathematics

Abstract:

Keywords:	discount factors optimal strategies optimal stationary strategies
本文献已被 CNKI 等数据库收录！