THE BOREL STATE SPACE SEMI-MARKOV DECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT

Authors:	XU Chen

Affiliations:	XU Chen (School of Science, Shenzhen University, Shenzhen 518060, China); HU Qiying (School of Economy and Management, Xidian University, Xi'an 710071, China)

Summary:	1. Introduction. Markov decision processes (MDP) can describe Markovian sequential decision systems ([1,2]), among which there are many systems in stochastic environments, where the environment's effect changes the parameters modeling the system, e.g., a repairable system in a stochastic environment ([3]) and queueing systems in varied stochastic environments ([4]). Thus MDP in stochastic environments arise when the optimal control of such systems is considered. Continuous time MDP and semi-Markov decision process (SMDP) in a semi-Markov environment with discounted c…

XU Chen. THE BOREL STATE SPACE SEMI-MARKOV DECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT[J]. Journal of Systems Science and Complexity, 1999(1).

Abstract:	This paper investigates the Borel state space semi-Markov decision process (SMDP) with the criterion of expected total rewards in a semi-Markov environment. The model describes a system that behaves like an SMDP except that it is influenced by its environment, which is modeled by a semi-Markov process. We transform the SMDP in a semi-Markov environment into an equivalent discrete time Markov decision process under the condition that rewards are all positive or all negative, and obtain the optimality equation and some of its properties.
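
For orientation, the optimality equation of a discrete time Markov decision process under the expected total reward criterion has the generic form sketched below. This is the standard textbook form, not the equation as stated in the paper: the symbols x (a state, which here would have to include the environment state), A(x) (the admissible actions), r (the one-step reward), q (the transition kernel), and V* (the optimal value function) are all assumed notation.

```latex
% Generic total-reward optimality equation (assumed notation, not the paper's).
% It is well defined when rewards are all nonnegative or all nonpositive,
% since the expected total reward then exists in the extended reals.
V^{*}(x) \;=\; \sup_{a \in A(x)} \left\{ r(x,a) + \int_{X} V^{*}(y)\, q(\mathrm{d}y \mid x, a) \right\},
\qquad x \in X .
```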

Keywords:	Semi-Markov decision processes; semi-Markov environment; expected total rewards; Borel state space
This article is indexed in CNKI and other databases.
