首页 | 本学科首页   官方微博 | 高级检索  
     检索      

THE BOREL STATE SPACE SEMI-MARKOVDECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT
作者姓名:XU  Chen
作者单位:XU Chen(School of Science,Shenzhen University,Shenzhen 518060,China)HU Qiying (School of Economy and Management,Xidian University,Xi'an 710071,China)
摘    要:1.IntroductionMarkovdecisionprocesses(MDP)candescribeMarkoviansequentialdecisionsystems(12]),amongwhichtherearemanysystemsinstochasticenvironmentsandtheenvironments'effectwillchangetheparametersmodelingthesystem,e.g.3arepairablesysteminastochasticenvironment(3])andqueueingsystemsinvariedstochasticenvironments(4]).ThusMDPinstochasticenvironmelltsoccuriftheoptimalcontrolofsuchsystemsisconsidered.ContinuoustimeMDPandsemi-Markovdecisionprocess(SMDP)inasemi-Markovenvironmentwithdiscountedc…


THE BOREL STATE SPACE SEMI-MARKOV DECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT
XU Chen.THE BOREL STATE SPACE SEMI-MARKOVDECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT[J].Journal of Systems Science and Complexity,1999(1).
Authors:XU Chen
Abstract:This paper investigates the Borel state space semi-Markov decision process (SMDP) with the criterion of expected total rewards in a semi-Markov environment. It describes a system which behaves like a SMDP except that the system is influenced by its environment modeled by a semi-Markov process. We transform the SMDP in a semiMarkov environment into an equivalent discrete time Markov decision process under the condition that rewards are all positive or all negative, and obtain the optimality equation and some properties for it.
Keywords:Semi-Markov decision processes  semi-Markov environment  expected total rewards  Borel state space  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号