Nonatomic total rewards Markov decision processes with multiple criteria
详细信息    查看全文
  • 作者:Feinberg ; Eugene A. ; Piunovskiy ; Aleksey B.
  • 刊名:Journal of Mathematical Analysis and Applications
  • 出版年:2002
  • 出版时间:September 1, 2002
  • 年:2002
  • 卷:273
  • 期:1
  • 页码:93-111
  • 全文大小:150 K
文摘
We consider a Markov decision process with an uncountable state space for which the vector performance functional has the form of expected total rewards. Under the single condition that initial distribution and transition probabilities are nonatomic, we prove that the performance space coincides with that generated by nonrandomized Markov policies. We also provide conditions for the existence of optimal policies when the goal is to maximize one component of the performance vector subject to inequality constraints on other components. We illustrate our results with examples of production and financial problems.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700