Controlled Markov set-chains under average criteria

作者:

Highlights:

摘要

In this paper, applying an interval arithmetic analysis, we consider the average case of controlled Markov set-chains, whose process allows for fluctuating transition matrices at each step in time. We introduce a v-step contractive property for the average case, under which a Pareto optimal periodic policy is characterized as a maximal solution of optimality equation. Also, in the class of stationary policies, the behavior of the expected reward over T-horizon as T approaches ∞ is investigated and the left- and right-hand side optimality equations are given, by which a Pareto optimal stationary policy is found. As a numerical example, the Taxicab problem is considered.

论文关键词:Controlled Markov set-chains,Average reward criterion,Pareto optimal,Minorization condition,Interval arithmetic

论文评审过程:Available online 6 April 2001.

论文官网地址:https://doi.org/10.1016/S0096-3003(99)00241-6