The complexity of synchronizing Markov decision processes

Authors:

Abstract

We consider Markov decision processes (MDPs) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of states. We consider four temporal synchronizing modes: a sequence of probability distributions is always, eventually, weakly, or strongly p-synchronizing if, respectively, all, some, infinitely many, or all but finitely many distributions in the sequence are p-synchronizing. We provide tight results on the expressiveness, decidability, complexity, and memory requirements of winning strategies for all synchronizing modes in MDPs.
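The definitions in the abstract can be illustrated with a minimal sketch. It assumes that a strategy has already been fixed, so that the MDP behaves like a Markov chain with a row-stochastic transition matrix, and it reads "mass in a given set of states" as the total mass summed over that set. The function names (`step`, `is_p_synchronizing`, `prefix_flags`) are hypothetical and are not taken from the paper.

```python
from typing import List, Set


def step(dist: List[float], transition: List[List[float]]) -> List[float]:
    """Push a distribution one step through a row-stochastic matrix."""
    n = len(dist)
    return [sum(dist[i] * transition[i][j] for i in range(n)) for j in range(n)]


def is_p_synchronizing(dist: List[float], target: Set[int], p: float) -> bool:
    """Assumed reading: the total mass in the target set is at least p.
    A single state is the special case of a singleton target set."""
    return sum(dist[s] for s in target) >= p


def prefix_flags(initial: List[float], transition: List[List[float]],
                 target: Set[int], p: float, horizon: int) -> List[bool]:
    """For the first `horizon` distributions of the sequence, record
    whether each one is p-synchronizing."""
    flags = []
    dist = list(initial)
    for _ in range(horizon):
        flags.append(is_p_synchronizing(dist, target, p))
        dist = step(dist, transition)
    return flags


# Hypothetical two-state chain: state 0 stays or moves to state 1 with
# probability 1/2 each; state 1 is absorbing.
flags = prefix_flags([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], {1}, 0.75, 4)
print(flags)       # [False, False, True, True]
print(all(flags))  # "always" on this prefix: False
print(any(flags))  # "eventually", witnessed on this prefix: True
```

On a finite prefix, only a violation of the always mode or a witness for the eventually mode can be observed. The weakly and strongly modes concern the infinite tail of the sequence, so deciding them requires the kind of analysis the paper carries out, which this sketch does not attempt.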

Keywords: Markov decision processes, Complexity, Sequences of probability distributions, Synchronization

Review history: Received 6 May 2016, Revised 20 March 2018, Accepted 20 September 2018, Available online 9 October 2018, Version of Record 19 November 2018.

Paper URL: https://doi.org/10.1016/j.jcss.2018.09.004