Online planning for multi-agent systems with bounded communication

作者：

Highlights：

•

摘要

We propose an online algorithm for planning under uncertainty in multi-agent settings modeled as DEC-POMDPs. The algorithm helps overcome the high computational complexity of solving such problems offline. The key challenges in decentralized operation are to maintain coordinated behavior with little or no communication and, when communication is allowed, to optimize value with minimal communication. The algorithm addresses these challenges by generating identical conditional plans based on common knowledge and communicating only when history inconsistency is detected, allowing communication to be postponed when necessary. To be suitable for online operation, the algorithm computes good local policies using a new and fast local search method implemented using linear programming. Moreover, it bounds the amount of memory used at each step and can be applied to problems with arbitrary horizons. The experimental results confirm that the algorithm can solve problems that are too large for the best existing offline planning algorithms and it outperforms the best online method, producing much higher value with much less communication in most cases. The algorithm also proves to be effective when the communication channel is imperfect (periodically unavailable). These results contribute to the scalability of decision-theoretic planning in multi-agent settings.

论文关键词：Decentralized POMDPs,Cooperation and collaboration,Planning under uncertainty,Communication in multi-agent systems

论文评审过程：Received 2 February 2010, Revised 22 September 2010, Accepted 26 September 2010, Available online 29 September 2010.

论文官网地址：https://doi.org/10.1016/j.artint.2010.09.008