Rethinking formal models of partially observable multiagent decision making

作者：

摘要

Multiagent decision-making in partially observable environments is usually modelled as either an extensive-form game (EFG) in game theory or a partially observable stochastic game (POSG) in multiagent reinforcement learning (MARL). One issue with the current situation is that while most practical problems can be modelled in both formalisms, the relationship of the two models is unclear, which hinders the transfer of ideas between the two communities. A second issue is that while EFGs have recently seen significant algorithmic progress, their classical formalization is unsuitable for efficient presentation of the underlying ideas, such as those around decomposition.

论文关键词：Imperfect information game,Multiagent reinforcement learning,Extensive form game,Partially-observable stochastic game,Public information,Decomposition

论文评审过程：Received 30 September 2020, Revised 28 September 2021, Accepted 18 November 2021, Available online 26 November 2021, Version of Record 30 November 2021.

论文官网地址：https://doi.org/10.1016/j.artint.2021.103645