Colearning in Differential Games

作者：John W. Sheppard

摘要

Game playing has been a popular problem area for research in artificial intelligence and machine learning for many years. In almost every study of game playing and machine learning, the focus has been on games with a finite set of states and a finite set of actions. Further, most of this research has focused on a single player or team learning how to play against another player or team that is applying a fixed strategy for playing the game. In this paper, we explore multiagent learning in the context of game playing and develop algorithms for “co-learning” in which all players attempt to learn their optimal strategies simultaneously. Specifically, we address two approaches to colearning, demonstrating strong performance by a memory-based reinforcement learner and comparable but faster performance with a tree-based reinforcement learner.

论文关键词：Markov games, differential games, pursuit games, multiagent learning, reinforcement learning, Q-learning

论文评审过程：

论文官网地址：https://doi.org/10.1023/A:1007566607659