Efficient learning equilibrium

作者:

摘要

We introduce efficient learning equilibrium (ELE), a normative approach to learning in non-cooperative settings. In ELE, the learning algorithms themselves are required to be in equilibrium. In addition, the learning algorithms must arrive at a desired value after polynomial time, and a deviation from the prescribed ELE becomes irrational after polynomial time. We prove the existence of an ELE (where the desired value is the expected payoff in a Nash equilibrium) and of a Pareto-ELE (where the objective is the maximization of social surplus) in repeated games with perfect monitoring. We also show that an ELE does not always exist in the imperfect monitoring case. Finally, we discuss the extension of these results to general-sum stochastic games.

论文关键词:Learning equilibrium,Ex-post equilibrium,Efficiency,Multi-agent learning,Repeated games,Stochastic games

论文评审过程:Received 5 March 2003, Accepted 19 April 2004, Available online 19 June 2004.

论文官网地址:https://doi.org/10.1016/j.artint.2004.04.013