Reward is enough

作者:

摘要

In this article we hypothesise that intelligence, and its associated abilities, can be understood as subserving the maximisation of reward. Accordingly, reward is enough to drive behaviour that exhibits abilities studied in natural and artificial intelligence, including knowledge, learning, perception, social intelligence, language, generalisation and imitation. This is in contrast to the view that specialised problem formulations are needed for each ability, based on other signals or objectives. Furthermore, we suggest that agents that learn through trial and error experience to maximise reward could learn behaviour that exhibits most if not all of these abilities, and therefore that powerful reinforcement learning agents could constitute a solution to artificial general intelligence.

论文关键词:Artificial intelligence,Artificial general intelligence,Reinforcement learning,Reward

论文评审过程:Received 12 November 2020, Revised 28 April 2021, Accepted 12 May 2021, Available online 24 May 2021, Version of Record 8 June 2021.

论文官网地址:https://doi.org/10.1016/j.artint.2021.103535