Improved reinforcement learning with curriculum

作者:

Highlights:

• A RL neural-network/tree-search agent can be improved by using curriculum.

• End-game-first curriculum improves the quality of training buffer.

• Early-game experiences generated during early epochs are low quality.

摘要

•A RL neural-network/tree-search agent can be improved by using curriculum.•End-game-first curriculum improves the quality of training buffer.•Early-game experiences generated during early epochs are low quality.

论文关键词:Curriculum learning,Reinforcement learning,Monte Carlo tree search,General game playing

论文评审过程:Received 8 August 2019, Revised 19 February 2020, Accepted 2 May 2020, Available online 8 May 2020, Version of Record 29 May 2020.

论文官网地址:https://doi.org/10.1016/j.eswa.2020.113515