Recent advances in leveraging human guidance for sequential decision-making tasks

作者：Ruohan Zhang, Faraz Torabi, Garrett Warnell, Peter Stone

摘要

A longstanding goal of artificial intelligence is to create artificial agents capable of learning to perform tasks that require sequential decision making. Importantly, while it is the artificial agent that learns and acts, it is still up to humans to specify the particular task to be performed. Classical task-specification approaches typically involve humans providing stationary reward functions or explicit demonstrations of the desired tasks. However, there has recently been a great deal of research energy invested in exploring alternative ways in which humans may guide learning agents that may, e.g., be more suitable for certain tasks or require less human effort. This survey provides a high-level overview of five recent machine learning frameworks that primarily rely on human guidance apart from pre-specified reward functions or conventional, step-by-step action demonstrations. We review the motivation, assumptions, and implementation of each framework, and we discuss possible future research directions.

论文关键词：Learning from demonstration, Imitation learning, Reinforcement learning, Human feedback, Hierarchical learning, Imitation from observation, Attention

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10458-021-09514-w