Heuristically accelerated reinforcement learning modularization for multi-agent multi-objective problems

作者：Leonardo Anjoletto Ferreira, Carlos Henrique Costa Ribeiro, Reinaldo Augusto da Costa Bianchi

摘要

This article presents two new algorithms for finding the optimal solution of a Multi-agent Multi-objective Reinforcement Learning problem. Both algorithms make use of the concepts of modularization and acceleration by a heuristic function applied in standard Reinforcement Learning algorithms to simplify and speed up the learning process of an agent that learns in a multi-agent multi-objective environment. In order to verify performance of the proposed algorithms, we considered a predator-prey environment in which the learning agent plays the role of prey that must escape the pursuing predator while reaching for food in a fixed location. The results show that combining modularization and acceleration using a heuristics function indeed produced simplification and speeding up of the learning process in a complex problem when comparing with algorithms that do not make use of acceleration or modularization techniques, such as Q-Learning and Minimax-Q.

论文关键词：Reinforcement learning, Heuristically accelerated reinforcement learning, Multi-agent systems, Multi-objective problems

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10489-014-0534-0