Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

Authors: Selçuk Şenkul, Faruk Polat

Abstract

Individual learning in an environment where more than one agent exists is a challenging task. In this paper, a single learning agent situated in an environment where multiple agents exist is modeled based on reinforcement learning. The environment is non-stationary and partially accessible from an agent's point of view. Therefore, the learning activities of an agent are influenced by the actions of other cooperative or competitive agents in the environment. A prey-hunter capture game that has the above characteristics is defined and used in experiments to simulate the learning process of individual agents. Experimental results show that there are no strict rules for reinforcement learning. We suggest two new methods to improve the performance of agents. These methods decrease the number of states while retaining as much state information as necessary.

Keywords: agent learning, multi-agent systems, Q-learning, reinforcement learning

Paper URL: https://doi.org/10.1023/A:1019935502139