Comparing a class of dynamic model-based reinforcement learning schemes for handoff prioritization in mobile communication networks

Authors:

Abstract

This paper presents and compares three model-based reinforcement learning schemes for admission control with handoff prioritization in mobile communication networks. The goal is to reduce handoff failures while making efficient use of wireless network resources. The performance measure is a weighted linear function of the blocking probability of new connection requests and the handoff failure probability. The problem is formulated as a semi-Markov decision process with an average cost criterion, and a simulation-based learning algorithm is developed to approximate the optimal control policy. The proposed schemes are driven by a dynamic model that is estimated while the control policy is being learned, using samples generated from direct interactions with the network. Extensive simulations assess the effectiveness of the proposed schemes under a variety of traffic conditions and compare them with some well-known policies.
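
As an illustration of the performance measure described in the abstract, the following minimal sketch computes a weighted linear combination of the new-call blocking probability and the handoff failure probability from simple event counters. The function name, the counter arguments, and the default weights are assumptions made for this example; the abstract does not state the weights or the estimation procedure used in the paper.

```python
def weighted_cost(new_blocked, new_total, handoff_failed, handoff_total,
                  w_new=1.0, w_handoff=10.0):
    """Weighted linear cost of blocking and handoff failure.

    All names and the default weights are hypothetical. A larger
    w_handoff penalizes dropped handoffs more heavily than blocked
    new connection requests, which is one way to express handoff
    prioritization.
    """
    # Empirical probabilities; an empty counter contributes zero.
    p_block = new_blocked / new_total if new_total else 0.0
    p_fail = handoff_failed / handoff_total if handoff_total else 0.0
    return w_new * p_block + w_handoff * p_fail


if __name__ == "__main__":
    # 5 of 100 new requests blocked, 1 of 50 handoffs failed.
    print(weighted_cost(5, 100, 1, 50))
```

Raising the handoff weight relative to the new-call weight shifts the cost toward protecting ongoing connections, at the expense of admitting fewer new requests.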

Keywords: Resource management, Handoff prioritization, Cellular systems, Mobile communication networks, Reinforcement learning, Semi-Markov decision process

Publication history: Available online 24 January 2011.

Paper URL: https://doi.org/10.1016/j.eswa.2011.01.082