Deep reinforcement learning based preventive maintenance policy for serial production lines

作者：

Highlights：

• Apply a state-of-the-art deep reinforcement learning algorithm to the PM problem.

• Formulate the PM problem as an MDP with guidance of domain knowledge.

• Use a data-driven modeling method to build a fast simulator for learning.

• The DRL agent learns to perform group maintenance and opportunistic maintenance.

• Proposed method outperforms age-dependent policy and opportunistic policy.

摘要

•Apply a state-of-the-art deep reinforcement learning algorithm to the PM problem.•Formulate the PM problem as an MDP with guidance of domain knowledge.•Use a data-driven modeling method to build a fast simulator for learning.•The DRL agent learns to perform group maintenance and opportunistic maintenance.•Proposed method outperforms age-dependent policy and opportunistic policy.

论文关键词：Preventive maintenance,Production loss,Deep reinforcement learning,Serial production line,Group maintenance,Opportunistic maintenance

论文评审过程：Available online 30 June 2020, Version of Record 10 July 2020.

论文官网地址：https://doi.org/10.1016/j.eswa.2020.113701