Scalable Policies for the Dynamic Traveling Multi-Maintainer Problem with Alerts
Peter Verleijsdonk, Willem van Jaarsveld, Stella Kapodistria

TL;DR
This paper introduces a scalable deep reinforcement learning approach to optimize maintenance scheduling for large networks of assets, effectively handling the complex dynamic traveling multi-maintainer problem with alerts.
Contribution
It develops a novel DRL-based method with reformulated action space and heuristics for initial solutions, enabling scalable and near-optimal maintenance planning in large networks.
Findings
DRL solves single-maintainer instances optimally.
The approach scales to networks with up to 35 assets.
Policies are robust to network modifications.
Abstract
Downtime of industrial assets such as wind turbines and medical imaging devices is costly. To avoid such downtime costs, companies seek to initiate maintenance just before failure, which is challenging because: (i) Asset failures are notoriously difficult to predict, even in the presence of real-time monitoring devices which signal degradation; and (ii) Limited resources are available to serve a network of geographically dispersed assets. In this work, we study the dynamic traveling multi-maintainer problem with alerts (-DTMPA) under perfect condition information with the objective to devise scalable solution approaches to maintain large networks with maintenance engineers. Since such large-scale -DTMPA instances are computationally intractable, we propose an iterative deep reinforcement learning (DRL) algorithm optimizing long-term discounted maintenance costs. The efficiency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReliability and Maintenance Optimization · Supply Chain and Inventory Management
