Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization

Jesse van Remmerden; Maurice Kenter; Diederik M. Roijers; Charalampos Andriotis; Yingqian Zhang; Zaharah Bukhsh

arXiv:2406.06184·cs.AI·January 9, 2025

TL;DR

This paper presents MO-DCMAC, a multi-objective reinforcement learning method that optimizes infrastructural maintenance policies directly for multiple objectives, outperforming traditional rule-based approaches in diverse scenarios.

Contribution

Introduces MO-DCMAC, a novel multi-objective deep reinforcement learning algorithm for infrastructure maintenance, capable of handling non-linear utility functions and outperforming existing heuristic policies.

Findings

1. MO-DCMAC outperforms rule-based policies in multiple environments.

2. The method effectively optimizes for complex utility functions.

3. Performance validated on case studies including Amsterdam quay walls.

Abstract

In this paper, we introduce Multi-Objective Deep Centralized Multi-Agent Actor-Critic (MO-DCMAC), a multi-objective reinforcement learning (MORL) method for infrastructural maintenance optimization, an area traditionally dominated by single-objective reinforcement learning (RL) approaches. Previous single-objective RL methods combine multiple objectives, such as probability of collapse and cost, into a singular reward signal through reward-shaping. In contrast, MO-DCMAC can optimize a policy for multiple objectives directly, even when the utility function is non-linear. We evaluated MO-DCMAC using two utility functions, which use probability of collapse and cost as input. The first utility function is the Threshold utility, in which MO-DCMAC should minimize cost so that the probability of collapse is never above the threshold. The second is based on the Failure Mode, Effects, and…
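
To make the contrast in the abstract concrete, the following is a minimal sketch of a reward-shaped scalar signal versus a threshold-style utility applied to episode-level outcomes. It is not the paper's implementation: the function names, the penalty weight, the threshold value of 0.05, and the toy cost and collapse-probability numbers are all assumptions chosen for illustration, and the exact form of the Threshold utility in the paper may differ.

```python
from typing import Sequence, Tuple


def scalarized_reward(cost: float, p_collapse: float, penalty: float = 10.0) -> float:
    """Reward shaping: fold both objectives into one number per step.

    The linear form and the penalty weight are illustrative assumptions.
    """
    return -cost - penalty * p_collapse


def episode_outcome(costs: Sequence[float], p_collapses: Sequence[float]) -> Tuple[float, float]:
    """Summarize an episode as (total cost, worst collapse probability)."""
    return sum(costs), max(p_collapses)


def threshold_utility(
    total_cost: float,
    max_p_collapse: float,
    threshold: float = 0.05,          # hypothetical threshold value
    violation_penalty: float = 1e6,   # hypothetical penalty for any violation
) -> float:
    """Non-linear utility: minimize cost, but only among outcomes whose
    collapse probability never exceeds the threshold."""
    if max_p_collapse > threshold:
        return -violation_penalty - total_cost
    return -total_cost


# Two hypothetical maintenance policies over three time steps.
cheap_costs, cheap_probs = [1.0, 1.0, 1.0], [0.01, 0.02, 0.08]  # exceeds threshold once
safe_costs, safe_probs = [2.0, 2.0, 2.0], [0.01, 0.02, 0.03]    # always below threshold

cheap_scalar = sum(scalarized_reward(c, p) for c, p in zip(cheap_costs, cheap_probs))
safe_scalar = sum(scalarized_reward(c, p) for c, p in zip(safe_costs, safe_probs))

cheap_utility = threshold_utility(*episode_outcome(cheap_costs, cheap_probs))
safe_utility = threshold_utility(*episode_outcome(safe_costs, safe_probs))

print("scalarized:", cheap_scalar, safe_scalar)
print("threshold utility:", cheap_utility, safe_utility)
```

With these made-up numbers, the shaped scalar reward ranks the cheaper policy higher even though it crosses the threshold once, while the threshold utility ranks the safer policy higher. Which policy a scalarized reward prefers depends on the chosen penalty weight, which is the kind of tuning a utility defined directly over the objectives avoids.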

Code & Models

Repositories

jesserem/MODCMAC
PyTorch · Official