Reinforcement Learning for Multi-Objective Multi-Echelon Supply Chain Optimisation

Rifny Rachman; Josh Tingey; Richard Allmendinger; Pradyumn Shukla; Wei Pan

arXiv:2507.19788·cs.AI·July 29, 2025

Reinforcement Learning for Multi-Objective Multi-Echelon Supply Chain Optimisation

Rifny Rachman, Josh Tingey, Richard Allmendinger, Pradyumn Shukla, Wei Pan

PDF

TL;DR

This paper introduces a multi-objective reinforcement learning approach for complex supply chain optimization, effectively balancing economic, environmental, and social goals with superior performance over existing methods.

Contribution

It develops a novel multi-objective RL model for multi-echelon supply chains, outperforming traditional single-objective RL and evolutionary algorithms in complex scenarios.

Findings

01

Up to 75% higher hypervolume than MOEA-based approach

02

Solutions are approximately eleven times denser, indicating better robustness

03

Achieves stable production and inventory levels while minimizing demand loss

Abstract

This study develops a generalised multi-objective, multi-echelon supply chain optimisation model with non-stationary markets based on a Markov decision process, incorporating economic, environmental, and social considerations. The model is evaluated using a multi-objective reinforcement learning (RL) method, benchmarked against an originally single-objective RL algorithm modified with weighted sum using predefined weights, and a multi-objective evolutionary algorithm (MOEA)-based approach. We conduct experiments on varying network complexities, mimicking typical real-world challenges using a customisable simulator. The model determines production and delivery quantities across supply chain routes to achieve near-optimal trade-offs between competing objectives, approximating Pareto front sets. The results demonstrate that the primary approach provides the most balanced trade-off between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.