Designing an efficient and equitable humanitarian supply chain dynamically via reinforcement learning

Weijia Jin

arXiv:2505.17439·cs.LG·May 26, 2025

Designing an efficient and equitable humanitarian supply chain dynamically via reinforcement learning

Weijia Jin

PDF

TL;DR

This paper proposes a reinforcement learning approach, specifically PPO, to dynamically optimize humanitarian supply chains for efficiency and fairness, outperforming heuristic algorithms.

Contribution

It introduces a PPO-based model that prioritizes satisfaction rate, offering a novel dynamic optimization method for humanitarian logistics.

Findings

01

PPO model outperforms heuristic algorithms in efficiency.

02

The model emphasizes equitable satisfaction across stakeholders.

03

Dynamic approach adapts to changing supply chain conditions.

Abstract

This study designs an efficient and equitable humanitarian supply chain dynamically by using reinforcement learning, PPO, and compared with heuristic algorithms. This study demonstrates the model of PPO always treats average satisfaction rate as the priority.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.