End-to-end Deep Reinforcement Learning for Stochastic Multi-objective Optimization in C-VRPTW

Abdo Abouelrous; Laurens Bliek; Yaoxin Wu; Yingqian Zhang

arXiv:2512.01518·cs.LG·December 2, 2025

End-to-end Deep Reinforcement Learning for Stochastic Multi-objective Optimization in C-VRPTW

Abdo Abouelrous, Laurens Bliek, Yaoxin Wu, Yingqian Zhang

PDF

Open Access

TL;DR

This paper introduces an end-to-end deep reinforcement learning model for stochastic multi-objective vehicle routing, effectively handling travel time uncertainty and multiple conflicting goals with improved training efficiency.

Contribution

It proposes a novel deep learning approach that manages stochastic travel times and multiple objectives simultaneously, with a scenario clustering technique to enhance training speed.

Findings

01

Model constructs high-quality Pareto fronts

02

Outperforms three baseline methods in run time and solution quality

03

Effectively handles travel time uncertainty and multiple objectives

Abstract

In this work, we consider learning-based applications in routing to solve a Vehicle Routing variant characterized by stochasticity and multiple objectives. Such problems are representative of practical settings where decision-makers have to deal with uncertainty in the operational environment as well as multiple conflicting objectives due to different stakeholders. We specifically consider travel time uncertainty. We also consider two objectives, total travel time and route makespan, that jointly target operational efficiency and labor regulations on shift length, although different objectives could be incorporated. Learning-based methods offer earnest computational advantages as they can repeatedly solve problems with limited interference from the decision-maker. We specifically focus on end-to-end deep learning models that leverage the attention mechanism and multiple solution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Advanced Multi-Objective Optimization Algorithms · Traffic control and management