End-to-end Deep Reinforcement Learning for Stochastic Multi-objective Optimization in C-VRPTW
Abdo Abouelrous, Laurens Bliek, Yaoxin Wu, Yingqian Zhang

TL;DR
This paper introduces an end-to-end deep reinforcement learning model for stochastic multi-objective vehicle routing, effectively handling travel time uncertainty and multiple conflicting goals with improved training efficiency.
Contribution
It proposes a novel deep learning approach that manages stochastic travel times and multiple objectives simultaneously, with a scenario clustering technique to enhance training speed.
Findings
Model constructs high-quality Pareto fronts
Outperforms three baseline methods in run time and solution quality
Effectively handles travel time uncertainty and multiple objectives
Abstract
In this work, we consider learning-based applications in routing to solve a Vehicle Routing variant characterized by stochasticity and multiple objectives. Such problems are representative of practical settings where decision-makers have to deal with uncertainty in the operational environment as well as multiple conflicting objectives due to different stakeholders. We specifically consider travel time uncertainty. We also consider two objectives, total travel time and route makespan, that jointly target operational efficiency and labor regulations on shift length, although different objectives could be incorporated. Learning-based methods offer earnest computational advantages as they can repeatedly solve problems with limited interference from the decision-maker. We specifically focus on end-to-end deep learning models that leverage the attention mechanism and multiple solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Advanced Multi-Objective Optimization Algorithms · Traffic control and management
