Where the Action is: Let's make Reinforcement Learning for Stochastic Dynamic Vehicle Routing Problems work!
Florentin D Hildebrandt, Barrett Thomas, Marlin W Ulmer

TL;DR
This paper discusses the challenges of applying reinforcement learning to stochastic dynamic vehicle routing problems, emphasizing the need for joint research between operations research and computer science communities.
Contribution
It highlights the gap between communities and guides towards collaborative approaches to improve RL solutions for SDVRPs.
Findings
Identifies obstacles in applying RL to SDVRPs
Highlights the lack of joint research efforts
Provides guidance for future collaborative research
Abstract
There has been a paradigm-shift in urban logistic services in the last years; demand for real-time, instant mobility and delivery services grows. This poses new challenges to logistic service providers as the underlying stochastic dynamic vehicle routing problems (SDVRPs) require anticipatory real-time routing actions. Searching the combinatorial action space for efficient routing actions is by itself a complex task of mixed-integer programming (MIP) well-known by the operations research community. This complexity is now multiplied by the challenge of evaluating such actions with respect to their effectiveness given future dynamism and uncertainty, a potentially ideal case for reinforcement learning (RL) well-known by the computer science community. For solving SDVRPs, joint work of both communities is needed, but as we show, essentially non-existing. Both communities focus on their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Vehicle Routing Optimization Methods · Transportation Planning and Optimization
Methodstravel james
