Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand
Daniel Garces, Sushmita Bhattacharya, Stephanie Gil, Dimitri Bertsekas

TL;DR
This paper presents a reinforcement learning framework for autonomous vehicle routing that adapts to variable urban demand, improving coordination, non-myopic planning, and responsiveness to demand fluctuations, demonstrated on real taxi data.
Contribution
It introduces an adaptive multiagent RL approach combining online and offline methods to handle demand variability using Wasserstein ambiguity sets.
Findings
Outperforms classical routing methods in simulations.
Successfully adapts to demand fluctuations in real-world data.
Reduces wait times through coordinated, non-myopic policies.
Abstract
We derive a learning framework to generate routing/pickup policies for a fleet of autonomous vehicles tasked with servicing stochastically appearing requests on a city map. We focus on policies that 1) give rise to coordination amongst the vehicles, thereby reducing wait times for servicing requests, 2) are non-myopic, and consider a-priori potential future requests, 3) can adapt to changes in the underlying demand distribution. Specifically, we are interested in policies that are adaptive to fluctuations of actual demand conditions in urban environments, such as on-peak vs. off-peak hours. We achieve this through a combination of (i) an online play algorithm that improves the performance of an offline-trained policy, and (ii) an offline approximation scheme that allows for adapting to changes in the underlying demand model. In particular, we achieve adaptivity of our learned policy to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Traffic control and management · Transportation Planning and Optimization
