TL;DR
This paper introduces a deep reinforcement learning framework for dynamic pricing of managed lanes with multiple access points, considering real-world complexities like demand variability and partial observability, outperforming traditional feedback control methods.
Contribution
It develops a novel Deep-RL approach for toll setting in complex transportation networks, relaxing many assumptions of prior models and demonstrating superior performance.
Findings
Deep-RL policies increase revenue by up to 9.5% over heuristics.
Deep-RL reduces total system travel time by up to 10.4%.
Proposed reward shaping improves policy behavior.
Abstract
This article develops a deep reinforcement learning (Deep-RL) framework for dynamic pricing on managed lanes with multiple access locations and heterogeneity in travelers' value of time, origin, and destination. This framework relaxes assumptions in the literature by considering multiple origins and destinations, multiple access locations to the managed lane, en route diversion of travelers, partial observability of the sensor readings, and stochastic demand and observations. The problem is formulated as a partially observable Markov decision process (POMDP) and policy gradient methods are used to determine tolls as a function of real-time observations. Tolls are modeled as continuous and stochastic variables, and are determined using a feedforward neural network. The method is compared against a feedback control method used for dynamic pricing. We show that Deep-RL is effective in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
