To Optimize Human-in-the-loop Learning in Repeated Routing Games
Hongbo Li, Lingjie Duan

TL;DR
This paper introduces a novel recommendation mechanism for repeated routing games that significantly improves social efficiency by reducing the price of anarchy from infinity to near-optimal levels, addressing practical human-in-the-loop learning challenges.
Contribution
It proposes the user-differential probabilistic recommendation (UPR) mechanism, a non-monetary approach that ensures individual rationality and optimizes routing efficiency in repeated, non-myopic scenarios.
Findings
UPR mechanism reduces PoA from infinity to near 1+1/(4N+3).
Theoretical proof of PoA bounds under various information sharing mechanisms.
Experimental validation with real-world datasets confirms near-optimal performance.
Abstract
Today navigation applications (e.g., Waze and Google Maps) enable human users to learn and share the latest traffic observations, yet such information sharing simply aids selfish users to predict and choose the shortest paths to jam each other. Prior routing game studies focus on myopic users in oversimplified one-shot scenarios to regulate selfish routing via information hiding or pricing mechanisms. For practical human-in-the-loop learning (HILL) in repeated routing games, we face non-myopic users of differential past observations and need new mechanisms (preferably non-monetary) to persuade users to adhere to the optimal path recommendations. We model the repeated routing game in a typical parallel transportation network, which generally contains one deterministic path and stochastic paths. We first prove that no matter under the information sharing mechanism in use or the latest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Efficient Wireless Sensor Networks · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems
