To Analyze and Regulate Human-in-the-loop Learning for Congestion Games
Hongbo Li, Lingjie Duan

TL;DR
This paper studies human-in-the-loop congestion games, revealing limitations of current routing policies, and proposes a selective information disclosure mechanism to improve social efficiency and reduce the price of anarchy.
Contribution
It introduces a novel SID mechanism that incentivizes exploration in congestion games, significantly reducing the price of anarchy from unbounded to below 2.
Findings
Myopic routing policies under-explore, increasing inefficiency.
The proposed SID mechanism bounds the PoA below 2.
Extension of results to linear path graphs with multiple nodes.
Abstract
In congestion games, selfish users behave myopically to crowd to the shortest paths, and the social planner designs mechanisms to regulate such selfish routing through information or payment incentives. However, such mechanism design requires the knowledge of time-varying traffic conditions and it is the users themselves to learn and report past road experiences to the social planner (e.g., Waze or Google Maps). When congestion games meet mobile crowdsourcing, it is critical to incentivize selfish users to explore non-shortest paths in the best exploitation-exploration trade-off. First, we consider a simple but fundamental parallel routing network with one deterministic path and multiple stochastic paths for users with an average arrival probability . We prove that the current myopic routing policy (widely used in Waze and Google Maps) misses both exploration (when strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Security and Resilience · Smart Grid Energy Management · Reinforcement Learning in Robotics
