To Optimize Human-in-the-loop Learning in Repeated Routing Games

Hongbo Li; Lingjie Duan

arXiv:2411.09867·cs.GT·November 18, 2024

To Optimize Human-in-the-loop Learning in Repeated Routing Games

Hongbo Li, Lingjie Duan

PDF

Open Access

TL;DR

This paper introduces a novel recommendation mechanism for repeated routing games that significantly improves social efficiency by reducing the price of anarchy from infinity to near-optimal levels, addressing practical human-in-the-loop learning challenges.

Contribution

It proposes the user-differential probabilistic recommendation (UPR) mechanism, a non-monetary approach that ensures individual rationality and optimizes routing efficiency in repeated, non-myopic scenarios.

Findings

01

UPR mechanism reduces PoA from infinity to near 1+1/(4N+3).

02

Theoretical proof of PoA bounds under various information sharing mechanisms.

03

Experimental validation with real-world datasets confirms near-optimal performance.

Abstract

Today navigation applications (e.g., Waze and Google Maps) enable human users to learn and share the latest traffic observations, yet such information sharing simply aids selfish users to predict and choose the shortest paths to jam each other. Prior routing game studies focus on myopic users in oversimplified one-shot scenarios to regulate selfish routing via information hiding or pricing mechanisms. For practical human-in-the-loop learning (HILL) in repeated routing games, we face non-myopic users of differential past observations and need new mechanisms (preferably non-monetary) to persuade users to adhere to the optimal path recommendations. We model the repeated routing game in a typical parallel transportation network, which generally contains one deterministic path and $N$ stochastic paths. We first prove that no matter under the information sharing mechanism in use or the latest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEnergy Efficient Wireless Sensor Networks · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems