Look-Ahead Reasoning on Learning Platforms
Haiqing Zhu, Tijana Zrnic, Celestine Mendler-D\"unner

TL;DR
This paper explores how users strategically interact with learning platforms by looking ahead and coordinating, analyzing the effects on model outcomes and utility alignment, with implications for designing better learning systems.
Contribution
It formalizes level-k and collective reasoning in learning platforms, analyzing their impact on convergence, equilibrium, and utility alignment, introducing new insights into strategic user behavior.
Findings
Higher-level reasoning accelerates convergence but does not change the equilibrium.
Coordination among users can improve collective utility and alignment.
The paper characterizes utility trade-offs in collective action against algorithms.
Abstract
On many learning platforms, the optimization criteria guiding model training reflect the priorities of the designer rather than those of the individuals they affect. Consequently, users may act strategically to obtain more favorable outcomes. While past work has studied strategic user behavior on learning platforms, the focus has largely been on strategic responses to a deployed model, without considering the behavior of other users. In contrast, look-ahead reasoning takes into account that user actions are coupled, and -- at scale -- impact future predictions. Within this framework, we first formalize level-k thinking, a concept from behavioral economics, where users aim to outsmart their peers by looking one step ahead. We show that, while convergence to an equilibrium is accelerated, the equilibrium remains the same, providing no benefit of higher-level reasoning for individuals in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI · Advanced Bandit Algorithms Research
