The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

Jiachen Hu; Rui Ai; Han Zhong; Xiaoyu Chen; Liwei Wang; Zhaoran Wang; Zhuoran Yang

arXiv:2506.09940·cs.LG·June 12, 2025

The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

Jiachen Hu, Rui Ai, Han Zhong, Xiaoyu Chen, Liwei Wang, Zhaoran Wang, Zhuoran Yang

PDF

Open Access

TL;DR

This paper introduces a sample-efficient online learning algorithm that addresses the challenges of information asymmetry and knowledge transfer in strategic multi-agent environments, achieving near-optimal policy learning.

Contribution

It presents a novel algorithm that effectively learns system dynamics under information asymmetry and knowledge transfer challenges with proven sample complexity bounds.

Findings

01

Achieves $ ext{O}(1/ ext{epsilon}^2)$ sample complexity for $ ext{epsilon}$-optimal policy

02

Handles non-i.i.d. actions in confounder learning

03

Addresses knowledge transfer in strategic online learning environments

Abstract

Information asymmetry is a pervasive feature of multi-agent systems, especially evident in economics and social sciences. In these settings, agents tailor their actions based on private information to maximize their rewards. These strategic behaviors often introduce complexities due to confounding variables. Simultaneously, knowledge transportability poses another significant challenge, arising from the difficulties of conducting experiments in target environments. It requires transferring knowledge from environments where empirical data is more readily available. Against these backdrops, this paper explores a fundamental question in online learning: Can we employ non-i.i.d. actions to learn about confounders even when requiring knowledge transfer? We present a sample-efficient algorithm designed to accurately identify system dynamics under information asymmetry and to navigate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research