Beyond Local vs. External: A Game-Theoretic Framework for Trustworthy Knowledge Acquisition
Rujing Yao, Yufei Shi, Yang Wu, Ang Li, Zhuoren Jiang, XiaoFeng Wang, Haixu Tang, Xiaozhong Liu

TL;DR
This paper introduces GTKA, a game-theoretic framework that balances knowledge utility and privacy in querying external LLMs by decomposing sensitive queries and adversarially training to minimize leakage.
Contribution
It proposes a novel game-theoretic approach with a sub-query generator and attacker to enhance privacy without sacrificing answer quality.
Findings
GTKA reduces intent leakage compared to baselines.
GTKA maintains high answer fidelity in sensitive domains.
Experiments validate effectiveness in biomedical and legal benchmarks.
Abstract
Cloud-hosted Large Language Models (LLMs) offer unmatched reasoning capabilities and dynamic knowledge, yet submitting raw queries to these external services risks exposing sensitive user intent. Conversely, relying exclusively on trusted local models preserves privacy but often compromises answer quality due to limited parameter scale and knowledge. To resolve this dilemma, we propose Game-theoretic Trustworthy Knowledge Acquisition (GTKA), a framework that formulates the trade-off between knowledge utility and privacy as a strategic game. GTKA consists of three components: (i) a privacy-aware sub-query generator that decomposes sensitive intent into generalized, low-risk fragments; (ii) an adversarial reconstruction attacker that attempts to infer the original query from these fragments, providing adaptive leakage signals; and (iii) a trusted local integrator that synthesizes external…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
