Membership Inference Attack against Long-Context Large Language Models
Zixiong Wang, Gaoyang Liu, Yang Yang, Chen Wang

TL;DR
This paper demonstrates that Long-Context Large Language Models are vulnerable to membership inference attacks, revealing sensitive data in their extended contexts with high accuracy, thus exposing privacy risks.
Contribution
The study introduces the first set of membership inference attack strategies tailored for Long-Context LLMs and evaluates their effectiveness across multiple models.
Findings
Achieved up to 90.66% attack F1-score on multi-document QA datasets.
LCLMs are susceptible to membership leakage, posing privacy risks.
Proposed strategies effectively infer document membership in LCLMs contexts.
Abstract
Recent advances in Large Language Models (LLMs) have enabled them to overcome their context window limitations, and demonstrate exceptional retrieval and reasoning capacities on longer context. Quesion-answering systems augmented with Long-Context Language Models (LCLMs) can automatically search massive external data and incorporate it into their contexts, enabling faithful predictions and reducing issues such as hallucinations and knowledge staleness. Existing studies targeting LCLMs mainly concentrate on addressing the so-called lost-in-the-middle problem or improving the inference effiencicy, leaving their privacy risks largely unexplored. In this paper, we aim to bridge this gap and argue that integrating all information into the long context makes it a repository of sensitive information, which often contains private data such as medical records or personal identities. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
