TL;DR
This paper uncovers vulnerabilities in split learning for large language models, demonstrating a novel attack method that can reconstruct private training data despite existing defenses.
Contribution
It introduces BiSR, a bidirectional data reconstruction attack exploiting split learning and fine-tuning vulnerabilities, with extensive empirical validation.
Findings
BiSR achieves state-of-the-art data reconstruction accuracy.
The attack remains effective against multiple defense mechanisms.
Experimental results confirm significant privacy risks in split-based LLM frameworks.
Abstract
Recent advancements in pre-trained large language models (LLMs) have significantly influenced various domains. Adapting these models for specific tasks often involves fine-tuning (FT) with private, domain-specific data. However, privacy concerns keep this data undisclosed, and the computational demands for deploying LLMs pose challenges for resource-limited data holders. This has sparked interest in split learning (SL), a Model-as-a-Service (MaaS) paradigm that divides LLMs into smaller segments for distributed training and deployment, transmitting only intermediate activations instead of raw data. SL has garnered substantial interest in both industry and academia as it aims to balance user data privacy, model ownership, and resource challenges in the private fine-tuning of LLMs. Despite its privacy claims, this paper reveals significant vulnerabilities arising from the combination of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
