Unveiling the Vulnerability of Private Fine-Tuning in Split-Based   Frameworks for Large Language Models: A Bidirectionally Enhanced Attack

Guanzhong Chen; Zhenghan Qin; Mingxin Yang; Yajie Zhou; Tao Fan,; Tianyu Du; Zenglin Xu

arXiv:2409.00960·cs.CR·December 11, 2024

Unveiling the Vulnerability of Private Fine-Tuning in Split-Based Frameworks for Large Language Models: A Bidirectionally Enhanced Attack

Guanzhong Chen, Zhenghan Qin, Mingxin Yang, Yajie Zhou, Tao Fan,, Tianyu Du, Zenglin Xu

PDF

1 Repo

TL;DR

This paper uncovers vulnerabilities in split learning for large language models, demonstrating a novel attack method that can reconstruct private training data despite existing defenses.

Contribution

It introduces BiSR, a bidirectional data reconstruction attack exploiting split learning and fine-tuning vulnerabilities, with extensive empirical validation.

Findings

01

BiSR achieves state-of-the-art data reconstruction accuracy.

02

The attack remains effective against multiple defense mechanisms.

03

Experimental results confirm significant privacy risks in split-based LLM frameworks.

Abstract

Recent advancements in pre-trained large language models (LLMs) have significantly influenced various domains. Adapting these models for specific tasks often involves fine-tuning (FT) with private, domain-specific data. However, privacy concerns keep this data undisclosed, and the computational demands for deploying LLMs pose challenges for resource-limited data holders. This has sparked interest in split learning (SL), a Model-as-a-Service (MaaS) paradigm that divides LLMs into smaller segments for distributed training and deployment, transmitting only intermediate activations instead of raw data. SL has garnered substantial interest in both industry and academia as it aims to balance user data privacy, model ownership, and resource challenges in the private fine-tuning of LLMs. Despite its privacy claims, this paper reveals significant vulnerabilities arising from the combination of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

StupidTrees/SplitLLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.