Label Privacy in Split Learning for Large Models with   Parameter-Efficient Training

Philip Zmushko; Marat Mansurov; Ruslan Svirschevski; Denis Kuznedelev,; Max Ryabinin; Aleksandr Beznosikov

arXiv:2412.16669·cs.LG·December 24, 2024

Label Privacy in Split Learning for Large Models with Parameter-Efficient Training

Philip Zmushko, Marat Mansurov, Ruslan Svirschevski, Denis Kuznedelev,, Max Ryabinin, Aleksandr Beznosikov

PDF

Open Access 1 Repo

TL;DR

This paper introduces P³EFT, a privacy-preserving split learning algorithm for large models with parameter-efficient fine-tuning, ensuring label privacy with minimal performance loss in API-based training.

Contribution

The study presents P³EFT, a novel split learning method that maintains label privacy during fine-tuning large models with PEFT, outperforming existing privacy methods in accuracy.

Findings

01

P³EFT achieves competitive privacy and accuracy in NLP tasks.

02

Analysis shows LoRA's privacy vulnerabilities and how P³EFT mitigates them.

03

Experimental results on DeBERTa, Flan-T5, and LLaMA-2 demonstrate effectiveness.

Abstract

As deep learning models become larger and more expensive, many practitioners turn to fine-tuning APIs. These web services allow fine-tuning a model between two parties: the client that provides the data, and the server that hosts the model. While convenient, these APIs raise a new concern: the data of the client is at risk of privacy breach during the training procedure. This challenge presents an important practical case of vertical federated learning, where the two parties perform parameter-efficient fine-tuning (PEFT) of a large model. In this study, we systematically search for a way to fine-tune models over an API while keeping the labels private. We analyze the privacy of LoRA, a popular approach for parameter-efficient fine-tuning when training over an API. Using this analysis, we propose P $^{3}$ EFT, a multi-party split learning algorithm that takes advantage of existing PEFT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

p3eft-iclr-2025/p3eft-iclr-2025
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Machine Learning and Data Classification

MethodsFlan-T5