Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs

Xun Wang; Jing Xu; Franziska Boenisch; Michael Backes; Christopher A. Choquette-Choo; Adam Dziedzic

arXiv:2506.16196·cs.LG·June 23, 2025

Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs

Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Christopher A. Choquette-Choo, Adam Dziedzic

PDF

Open Access 1 Video

TL;DR

This paper introduces POST, a framework that enables privacy-preserving, efficient transfer of soft prompts from small models to large language models, reducing costs and protecting sensitive data.

Contribution

The paper proposes a novel method for transferring soft prompts with privacy guarantees, leveraging knowledge distillation to improve transferability across models.

Findings

01

POST reduces computational costs significantly.

02

It maintains high prompt utility while ensuring privacy.

03

Effective transfer of soft prompts to large LLMs demonstrated.

Abstract

Prompting has become a dominant paradigm for adapting large language models (LLMs). While discrete (textual) prompts are widely used for their interpretability, soft (parameter) prompts have recently gained traction in APIs. This is because they can encode information from more training samples while minimizing the user's token usage, leaving more space in the context window for task-specific input. However, soft prompts are tightly coupled to the LLM they are tuned on, limiting their generalization to other LLMs. This constraint is particularly problematic for efficiency and privacy: (1) tuning prompts on each LLM incurs high computational costs, especially as LLMs continue to grow in size. Additionally, (2) when the LLM is hosted externally, soft prompt tuning often requires sharing private data with the LLM provider. For instance, this is the case with the NVIDIA NeMo API. To address…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs· slideslive

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Privacy-Preserving Technologies in Data