Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control

Sergey Sedov; Sumanth Bharadwaj Hachalli Karanam; Venu Gopal Kadamba

arXiv:2412.18582·cs.CL·March 10, 2026

TL;DR

This paper investigates the role of embedding priors in prompt-tuning, revealing their influence on embedding positions and activation space trajectories, and explores implications for model interpretability and task generalization.

Contribution

The paper designs embedding priors for Prompt-Tuning, compares them with the posteriors of converged Soft and Deep Prompt-Tuning, and analyzes their effect on embedding positions and activation spaces, offering insights into model behavior and potential control mechanisms.

Findings

01. Embedding priors significantly influence embedding positions.

02. Models can operate effectively with embeddings from diverse activation space regions.

03. Activation trajectories form distinct clusters for different task types.

Abstract

Prompt-Tuning is an efficient method for adapting pre-trained language models to new tasks with minimal computational overhead by modifying prompt embeddings. In this work, we investigate how crucial the phenomenon of embedding collapse, frequently observed in Prompt-Tuning, is for the final performance of the model. To address this question, we designed embedding priors and compared them with posteriors of the converged Soft and Deep Prompt-Tuning methods. Our findings suggest that priors strongly affect the position of the tuned embeddings, and models can effectively work with embeddings from different parts of activation spaces, including completely new regions. As the final Prompt-Tuning capabilities are limited, we hypothesize that controllable Prompt-Tuning posteriors may serve as a good starting point for tasks such as chain-of-thought (COT) distillation. Our experiments also…
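
To make the idea of an embedding prior concrete, the following is a minimal sketch of initializing soft prompt embeddings from a prior and prepending them to frozen token embeddings. The excerpt does not specify which priors the authors designed, so the Gaussian centred on the vocabulary mean, the function names, and the toy two-dimensional vocabulary are all assumptions made for illustration. The gradient-based tuning step itself is omitted.

```python
import random

def init_prompt_from_prior(vocab_embeddings, num_prompt_tokens, scale=0.1, seed=0):
    """Sample prompt embeddings from an assumed Gaussian prior.

    The prior is centred on the mean of the frozen vocabulary embeddings.
    This choice is hypothetical; it is not taken from the paper.
    """
    rng = random.Random(seed)
    dim = len(vocab_embeddings[0])
    count = len(vocab_embeddings)
    # Per-dimension mean of the vocabulary embeddings.
    mean = [sum(vec[d] for vec in vocab_embeddings) / count for d in range(dim)]
    # One independent Gaussian draw per prompt token and dimension.
    return [
        [rng.gauss(mean[d], scale) for d in range(dim)]
        for _ in range(num_prompt_tokens)
    ]

def prepend_prompt(prompt_embeddings, token_embeddings):
    """Build the model input: trainable prompt vectors, then frozen token vectors."""
    return prompt_embeddings + token_embeddings

# Toy frozen vocabulary with 3 tokens of dimension 2 (illustrative values only).
vocab = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
prompt = init_prompt_from_prior(vocab, num_prompt_tokens=4)
inputs = prepend_prompt(prompt, [vocab[0], vocab[2]])
print(len(inputs), "input vectors of dimension", len(inputs[0]))
```

Under these assumptions, comparing `prompt` before tuning (the prior sample) with the same vectors after tuning (the posterior) is the kind of comparison the abstract describes; changing `scale` or the centre of the prior changes where in embedding space the tuning starts.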

Taxonomy

Topics: Interpreting and Communication in Healthcare