Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

Ananth Muppidi; Abhilash Nandy; Sambaran Bandyopadhyay

arXiv:2506.05629·cs.CL·June 9, 2025

Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

Ananth Muppidi, Abhilash Nandy, Sambaran Bandyopadhyay

PDF

Open Access

TL;DR

This paper introduces ID-SPAM, a novel input-dependent soft prompting method using self-attention, which enhances domain transfer and efficiency in large language models without extensive fine-tuning.

Contribution

It presents a new self-attention based soft prompting technique that dynamically generates prompts based on input tokens, improving transferability and efficiency.

Findings

01

Outperforms state-of-the-art soft prompting methods

02

Enhances zero-shot domain transfer capabilities

03

Maintains low number of trainable parameters

Abstract

The performance of large language models in domain-specific tasks necessitates fine-tuning, which is computationally expensive and technically challenging. This paper focuses on parameter-efficient fine-tuning using soft prompting, a promising approach that adapts pre-trained models to downstream tasks by learning a small set of parameters. We propose a novel Input Dependent Soft Prompting technique with a self-Attention Mechanism (ID-SPAM) that generates soft prompts based on the input tokens and attends different tokens with varying importance. Our method is simple and efficient, keeping the number of trainable parameters small. We show the merits of the proposed approach compared to state-of-the-art techniques on various tasks and show the improved zero shot domain transfer capability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques