When Prompts Interact: Assessing Prompt Arithmetic for Deconfounding under Distribution Shift

Zhecheng Sheng; Yongsen Tan; Xiruo Ding; Trevor Cohen; Serguei Pakhomov

arXiv:2605.03096·cs.LG·May 6, 2026

When Prompts Interact: Assessing Prompt Arithmetic for Deconfounding under Distribution Shift

Zhecheng Sheng, Yongsen Tan, Xiruo Ding, Trevor Cohen, Serguei Pakhomov

PDF

TL;DR

This paper introduces Hybrid Prompt Arithmetic (HyPA), a parameter-efficient method that combines prompt tuning and task arithmetic to improve model robustness against confounding shifts in classification tasks.

Contribution

It proposes HyPA, a novel approach that enhances robustness to distribution shifts by combining prompt and confounder prompts, with analysis of its effects on model representations.

Findings

01

HyPA consistently improves robustness-performance trade-off under distribution shift.

02

HyPA reduces the influence of confounder signals in model predictions.

03

HyPA is a parameter-efficient method that mitigates reliance on spurious features.

Abstract

In classification tasks, models may rely on confounding variables to achieve strong in-distribution performance, capturing spurious features that fail under distribution shift. This shortcut behavior leads to substantial degradation in out-of-distribution settings. Task arithmetic offers a potential solution by removing unwanted signals via subtraction of secondary model updates, but it typically requires full fine-tuning, which is computationally expensive. Prompt tuning provides a parameter-efficient alternative by adapting models through a small set of trainable virtual tokens. Task arithmetic on the resulting prompts presents an appealing alternative to operations on entire models, but the extent to which this approach can limit reliance on spurious features remains to be established. In this work, we study whether composing soft prompts through task arithmetic improves robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.