Dual-Path Stable Soft Prompt Generation for Domain Generalization
Yuedi Zhang, Shuanghao Bai, Wanqi Zhou, Zhirong Luan, Badong Chen

TL;DR
This paper introduces DPSPG, a transformer-based framework that uses negative learning to generate stable, effective prompts for domain generalization, outperforming existing methods across multiple benchmarks.
Contribution
It proposes a novel dual-path soft prompt generation method incorporating negative learning to enhance prompt stability and generalization in domain generalization tasks.
Findings
DPSPG outperforms state-of-the-art methods on five benchmark datasets.
Negative learning improves prompt robustness and reduces variability.
Theoretical analysis confirms increased margin and reduced gradient norm upper bound.
Abstract
Domain generalization (DG) aims to learn a model using data from one or multiple related but distinct source domains that can generalize well to unseen out-of-distribution target domains. Inspired by the success of large pre-trained vision-language models (VLMs), prompt tuning has emerged as an effective generalization strategy. However, it often struggles to capture domain-specific features due to its reliance on manually or fixed prompt inputs. Recently, some prompt generation methods have addressed this limitation by dynamically generating instance-specific and domain-specific prompts for each input, enriching domain information and demonstrating potential for enhanced generalization. Through further investigation, we identify a notable issue in existing prompt generation methods: the same input often yields significantly different and suboptimal prompts across different random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Digital Filter Design and Implementation · Image Processing Techniques and Applications
