AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
Zheng Li, Yibing Song, Xin Zhang, Lei Luo, Xiang Li, Jian Yang

TL;DR
AnchorOPT introduces a dynamic, learnable anchor and position matrix for prompt learning with CLIP, enhancing flexibility and performance across tasks without added complexity.
Contribution
It proposes a novel dynamic anchor framework that learns task-specific anchors and adaptively optimizes their position, improving prompt learning flexibility and effectiveness.
Findings
Achieves comparable or superior performance to more complex methods.
Seamlessly integrates into existing frameworks, providing consistent gains.
Operates effectively across diverse datasets.
Abstract
Existing prompt learning methods, which are built upon CLIP models, leverage textual tokens as anchors to guide the learnable soft tokens. This guidance improves CLIP generalizations. However, these anchors-static in both value and position-lack cross-task and stage-adaptive flexibility. To address this limitation, we propose AnchorOPT, a dynamic anchor-based prompt learning framework. Specifically, AnchorOPT introduces dynamism in two key dimensions: (i) anchor values eschew handcrafted explicit textual tokens (e.g., "shape", "color"), instead learning dynamically from task-specific data; and (ii) the positional relationship between anchor and soft tokens is no longer fixed but adaptively optimized via a learnable position matrix conditioned on the training stage and task context. Training occurs in two stages: we first learn the anchor tokens, then freeze and transfer them to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling
