CoAPT: Context Attribute words for Prompt Tuning
Gun Lee, Subin An, Sungyong Baik, Soochahn Lee

TL;DR
CoAPT introduces a prompt tuning method that incorporates attribute words to enhance text-image alignment in CLIP, significantly improving few/zero-shot image classification performance across various tasks.
Contribution
The paper presents a novel prompt tuning approach that integrates attribute words and a meta-network to improve text-image embedding alignment in CLIP-based classification.
Findings
Significant improvements in few/zero-shot classification accuracy.
Enhanced generalization across datasets and domains.
Effective combination of hard and soft prompts.
Abstract
We propose a novel prompt tuning method called CoAPT(Context Attribute words in Prompt Tuning) for few/zero-shot image classification. The core motivation is that attributes are descriptive words with rich information about a given concept. Thus, we aim to enrich text queries of existing prompt tuning methods, improving alignment between text and image embeddings in CLIP embedding space. To do so, CoAPT integrates attribute words as additional prompts within learnable prompt tuning and can be easily incorporated into various existing prompt tuning methods. To facilitate the incorporation of attributes into text embeddings and the alignment with image embeddings, soft prompts are trained together with an additional meta-network that generates input-image-wise feature biases from the concatenated feature encodings of the image-text combined queries. Our experiments demonstrate that CoAPT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Logic, programming, and type systems · Software Testing and Debugging Techniques
MethodsContrastive Language-Image Pre-training
