Plug-and-play Class-aware Knowledge Injection for Prompt Learning with Visual-Language Model

Junhui Yin; Nan Pu; Xinyu Zhang; Lingfeng Yang; Lin Wu; Xiaojie Wang; Zhun Zhong

arXiv:2605.05910·cs.CV·May 8, 2026

Plug-and-play Class-aware Knowledge Injection for Prompt Learning with Visual-Language Model

Junhui Yin, Nan Pu, Xinyu Zhang, Lingfeng Yang, Lin Wu, Xiaojie Wang, Zhun Zhong

PDF

1 Repo

TL;DR

The paper introduces CAKI, a plug-and-play framework that injects class-specific knowledge into vision-language models to improve zero-shot classification accuracy.

Contribution

It proposes a novel class-aware knowledge injection method with class-specific prompt generation and retrieval, enhancing existing prompt learning techniques.

Findings

01

CAKI improves performance on base and novel classes.

02

The method effectively incorporates class-specific knowledge.

03

Experiments validate the effectiveness of the proposed framework.

Abstract

Prompt learning has become an effective and widely used technique in enhancing vision-language models (VLMs) such as CLIP for various downstream tasks, particularly in zero-shot classification within specific domains. Existing methods typically focus on either learning class-shared prompts for a given domain or generating instance-specific prompts through conditional prompt learning. While these methods have achieved promising performance, they often overlook class-specific knowledge in prompt design, leading to suboptimal outcomes. The underlying reasons are: 1) class-specific prompts offer more fine-grained supervision compared to coarse class-shared prompts, which helps prevent misclassification of data from different classes into a single class; 2) compared to class-specific prompts, instance-specific prompts neglect the richer class-level information across multiple instances,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yjh576/CAKI
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.