ProAPO: Progressively Automatic Prompt Optimization for Visual Classification

Xiangyan Qu; Gaopeng Gou; Jiamin Zhuang; Jing Yu; Kun Song; Qihao Wang; Yili Li; Gang Xiong

arXiv:2502.19844·cs.CV·February 12, 2026

ProAPO: Progressively Automatic Prompt Optimization for Visual Classification

Xiangyan Qu, Gaopeng Gou, Jiamin Zhuang, Jing Yu, Kun Song, Qihao Wang, Yili Li, Gang Xiong

PDF

Open Access 1 Repo

TL;DR

ProAPO introduces a progressive, evolution-based method to optimize visually discriminative prompts for fine-grained image classification, reducing costs and overfitting without human input, and outperforming existing methods across multiple datasets.

Contribution

It proposes a novel evolution-based algorithm with effective prompt generation and selection strategies for automatic prompt optimization in visual classification.

Findings

01

Outperforms existing prompt-based methods on 13 datasets

02

Reduces prompt generation and iteration costs

03

Enhances transferability across different models

Abstract

Vision-language models (VLMs) have made significant progress in image classification by training with large-scale paired image-text data. Their performances largely depend on the prompt quality. While recent methods show that visual descriptions generated by large language models (LLMs) enhance the generalization of VLMs, class-specific prompts may be inaccurate or lack discrimination due to the hallucination in LLMs. In this paper, we aim to find visually discriminative prompts for fine-grained categories with minimal supervision and no human-in-the-loop. An evolution-based algorithm is proposed to progressively optimize language prompts from task-specific templates to class-specific descriptions. Unlike optimizing templates, the search space shows an explosion in class-specific candidate prompts. This increases prompt generation costs, iterative times, and the overfitting problem. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MorningStarOvO/ProAPO
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis