Enhancing Zero-shot Counting via Language-guided Exemplar Learning
Mingjie Wang, Jun Zhou, Yong Dai, Eric Buys, Minglun Gong

TL;DR
This paper introduces ExpressCount, a novel zero-shot counting method leveraging language-guided exemplar learning with large language models and a dual-branch visual pipeline, achieving state-of-the-art results.
Contribution
The paper presents a new framework combining language-oriented exemplar perception and a visual counting pipeline, advancing zero-shot counting capabilities.
Findings
Achieves state-of-the-art zero-shot counting performance.
Demonstrates accuracy comparable to partial category-specific models.
Introduces FSC-147-Express dataset with linguistic annotations.
Abstract
Recently, Class-Agnostic Counting (CAC) problem has garnered increasing attention owing to its intriguing generality and superior efficiency compared to Category-Specific Counting (CSC). This paper proposes a novel ExpressCount to enhance zero-shot object counting by delving deeply into language-guided exemplar learning. Specifically, the ExpressCount is comprised of an innovative Language-oriented Exemplar Perceptron and a downstream visual Zero-shot Counting pipeline. Thereinto, the perceptron hammers at exploiting accurate exemplar cues from collaborative language-vision signals by inheriting rich semantic priors from the prevailing pre-trained Large Language Models (LLMs), whereas the counting pipeline excels in mining fine-grained features through dual-branch and cross-attention schemes, contributing to the high-quality similarity learning. Apart from building a bridge between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
