CountEx: Fine-Grained Counting via Exemplars and Exclusion
Yifeng Huang, Gia Khanh Nguyen, Minh Hoai

TL;DR
CountEx introduces a novel framework for fine-grained visual counting that allows explicit exclusion of distractors using multimodal prompts, significantly improving counting accuracy in cluttered scenes.
Contribution
It proposes CountEx, a discriminative counting method with a new query refinement module and introduces CoCount, a benchmark for evaluating fine-grained counting methods.
Findings
CountEx outperforms state-of-the-art methods in counting accuracy.
The Discriminative Query Refinement module effectively isolates exclusion cues.
CountEx generalizes well to both known and novel categories.
Abstract
This paper presents CountEx, a discriminative visual counting framework designed to address a key limitation of existing prompt-based methods: the inability to explicitly exclude visually similar distractors. While current approaches allow users to specify what to count via inclusion prompts, they often struggle in cluttered scenes with confusable object categories, leading to ambiguity and overcounting. CountEx enables users to express both inclusion and exclusion intent, specifying what to count and what to ignore, through multimodal prompts including natural language descriptions and optional visual exemplars. At the core of CountEx is a novel Discriminative Query Refinement module, which jointly reasons over inclusion and exclusion cues by first identifying shared visual features, then isolating exclusion-specific patterns, and finally applying selective suppression to refine the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
