Counting Through Occlusion: Framework for Open World Amodal Counting
Safaeid Hossain Arib, Rabeya Akter, Abdul Monaf Chowdhury, Md Jubair Ahmed Sourov, Md Mehedi Hasan

TL;DR
This paper introduces CountOCC, a novel framework for amodal object counting under occlusion that reconstructs occluded features using hierarchical multimodal guidance and enforces consistency in attention maps, achieving state-of-the-art results.
Contribution
CountOCC is the first framework to explicitly reconstruct occluded object features using multimodal guidance and attention consistency, improving counting accuracy in occluded scenes.
Findings
Achieves 26.72% MAE reduction on FSC-147-OCC
Sets new SOTA on CARPK-OCC with 49.89% MAE reduction
Demonstrates strong generalization on CAPTURe-Real dataset
Abstract
Object counting has achieved remarkable success on visible instances, yet state-of-the-art (SOTA) methods fail under occlusion. This failure stems from a fundamental architectural limitation where backbone networks encode occluding surfaces rather than target objects, thereby corrupting the feature representations required for accurate enumeration. To address this, we present CountOCC, an amodal counting framework that explicitly reconstructs occluded object features through hierarchical multimodal guidance. Rather than accepting degraded encodings, we synthesize complete representations by integrating spatial context from visible fragments with semantic priors from text and visual embeddings, generating features at occluded locations across multiple pyramid levels. We further introduce a visual equivalence objective that enforces consistency in attention space, ensuring that both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Visual Attention and Saliency Detection
