TFCounter:Polishing Gems for Training-Free Object Counting
Pan Ting, Jianfeng Lin, Wenhao Yu, Wenlong Zhang, Xiaoying Chen, Jinlu, Zhang, Binqiang Huang

TL;DR
TFCounter is a training-free, prompt-based object counting method that leverages foundation models and context-aware modules to achieve high accuracy and generalizability across diverse scenes without requiring training data.
Contribution
It introduces a novel training-free, prompt-context-aware framework for object counting that outperforms existing methods and demonstrates strong cross-domain generalization.
Findings
Outperforms existing training-free methods
Achieves competitive results with trained models
Demonstrates strong cross-domain generalization
Abstract
Object counting is a challenging task with broad application prospects in security surveillance, traffic management, and disease diagnosis. Existing object counting methods face a tri-fold challenge: achieving superior performance, maintaining high generalizability, and minimizing annotation costs. We develop a novel training-free class-agnostic object counter, TFCounter, which is prompt-context-aware via the cascade of the essential elements in large-scale foundation models. This approach employs an iterative counting framework with a dual prompt system to recognize a broader spectrum of objects varying in shape, appearance, and size. Besides, it introduces an innovative context-aware similarity module incorporating background context to enhance accuracy within messy scenes. To demonstrate cross-domain generalizability, we collect a novel counting dataset named BIKE-1000, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Handwritten Text Recognition Techniques · Digital Media Forensic Detection
