AFreeCA: Annotation-Free Counting for All

Adriano D'Alessandro; Ali Mahdavi-Amiri; Ghassan Hamarneh

arXiv:2403.04943·cs.CV·August 5, 2024·1 cites

AFreeCA: Annotation-Free Counting for All

Adriano D'Alessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh

PDF

Open Access 1 Repo

TL;DR

This paper introduces AFreeCA, an unsupervised method for object counting that leverages text-to-image latent diffusion models to generate counting data without manual annotations, enabling versatile counting across diverse object categories.

Contribution

The paper proposes a novel unsupervised counting approach using LDMs for data generation and density classification, removing the need for annotated datasets and broadening object category applicability.

Findings

01

Outperforms other unsupervised and few-shot counting methods

02

Enables counting of diverse object categories without annotations

03

Uses LDMs to generate reliable counting data

Abstract

Object counting methods typically rely on manually annotated datasets. The cost of creating such datasets has restricted the versatility of these networks to count objects from specific classes (such as humans or penguins), and counting objects from diverse categories remains a challenge. The availability of robust text-to-image latent diffusion models (LDMs) raises the question of whether these models can be utilized to generate counting datasets. However, LDMs struggle to create images with an exact number of objects based solely on text prompts but they can be used to offer a dependable \textit{sorting} signal by adding and removing objects within an image. Leveraging this data, we initially introduce an unsupervised sorting methodology to learn object-related features that are subsequently refined and anchored for counting purposes using counting data generated by LDMs. Further, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adrian-dalessandro/afreeca
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Time Series Analysis and Forecasting · Generative Adversarial Networks and Image Synthesis

MethodsDiffusion