Benchmarking and Analyzing Generative Data for Visual Recognition

Bo Li; Haotian Liu; Liangyu Chen; Yong Jae Lee; Chunyuan Li; Ziwei Liu

arXiv:2307.13697·cs.CV·July 29, 2025

Benchmarking and Analyzing Generative Data for Visual Recognition

Bo Li, Haotian Liu, Liangyu Chen, Yong Jae Lee, Chunyuan Li, Ziwei Liu

PDF

Open Access

TL;DR

This paper introduces GenBench, a comprehensive benchmark for evaluating generative data in visual recognition, along with a new metric CLER, and analyzes the effectiveness of generative data compared to retrieval methods.

Contribution

It presents GenBench, a large benchmark dataset, proposes CLER as a new evaluation metric, and provides insights into the advantages and limitations of generative data for visual recognition.

Findings

01

Generative data can be effective for visual recognition tasks.

02

CLER correlates better with downstream performance than existing metrics.

03

External knowledge injection improves recognition performance in most cases.

Abstract

Advancements in large pre-trained generative models have expanded their potential as effective data generators in visual recognition. This work delves into the impact of generative images, primarily comparing paradigms that harness external data (\ie generative \vs retrieval \vs original). Our key contributions are: \textbf{1) GenBench Construction:} We devise \textbf{GenBench}, a broad benchmark comprising 22 datasets with 2548 categories, to appraise generative data across various visual recognition tasks. \textbf{2) CLER Score:} To address the insufficient correlation of existing metrics (\eg, FID, CLIP score) with downstream recognition performance, we propose \textbf{CLER}, a training-free metric indicating generative data's efficiency for recognition tasks prior to training. \textbf{3) New Baselines:} Comparisons of generative data with retrieved data from the same external pool…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsContrastive Language-Image Pre-training