Low-Biased General Annotated Dataset Generation

Dengyang Jiang; Haoyu Wang; Lei Zhang; Wei Wei; Guang Dai; Mengmeng; Wang; Jingdong Wang; Yanning Zhang

arXiv:2412.10831·cs.CV·March 20, 2025

Low-Biased General Annotated Dataset Generation

Dengyang Jiang, Haoyu Wang, Lei Zhang, Wei Wei, Guang Dai, Mengmeng, Wang, Jingdong Wang, Yanning Zhang

PDF

Open Access

TL;DR

This paper introduces lbGen, a framework that generates low-biased, category-annotated images using multimodal models and diffusion techniques, improving the generalization of visual models especially with scarce manual data.

Contribution

The paper proposes a novel low-biased dataset generation method leveraging multimodal models and diffusion, reducing reliance on manual data collection and bias.

Findings

01

Generated datasets improve model generalization across tasks.

02

Method outperforms manually labeled datasets in scarce data scenarios.

03

Enhances robustness of backbone networks in visual tasks.

Abstract

Pre-training backbone networks on a general annotated dataset (e.g., ImageNet) that comprises numerous manually collected images with category annotations has proven to be indispensable for enhancing the generalization capacity of downstream visual tasks. However, those manually collected images often exhibit bias, which is non-transferable across either categories or domains, thus causing the model's generalization capacity degeneration. To mitigate this problem, we present a low-biased general annotated dataset generation framework (lbGen). Instead of expensive manual collection, we aim at directly generating low-biased images with category annotations. To achieve this goal, we propose to leverage the advantage of a multimodal foundation model (e.g., CLIP), in terms of aligning images in a low-biased semantic space defined by language. Specifically, we develop a bi-level semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsDiffusion