Low-Biased General Annotated Dataset Generation
Dengyang Jiang, Haoyu Wang, Lei Zhang, Wei Wei, Guang Dai, Mengmeng, Wang, Jingdong Wang, Yanning Zhang

TL;DR
This paper introduces lbGen, a framework that generates low-biased, category-annotated images using multimodal models and diffusion techniques, improving the generalization of visual models especially with scarce manual data.
Contribution
The paper proposes a novel low-biased dataset generation method leveraging multimodal models and diffusion, reducing reliance on manual data collection and bias.
Findings
Generated datasets improve model generalization across tasks.
Method outperforms manually labeled datasets in scarce data scenarios.
Enhances robustness of backbone networks in visual tasks.
Abstract
Pre-training backbone networks on a general annotated dataset (e.g., ImageNet) that comprises numerous manually collected images with category annotations has proven to be indispensable for enhancing the generalization capacity of downstream visual tasks. However, those manually collected images often exhibit bias, which is non-transferable across either categories or domains, thus causing the model's generalization capacity degeneration. To mitigate this problem, we present a low-biased general annotated dataset generation framework (lbGen). Instead of expensive manual collection, we aim at directly generating low-biased images with category annotations. To achieve this goal, we propose to leverage the advantage of a multimodal foundation model (e.g., CLIP), in terms of aligning images in a low-biased semantic space defined by language. Specifically, we develop a bi-level semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsDiffusion
