DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
Chengxiang Fan, Muzhi Zhu, Hao Chen, Yang Liu, Weijia Wu, Huaqi Zhang,, Chunhua Shen

TL;DR
DiverGen enhances instance segmentation by efficiently generating diverse synthetic data to expand the training distribution, reducing overfitting and improving accuracy, especially for rare categories.
Contribution
The paper introduces DiverGen, a novel strategy for constructing diverse generative datasets that improve instance segmentation performance by addressing distribution discrepancy and data diversity.
Findings
DiverGen outperforms X-Paste on LVIS with +1.1 box AP and +1.1 mask AP overall.
DiverGen achieves +1.9 box AP and +2.5 mask AP on rare categories.
Scaling generative data to millions maintains performance gains.
Abstract
Instance segmentation is data-hungry, and as model capacity increases, data scale becomes crucial for improving the accuracy. Most instance segmentation datasets today require costly manual annotation, limiting their data scale. Models trained on such data are prone to overfitting on the training set, especially for those rare categories. While recent works have delved into exploiting generative models to create synthetic datasets for data augmentation, these approaches do not efficiently harness the full potential of generative models. To address these issues, we introduce a more efficient strategy to construct generative datasets for data augmentation, termed DiverGen. Firstly, we provide an explanation of the role of generative data from the perspective of distribution discrepancy. We investigate the impact of different data on the distribution learned by the model. We argue that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Face recognition and analysis
