Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets
Changjian Chen, Fei Lv, Yalong Guan, Pengcheng Wang, Shengjie Yu,, Yifan Zhang, and Zhuo Tang

TL;DR
This paper introduces a human-guided image generation approach that enables controllable dataset expansion for small-scale datasets, improving model performance and user experience through sample-level prompt refinement.
Contribution
It proposes a multi-modal projection method with theoretical guarantees and a sample-level prompt refinement technique to facilitate controllable and user-friendly dataset augmentation.
Findings
Enhanced dataset diversity through human-guided generation
Improved classification and detection performance
Positive expert feedback on the method's usability
Abstract
The performance of computer vision models in certain real-world applications (e.g., rare wildlife observation) is limited by the small number of available images. Expanding datasets using pre-trained generative models is an effective way to address this limitation. However, since the automatic generation process is uncontrollable, the generated images are usually limited in diversity, and some of them are undesired. In this paper, we propose a human-guided image generation method for more controllable dataset expansion. We develop a multi-modal projection method with theoretical guarantees to facilitate the exploration of both the original and generated images. Based on the exploration, users refine the prompts and re-generate images for better performance. Since directly refining the prompts is challenging for novice users, we develop a sample-level prompt refinement method to make it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis · Medical Image Segmentation Techniques
