Synthetic Data Generation and Adaption for Object Detection in Smart Vending Machines
Kai Wang, Fuyuan Shi, Wenqi Wang, Yibing Nan, Shiguo Lian

TL;DR
This paper introduces an improved synthetic data generation and adaptation method for training deep CNNs to enhance object detection in smart vending machines, addressing challenges of realism and data redundancy.
Contribution
It proposes a novel virtual scene simulation, camera distortion modeling, and generative network post-processing to produce more realistic synthetic images for better training.
Findings
Enhanced detection accuracy with synthetic data
Improved generalization to new environments
Effective reduction of redundant training data
Abstract
This paper presents an improved scheme for the generation and adaption of synthetic images for the training of deep Convolutional Neural Networks(CNNs) to perform the object detection task in smart vending machines. While generating synthetic data has proved to be effective for complementing the training data in supervised learning methods, challenges still exist for generating virtual images which are similar to those of the complex real scenes and minimizing redundant training data. To solve these problems, we consider the simulation of cluttered objects placed in a virtual scene and the wide-angle camera with distortions used to capture the whole scene in the data generation process, and post-processed the generated images with a elaborately-designed generative network to make them more similar to the real images. Various experiments have been conducted to prove the efficiency of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Surveying and Cultural Heritage · Generative Adversarial Networks and Image Synthesis
