Improving Pallet Detection Using Synthetic Data
Henry Gann, Josiah Bull, Trevor Gee, Mahla Nejati

TL;DR
This paper demonstrates that synthetic data, especially Unity-generated data, can significantly improve pallet detection accuracy in warehouse environments, though lighting conditions greatly affect performance.
Contribution
It introduces a method using domain-randomised and Unity-generated synthetic data for pallet instance segmentation, with performance improvements over previous approaches.
Findings
Improvements of 69% and 50% mAP50 on real data for stacked and racked pallets, respectively
Performance drops sharply on darker images, falling as low as 3% mAP50 at an 80% brightness reduction
Two-stage detector with YOLOv8 and SAM showed unstable performance
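The "50" in mAP50 refers to an intersection-over-union (IoU) threshold of 0.5: a predicted mask counts as a correct detection only if it overlaps a ground-truth mask by at least that much. The sketch below illustrates that matching test for a single pair of binary masks. The function names and the toy masks are hypothetical and are not taken from the paper; a full mAP50 computation would also rank predictions by confidence and average precision across classes.

```python
def mask_iou(pred, truth):
    """Intersection over union of two equally sized binary masks (lists of rows)."""
    intersection = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks have no overlap to measure; treat that as IoU 0.
    return intersection / union if union else 0.0


def is_true_positive(pred, truth, threshold=0.5):
    """A prediction matches the ground truth when IoU meets the threshold."""
    return mask_iou(pred, truth) >= threshold


# Toy 2x4 masks, purely illustrative.
truth = [[0, 1, 1, 0],
         [0, 1, 1, 0]]
pred = [[0, 1, 1, 1],
        [0, 1, 0, 0]]

print(mask_iou(pred, truth))          # 3 shared pixels / 5 covered pixels
print(is_true_positive(pred, truth))  # matched at the 0.5 threshold
```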
Abstract
The use of synthetic data in machine learning saves a significant amount of time when implementing an effective object detector. However, there is limited research in this domain. This study aims to improve upon previously applied implementations in the task of instance segmentation of pallets in a warehouse environment. This study proposes using synthetically generated domain-randomised data as well as data generated through Unity to achieve this. This study achieved performance improvements on the stacked and racked pallet categories by 69% and 50% mAP50, respectively when being evaluated on real data. Additionally, it was found that there was a considerable impact on the performance of a model when it was evaluated against images in a darker environment, dropping as low as 3% mAP50 when being evaluated on images with an 80% brightness reduction. This study also created a two-stage…
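To make the "80% brightness reduction" concrete, here is a minimal sketch of one way to darken an evaluation image. It assumes brightness reduction means scaling every RGB channel linearly by (1 - reduction); this summary does not state how the authors darkened their images, so the method and the function name `reduce_brightness` are assumptions.

```python
def reduce_brightness(image, reduction):
    """Darken an image by scaling each 8-bit channel by (1 - reduction).

    `image` is a list of rows, each row a list of (R, G, B) tuples.
    Linear scaling is an assumption, not the paper's documented procedure.
    """
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be between 0 and 1")
    scale = 1.0 - reduction
    return [
        [tuple(int(round(channel * scale)) for channel in pixel) for pixel in row]
        for row in image
    ]


# A 1x2 toy image, purely illustrative.
image = [[(255, 200, 100), (0, 50, 150)]]
darkened = reduce_brightness(image, 0.8)
print(darkened)
```

With a reduction of 0.8, each channel keeps roughly a fifth of its original value, which gives a sense of how little signal remains in the darkest test images.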
Taxonomy
Topics: Industrial Vision Systems and Defect Detection · Image and Object Detection Techniques · Image Processing and 3D Reconstruction
Methods: You Only Look Once · Segment Anything Model
