Industrial Synthetic Segment Pre-training
Shinichi Mae, Ryousuke Yamada, Hirokatsu Kataoka

TL;DR
This paper introduces InsCore, a synthetic, fully annotated dataset for industrial instance segmentation that eliminates the need for real images or manual annotations, outperforming existing models including fine-tuned SAM on industrial datasets.
Contribution
The paper presents InsCore, a novel synthetic dataset generated via formula-driven supervised learning, enabling effective pre-training for industrial segmentation without real images or human annotations.
Findings
InsCore pre-trained models outperform COCO, ImageNet-21k, and fine-tuned SAM.
Achieves 6.2 points higher in segmentation performance on average.
Uses only 100k synthetic images, demonstrating high data efficiency.
Abstract
Pre-training on real-image datasets has been widely proven effective for improving instance segmentation. However, industrial applications face two key challenges: (1) legal and ethical restrictions, such as ImageNet's prohibition of commercial use, and (2) limited transferability due to the domain gap between web images and industrial imagery. Even recent vision foundation models, including the segment anything model (SAM), show notable performance degradation in industrial settings. These challenges raise critical questions: Can we build a vision foundation model for industrial applications without relying on real images or manual annotations? And can such models outperform even fine-tuned SAM on industrial datasets? To address these questions, we propose the Instance Core Segmentation Dataset (InsCore), a synthetic pre-training dataset based on formula-driven supervised learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsSegment Anything Model
