SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Risa Shinoda; Ryo Hayamizu; Kodai Nakashima; Nakamasa Inoue; Rio; Yokota; Hirokatsu Kataoka

arXiv:2309.17083·cs.CV·October 2, 2023·1 cites

SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Risa Shinoda, Ryo Hayamizu, Kodai Nakashima, Nakamasa Inoue, Rio, Yokota, Hirokatsu Kataoka

PDF

Open Access 1 Repo

TL;DR

SegRCDB introduces a formula-driven supervised learning approach for semantic segmentation pre-training, eliminating the need for manual annotations and outperforming traditional datasets like COCO-Stuff in key benchmarks.

Contribution

This paper presents the first application of formula-driven supervised learning for semantic segmentation pre-training, enabling large-scale dataset creation without manual labels.

Findings

01

SegRCDB pre-training outperforms COCO-Stuff on ADE-20k and Cityscapes.

02

Eliminates need for manual annotation in dataset creation.

03

Enhances efficiency in semantic segmentation pre-training.

Abstract

Pre-training is a strong strategy for enhancing visual models to efficiently train them with a limited number of labeled images. In semantic segmentation, creating annotation masks requires an intensive amount of labor and time, and therefore, a large-scale pre-training dataset with semantic labels is quite difficult to construct. Moreover, what matters in semantic segmentation pre-training has not been fully investigated. In this paper, we propose the Segmentation Radial Contour DataBase (SegRCDB), which for the first time applies formula-driven supervised learning for semantic segmentation. SegRCDB enables pre-training for semantic segmentation without real images or any manual semantic labels. SegRCDB is based on insights about what is important in pre-training for semantic segmentation and allows efficient pre-training. Pre-training with SegRCDB achieved higher mIoU than the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dahlian00/segrcdb
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning