Multi-dataset Pretraining: A Unified Model for Semantic Segmentation
Bowen Shi, Xiaopeng Zhang, Haohang Xu, Wenrui Dai, Junni Zou, Hongkai, Xiong, Qi Tian

TL;DR
This paper introduces a unified multi-dataset pretraining framework for semantic segmentation that leverages fragmented annotations across datasets, improving transferability and performance with less pretraining data.
Contribution
It proposes a novel pixel-to-prototype contrastive loss and pixel-to-class sparse coding to effectively utilize multiple datasets for pretraining.
Findings
Outperforms ImageNet pretraining by a significant margin.
Requires less than 10% of samples for pretraining.
Achieves superior performance on several benchmarks.
Abstract
Collecting annotated data for semantic segmentation is time-consuming and hard to scale up. In this paper, we for the first time propose a unified framework, termed as Multi-Dataset Pretraining, to take full advantage of the fragmented annotations of different datasets. The highlight is that the annotations from different domains can be efficiently reused and consistently boost performance for each specific domain. This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets regardless of their taxonomy labels, and followed by fine-tuning the pretrained model over specific dataset as usual. In order to better model the relationship among images and classes from different datasets, we extend the pixel level embeddings via cross dataset mixing and propose a pixel-to-class sparse coding strategy that explicitly models the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
