TL;DR
This paper introduces a novel end-to-end pre-training method for image segmentation models using pseudo semantic segmentation labels derived from explanations of classification models, significantly improving segmentation performance.
Contribution
The work proposes leveraging ensemble explanations from classifiers to generate pseudo labels, enabling effective end-to-end pre-training on classification datasets for segmentation tasks.
Findings
Pre-training with PSSL improves segmentation accuracy across multiple models.
The method enhances performance on diverse datasets like CityScapes and ADE20K.
Ensemble explanations reduce bias and improve pseudo label quality.
Abstract
While fine-tuning pre-trained networks has become a popular way to train image segmentation models, such backbone networks for image segmentation are frequently pre-trained using image classification source datasets, e.g., ImageNet. Though image classification datasets could provide the backbone networks with rich visual features and discriminative ability, they are incapable of fully pre-training the target model (i.e., backbone+segmentation modules) in an end-to-end manner. The segmentation modules are left to random initialization in the fine-tuning process due to the lack of segmentation labels in classification datasets. In our work, we propose a method that leverages Pseudo Semantic Segmentation Labels (PSSL), to enable the end-to-end pre-training for image segmentation models based on classification datasets. PSSL was inspired by the observation that the explanation results of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsClass-activation map · Local Interpretable Model-Agnostic Explanations
