Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation

Hugo Porta; Emanuele Dalsasso; Diego Marcos; Devis Tuia

arXiv:2409.09497·cs.CV·April 29, 2025

Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation

Hugo Porta, Emanuele Dalsasso, Diego Marcos, Devis Tuia

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-scale prototype learning approach for semantic segmentation that enhances interpretability by explicitly modeling diverse prototypical parts at multiple scales and grouping them sparsely.

Contribution

It proposes a novel multi-scale prototype layer and a sparse grouping mechanism to improve interpretability and understanding of multi-scale object representations in segmentation.

Findings

01

Increases model sparsity and interpretability.

02

Narrows performance gap with non-interpretable models.

03

Demonstrates effectiveness on Pascal VOC, Cityscapes, and ADE20K.

Abstract

Prototypical part learning is emerging as a promising approach for making semantic segmentation interpretable. The model selects real patches seen during training as prototypes and constructs the dense prediction map based on the similarity between parts of the test image and the prototypes. This improves interpretability since the user can inspect the link between the predicted output and the patterns learned by the model in terms of prototypical information. In this paper, we propose a method for interpretable semantic segmentation that leverages multi-scale image representation for prototypical part learning. First, we introduce a prototype layer that explicitly learns diverse prototypical parts at several scales, leading to multi-scale representations in the prototype activation output. Then, we propose a sparse grouping mechanism that produces multi-scale sparse groups of these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eceo-epfl/scaleprotoseg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques