Training-Free Fine-Grained Semantic Segmentations in Low Data Regimes: A FungiTastic Baseline

Sebastian Cavada; Francesco Pelosin; Lapo Faggi

arXiv:2605.22492 · cs.CV · May 22, 2026

TL;DR

This paper introduces a training-free, two-stage framework for fine-grained semantic segmentation in low-data regimes: SAM3 produces class-agnostic mushroom masks from macro-taxonomic prompts, and DINOv3 prototype matching assigns the fine-grained labels, which keeps segmentation cost low as the number of classes grows.

Contribution

The paper presents what is, to the authors' knowledge, the first training-free baseline for fine-grained semantic segmentation in low-data settings. It decouples segmentation from classification and applies a simple transformation of the feature space to improve prototype matching.

Findings

01. Effective in one-shot to few-hundred-shot regimes

02. Improves prototype classification with a feature space transformation

03. Scalable approach with low segmentation cost

Abstract

Fine-grained semantic segmentation requires both precise localization and discrimination between visually similar classes. In FungiTastic, this problem is further complicated by a long-tailed distribution and strong variation in image acquisition conditions. We propose a training-free two-stage framework that decouples segmentation from classification. SAM3 first produces class-agnostic mushroom masks using macro-taxonomic prompts, and DINOv3 then assigns fine-grained labels through prototype matching in the embedding space. To strengthen the classification stage, we apply a simple transformation of the DINOv3 feature space that improves prototype-based classification. Compared with class-specific prompting, our approach is more scalable and keeps the segmentation cost low. We report results from one-shot to few-hundred-shot regimes, providing, to the best of our knowledge, the first baseline for…
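
The classification stage described in the abstract, prototype matching in an embedding space, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each class prototype is the mean of L2-normalized support embeddings and that a query is assigned to the prototype with the highest cosine similarity. The function names (`l2_normalize`, `build_prototypes`, `classify`), the labels, and the toy 3-dimensional vectors are hypothetical stand-ins for real DINOv3 features, and the paper's feature space transformation is not reproduced because the abstract does not specify it.

```python
import math
from collections import defaultdict


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def build_prototypes(support):
    """Build one prototype per class from (label, embedding) pairs.

    Assumption: a prototype is the normalized mean of the normalized
    support embeddings of that class.
    """
    grouped = defaultdict(list)
    for label, emb in support:
        grouped[label].append(l2_normalize(emb))

    prototypes = {}
    for label, embs in grouped.items():
        dim = len(embs[0])
        mean = [sum(e[i] for e in embs) / len(embs) for i in range(dim)]
        prototypes[label] = l2_normalize(mean)
    return prototypes


def classify(embedding, prototypes):
    """Return the label and cosine similarity of the closest prototype."""
    query = l2_normalize(embedding)
    scores = {
        label: sum(q * p for q, p in zip(query, proto))
        for label, proto in prototypes.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


# Toy support set: two shots for one class, one shot for another.
support = [
    ("species_a", [1.0, 0.1, 0.0]),
    ("species_a", [0.9, 0.2, 0.0]),
    ("species_b", [0.0, 0.1, 1.0]),
]
prototypes = build_prototypes(support)

# Hypothetical embedding of one masked region.
label, score = classify([0.8, 0.3, 0.1], prototypes)
print(label, round(score, 3))
```

In a pipeline of the kind the abstract describes, the query embedding would come from the region covered by a class-agnostic mask, so adding a new species only requires adding support embeddings rather than issuing another segmentation prompt.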
