Zero-Shot Segmentation through Prototype-Guidance for Multi-Label Plant Species Identification
Luciano Araujo Dourado Filho, Almir Moreira da Silva Neto, Rodrigo Pereira David, Rodrigo Tripodi Calumby

TL;DR
This paper introduces a novel zero-shot segmentation method using prototype guidance and Vision Transformer models for multi-label plant species identification, achieving competitive results in the PlantCLEF 2025 challenge.
Contribution
It proposes a new approach combining class prototypes, K-Means clustering, and a customized ViT with frozen DinoV2 features for zero-shot multi-label plant species segmentation.
Findings
Achieved fifth place in PlantCLEF 2025 challenge with an F1 score of 0.33331.
Demonstrated domain adaptation from single-label to multi-label classification.
Method scored close to top-performing submissions, indicating competitive performance.
Abstract
This paper presents an approach developed to address the PlantClef 2025 challenge, which consists of a fine-grained multi-label species identification, over high-resolution images. Our solution focused on employing class prototypes obtained from the training dataset as a proxy guidance for training a segmentation Vision Transformer (ViT) on the test set images. To obtain these representations, the proposed method extracts features from training dataset images and create clusters, by applying K-Means, with equals to the number of classes in the dataset. The segmentation model is a customized narrow ViT, built by replacing the patch embedding layer with a frozen DinoV2, pre-trained on the training dataset for individual species classification. This model is trained to reconstruct the class prototypes of the training dataset from the test dataset images. We then use this model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Advanced Neural Network Applications · Cell Image Analysis Techniques
