GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen, Sotirios A. Tsaftaris, Mario Valerio Giuffrida

TL;DR
This paper introduces GMT, a Transformer-based model that incorporates leaf spatial distribution priors to improve leaf instance segmentation accuracy, outperforming existing methods on multiple datasets.
Contribution
The novel Guided Mask Transformer (GMT) effectively integrates spatial priors into segmentation, addressing challenges like occlusion and size variation in leaf images.
Findings
Outperforms state-of-the-art on three plant datasets.
Effectively handles occlusion and size variation.
Leverages spatial priors for improved segmentation.
Abstract
Leaf instance segmentation is a challenging multi-instance segmentation task, aiming to separate and delineate each leaf in an image of a plant. Accurate segmentation of each leaf is crucial for plant-related applications such as the fine-grained monitoring of plant growth and crop yield estimation. This task is challenging because of the high similarity (in shape and colour), great size variation, and heavy occlusions among leaf instances. Furthermore, the typically small size of annotated leaf datasets makes it more difficult to learn the distinctive features needed for precise segmentation. We hypothesise that the key to overcoming the these challenges lies in the specific spatial patterns of leaf distribution. In this paper, we propose the Guided Mask Transformer (GMT), which leverages and integrates leaf spatial distribution priors into a Transformer-based segmentor. These spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Leaf Properties and Growth Measurement · Greenhouse Technology and Climate Control
Methods+ ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881||How do I resolve a dispute on Expedia? · Sparse Evolutionary Training · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Attention Is All You Need
