Paddy pest image segmentation based on multiscale attention fusion VM-UNet
Yunlong Zhang, Yu Shao, Ting Zhang

TL;DR
This paper introduces a new neural network model for accurately segmenting paddy pests in natural environments, improving detection performance over existing methods.
Contribution
The novel MSAF-VMUNet integrates VSS and U-Net with multiscale attention fusion for efficient and accurate paddy pest segmentation.
Findings
MSAF-VMUNet achieves 79.17% precision in paddy pest segmentation, outperforming U-Net and VM-UNet.
The model effectively handles small pest detection, occlusion, and noise without increasing computational complexity.
MSAF-VMUNet is validated on the IP102 dataset's paddy pest subset for real-world agricultural applications.
Abstract
Precise paddy pest image segmentation (PPIS) in the real-time natural environments is an important and challenging research. Convolutional Neural Networks (CNNs) and Transformers are the most popular architectures for image segmentation, but they usually have limitations in modeling global dependencies and quadratic computational complexity, respectively. A multiscale attention fusion VM-UNet (MSAF-VMUNet) for PPIS is constructed. It integrates the long-range dependencies modeling ability of Visual State Space Model (VSS) and the precise positioning capability of U-Net with low computational complexity. In the model, multiscale VSS (MSVSS) block is used to capture the long-range contextual information, and improved attention fusion (IAF) module is designed for multi-level feature learning between Encoder and Decoder. Attention VSS module is introduced in the bottleneck layer to enable…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Advanced Neural Network Applications · Remote Sensing in Agriculture
