# CMNet: an asymmetric dual-branch network for accurate cotton segmentation

**Authors:** Gengrong Zhang, Halidanmu Abudukelimu, Mayilamu Musideke, Shuqin Wu, Abudukelimu Abulizi, Cuiqin Guo, Yajun Zhang

PMC · DOI: 10.3389/fpls.2025.1692647 · Frontiers in Plant Science · 2026-03-03

## TL;DR

This paper introduces CMNet, a new deep learning network that improves cotton segmentation accuracy in agriculture, helping with tasks like harvesting and yield estimation.

## Contribution

The novel CMNet integrates a 2D Selective Scan module and other enhancements to achieve better segmentation accuracy and efficiency.

## Key findings

- CMNet achieved Dice, mIoU, and Accuracy scores of 91.06%, 84.18%, and 98.10% on a cotton image dataset.
- The model outperformed existing methods while reducing parameters and computational complexity.
- Generalization experiments on other plant datasets showed strong adaptability for multi-crop segmentation.

## Abstract

In agricultural automation, precise cotton segmentation is a key step for tasks such as intelligent harvesting and yield estimation. However, in complex field environments, factors such as background interference and irregular target shapes severely affect segmentation accuracy. Existing deep learning methods offer certain advantages but still generally suffer from limitations including insufficient accuracy, over-segmentation, and misidentification. To address these challenges, this study proposes a novel dual-branch cotton segmentation network, Cotton-aware Mamba-enhanced UNet (CMNet), which optimizes the ParaTransCNN architecture by incorporating the 2D Selective Scan (SS2D) module to replace the original Transformer branch, effectively balancing the extraction of local details and global semantic information while reducing computational burden. To enhance the model’s perception of irregularly shaped cotton, a Deformable Convolutional Networks v1 (DCNv1) module is integrated into the Vision Mamba (VMamba) branch, further improving the delineation of target boundaries. Additionally, an Atrous Spatial Pyramid Pooling (ASPP) module is introduced at the end of the Convolutional Neural Network (CNN) branch to strengthen multi-scale feature representation. To optimize the fusion of channel and spatial information, the Spatial and Channel Squeeze-and-Excitation (scSE) attention mechanism replaces the original module, enhancing feature modeling capability. Experimental results on an in-field cotton image dataset demonstrate that CMNet outperforms existing mainstream methods, achieving Dice, mIoU, and Accuracy of 91.06%, 84.18%, and 98.10%, respectively, while reducing parameter count and computational complexity, thus exhibiting excellent performance. Furthermore, generalization experiments on multiple other plant datasets also achieved outstanding results, validating the model’s adaptability and potential for broader applications in multi-crop segmentation tasks, providing valuable insights for smart agriculture segmentation research. The source code and dataset of this work are publicly available at https://github.com/halidanmu/CMNet.git.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12992305/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12992305/full.md

## References

73 references — full list in the complete paper: https://tomesphere.com/paper/PMC12992305/full.md

---
Source: https://tomesphere.com/paper/PMC12992305