Real-time Strawberry Detection Based on Improved YOLOv5s Architecture for Robotic Harvesting in open-field environment
Zixuan He (1)(2), Salik Ram Khanal (1)(2), Xin Zhang (3), Manoj Karkee, (1)(2), Qin Zhang (1)(2) ((1) Center for Precision, Automated Agricultural, Systems, Washington State University, (2) Department of Biological Systems, Engineering, Washington State University

TL;DR
This paper introduces an improved YOLOv5s-based model, YOLOv5s-Straw, for real-time strawberry detection in open-field environments, achieving higher accuracy and faster inference suitable for robotic harvesting.
Contribution
The study develops a modified YOLOv5s architecture with enhanced modules and pooling strategies, demonstrating superior detection accuracy and speed over existing models for outdoor strawberry detection.
Findings
Achieved 80.3% mean average precision, outperforming other models.
Model inference speed of 18ms per image supports real-time application.
Higher accuracy in all maturity classes compared to baseline models.
Abstract
This study proposed a YOLOv5-based custom object detection model to detect strawberries in an outdoor environment. The original architecture of the YOLOv5s was modified by replacing the C3 module with the C2f module in the backbone network, which provided a better feature gradient flow. Secondly, the Spatial Pyramid Pooling Fast in the final layer of the backbone network of YOLOv5s was combined with Cross Stage Partial Net to improve the generalization ability over the strawberry dataset in this study. The proposed architecture was named YOLOv5s-Straw. The RGB images dataset of the strawberry canopy with three maturity classes (immature, nearly mature, and mature) was collected in open-field environment and augmented through a series of operations including brightness reduction, brightness increase, and noise adding. To verify the superiority of the proposed method for strawberry…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Plant Disease Management Techniques · Plant Virus Research Studies
MethodsSpatial Pyramid Pooling · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
