Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm
Shiyu Liu, Congliang Zhou, Won Suk Lee

TL;DR
This paper enhances strawberry detection and tracking by pruning YOLOv7 detection heads for speed and accuracy, and introducing an information-based tracking algorithm that outperforms traditional methods in precision and reliability.
Contribution
It proposes an optimized pruning of YOLOv7 detection heads and an innovative IBTA for improved strawberry flower and fruit detection and tracking.
Findings
Pruning YOLOv7-tiny achieves 163.9 fps and 89.1% accuracy.
IBTA outperforms centroid tracking in MOTA and MOTP metrics.
IBTA shows better IDF1, IDR, IDP, MT, and ID metrics than CTA.
Abstract
The strawberry industry yields significant economic benefits for Florida, yet the process of monitoring strawberry growth and yield is labor-intensive and costly. The development of machine learning-based detection and tracking methodologies has been used for helping automated monitoring and prediction of strawberry yield, still, enhancement has been limited as previous studies only applied the deep learning method for flower and fruit detection, which did not consider the unique characteristics of image datasets collected by the machine vision system. This study proposed an optimal pruning of detection heads of the deep learning model (YOLOv7 and its variants) that could achieve fast and precise strawberry flower, immature fruit, and mature fruit detection. Thereafter, an enhanced object tracking algorithm, which is called the Information Based Tracking Algorithm (IBTA) utilized the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI
MethodsPruning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
