WeedVision: Multi-Stage Growth and Classification of Weeds using DETR and RetinaNet for Precision Agriculture
Taminul Islam, Toqi Tahamid Sarker, Khaled R Ahmed, Cristiana Bernardi, Rankrape, Karla Gage

TL;DR
This paper develops and compares advanced object detection models, DETR and RetinaNet, for identifying and classifying 16 weed species across growth stages, demonstrating RetinaNet's superior accuracy and speed for real-time agricultural weed management.
Contribution
It introduces a large, labeled dataset and applies state-of-the-art detection models to improve weed identification across growth stages, highlighting RetinaNet's effectiveness for real-time applications.
Findings
RetinaNet achieved a mean Average Precision of 0.904 on test data.
RetinaNet outperformed DETR in inference speed at 7.28 FPS.
Both models showed increased accuracy as weeds matured.
Abstract
Weed management remains a critical challenge in agriculture, where weeds compete with crops for essential resources, leading to significant yield losses. Accurate detection of weeds at various growth stages is crucial for effective management yet challenging for farmers, as it requires identifying different species at multiple growth phases. This research addresses these challenges by utilizing advanced object detection models, specifically, the Detection Transformer (DETR) with a ResNet50 backbone and RetinaNet with a ResNeXt101 backbone, to identify and classify 16 weed species of economic concern across 174 classes, spanning their 11 weeks growth stages from seedling to maturity. A robust dataset comprising 203,567 images was developed, meticulously labeled by species and growth stage. The models were rigorously trained and evaluated, with RetinaNet demonstrating superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
