Spatial Transformer Network YOLO Model for Agricultural Object Detection
Yash Zambre, Ekdev Rajkitkul, Akshatha Mohan, Joshua Peeples

TL;DR
This paper introduces an enhanced YOLO model integrated with spatial transformer networks to improve agricultural object detection, especially in cluttered or occluded scenes, demonstrating significant performance gains on benchmark and real-world datasets.
Contribution
The paper presents a novel STN-YOLO model that incorporates spatial transformer networks into YOLO to improve focus on relevant image regions and spatial invariance for better detection accuracy.
Findings
Improved detection accuracy on agricultural datasets
Enhanced robustness to spatial transformations
Effective focus on important image regions
Abstract
Object detection plays a crucial role in the field of computer vision by autonomously locating and identifying objects of interest. The You Only Look Once (YOLO) model is an effective single-shot detector. However, YOLO faces challenges in cluttered or partially occluded scenes and can struggle with small, low-contrast objects. We propose a new method that integrates spatial transformer networks (STNs) into YOLO to improve performance. The proposed STN-YOLO aims to enhance the model's effectiveness by focusing on important areas of the image and improving the spatial invariance of the model before the detection process. Our proposed method improved object detection performance both qualitatively and quantitatively. We explore the impact of different localization networks within the STN module as well as the robustness of the model across different spatial transformations. We apply the…
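To make the spatial transformer idea in the abstract concrete, the following is a minimal sketch of the two fixed stages of an STN: the grid generator and the bilinear sampler. It is an illustration only, not the authors' implementation. The function names `affine_grid` and `bilinear_sample` are hypothetical, the 2x3 affine matrix `theta` is set by hand here (in an STN it would be predicted by the localization network), and the normalized [-1, 1] coordinate convention with zero padding is an assumption.

```python
import math
from typing import List, Sequence, Tuple

Image = List[List[float]]
Grid = List[List[Tuple[float, float]]]


def affine_grid(theta: Sequence[Sequence[float]], height: int, width: int) -> Grid:
    """Grid generator: for every output pixel, compute the source location
    (x_s, y_s) in normalized [-1, 1] coordinates using a 2x3 affine matrix."""
    grid: Grid = []
    for i in range(height):
        y = -1.0 + 2.0 * i / (height - 1) if height > 1 else 0.0
        row = []
        for j in range(width):
            x = -1.0 + 2.0 * j / (width - 1) if width > 1 else 0.0
            xs = theta[0][0] * x + theta[0][1] * y + theta[0][2]
            ys = theta[1][0] * x + theta[1][1] * y + theta[1][2]
            row.append((xs, ys))
        grid.append(row)
    return grid


def bilinear_sample(image: Image, grid: Grid) -> Image:
    """Sampler: read the image at each source location with bilinear
    interpolation; locations outside the image contribute zero."""
    h, w = len(image), len(image[0])

    def pixel(r: int, c: int) -> float:
        return image[r][c] if 0 <= r < h and 0 <= c < w else 0.0

    out: Image = []
    for row in grid:
        out_row = []
        for xs, ys in row:
            # Convert normalized coordinates back to pixel coordinates.
            px = (xs + 1.0) * (w - 1) / 2.0
            py = (ys + 1.0) * (h - 1) / 2.0
            x0, y0 = math.floor(px), math.floor(py)
            dx, dy = px - x0, py - y0
            value = (
                pixel(y0, x0) * (1 - dx) * (1 - dy)
                + pixel(y0, x0 + 1) * dx * (1 - dy)
                + pixel(y0 + 1, x0) * (1 - dx) * dy
                + pixel(y0 + 1, x0 + 1) * dx * dy
            )
            out_row.append(value)
        out.append(out_row)
    return out


# Hand-picked transforms standing in for a localization network's output.
image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
shift = [[1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]  # sample one pixel to the right

unchanged = bilinear_sample(image, affine_grid(identity, 3, 3))
shifted = bilinear_sample(image, affine_grid(shift, 3, 3))
```

With the identity matrix the sampler reproduces the input, and with the translation it reads each output pixel from one column to the right, so the content moves left and the vacated column is zero-filled. Because both stages are differentiable in `theta`, a localization network placed in front of a detector can in principle learn the transform end to end, which is the property the STN-YOLO description relies on.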
Taxonomy
Topics: Food Supply Chain Traceability · Remote Sensing and Land Use · Smart Agriculture and AI
Methods: Spatial Transformer
