Explicitly incorporating spatial information to recurrent networks for agriculture
Claus Smitt, Michael Halstead, Alireza Ahmadi, and Chris McCool

TL;DR
This paper introduces a novel method that explicitly incorporates spatial and temporal information into recurrent neural networks to enhance agricultural image classification, leveraging RGB-D data and odometry for improved accuracy and robustness.
Contribution
It presents a new approach combining spatial-temporal data fusion with recurrent models, significantly improving agricultural image segmentation performance.
Findings
Achieved 4.7% IoU improvement in crop-weed segmentation
Achieved 2.6% IoU improvement in fruit segmentation
Demonstrated robustness to variable framerates and odometry errors
Abstract
In agriculture, the majority of vision systems perform still image classification. Yet, recent work has highlighted the potential of spatial and temporal cues as a rich source of information to improve the classification performance. In this paper, we propose novel approaches to explicitly capture both spatial and temporal information to improve the classification of deep convolutional neural networks. We leverage available RGB-D images and robot odometry to perform inter-frame feature map spatial registration. This information is then fused within recurrent deep learnt models, to improve their accuracy and robustness. We demonstrate that this can considerably improve the classification performance with our best performing spatial-temporal model (ST-Atte) achieving absolute performance improvements for intersection-over-union (IoU[%]) of 4.7 for crop-weed segmentation and 2.6 for fruit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Advanced Image and Video Retrieval Techniques · Remote Sensing and LiDAR Applications
MethodsSpatial & Temporal Attention · Diffusion-Convolutional Neural Networks · Gated Recurrent Unit
