An Artificial Intelligence System for Combined Fruit Detection and Georeferencing, Using RTK-Based Perspective Projection in Drone Imagery
Angus Baird, Stefano Giani

TL;DR
This paper introduces an AI system using Faster R-CNN for detecting and georeferencing apples in drone imagery, employing RTK data and a novel hyperparameter tuning approach to improve accuracy in orchard monitoring.
Contribution
It presents a novel preprocessing stage for individual tree cropping and a k-means based anchor box design for Faster R-CNN, enhancing apple detection accuracy in aerial imagery.
Findings
Achieved a calibrated mAP of 0.7627 for apple detection.
Optimal hyperparameters include 225 box proposals, learning rate 0.001, and ResNet 101 backbone.
K-means clustering significantly improved detection performance.
Abstract
This work presents an Artificial Intelligence (AI) system, based on the Faster Region-Based Convolution Neural Network (Faster R-CNN) framework, which detects and counts apples from oblique, aerial drone imagery of giant commercial orchards. To reduce computational cost, a novel precursory stage to the network is designed to preprocess raw imagery into cropped images of individual trees. Unique geospatial identifiers are allocated to these using the perspective projection model. This employs Real-Time Kinematic (RTK) data, Digital Terrain and Surface Models (DTM and DSM), as well as internal and external camera parameters. The bulk of experiments however focus on tuning hyperparameters in the detection network itself. Apples which are on trees and apples which are on the ground are treated as separate classes. A mean Average Precision (mAP) metric, calibrated by the size of the two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Remote Sensing and LiDAR Applications · Remote Sensing in Agriculture
MethodsAverage Pooling · Region Proposal Network · 1x1 Convolution · Residual Connection · Convolution · Max Pooling · Softmax · Bottleneck Residual Block · Kaiming Initialization · *Communicated@Fast*How Do I Communicate to Expedia?
