TL;DR
This paper presents a selective tile processing method with attention and memory mechanisms to improve small object detection in UAV imagery, balancing accuracy and computational efficiency.
Contribution
It introduces a novel tile-based detection approach with attention and memory mechanisms to enhance small object detection in UAV images.
Findings
Selective tile processing improves detection accuracy for small objects.
The method maintains real-time performance suitable for UAVs.
Enhanced detection results compared to standard single-image CNNs.
Abstract
Many applications utilizing Unmanned Aerial Vehicles (UAVs) require the use of computer vision algorithms to analyze the information captured from their on-board camera. Recent advances in deep learning have made it possible to use single-shot Convolutional Neural Network (CNN) detection algorithms that process the input image to detect various objects of interest. To keep the computational demands low these neural networks typically operate on small image sizes which, however, makes it difficult to detect small objects. This is further emphasized when considering UAVs equipped with cameras where due to the viewing range, objects tend to appear relatively small. This paper therefore, explores the trade-offs involved when maintaining the resolution of the objects of interest by extracting smaller patches (tiles) from the larger input image and processing them using a neural network.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
