Foveated image processing for faster object detection and recognition in   embedded systems using deep convolutional neural networks

Uziel Jaramillo-Avila; Sean R. Anderson

arXiv:1908.09000·cs.CV·August 27, 2019

Foveated image processing for faster object detection and recognition in embedded systems using deep convolutional neural networks

Uziel Jaramillo-Avila, Sean R. Anderson

PDF

TL;DR

This paper introduces foveated image sampling to accelerate CNN-based object detection on embedded systems, achieving up to 4x faster processing with minimal recall loss in the foveal region.

Contribution

It proposes using foveated sampling to reduce image size and computational load for CNNs on embedded devices, improving speed with limited accuracy trade-offs.

Findings

01

Achieved 4x increase in frame rate on embedded GPU

02

Foveated sampling maintained 92% of baseline recall

03

Significant speed-up with minimal recall loss in the foveal region

Abstract

Object detection and recognition algorithms using deep convolutional neural networks (CNNs) tend to be computationally intensive to implement. This presents a particular challenge for embedded systems, such as mobile robots, where the computational resources tend to be far less than for workstations. As an alternative to standard, uniformly sampled images, we propose the use of foveated image sampling here to reduce the size of images, which are faster to process in a CNN due to the reduced number of convolution operations. We evaluate object detection and recognition on the Microsoft COCO database, using foveated image sampling at different image sizes, ranging from 416x416 to 96x96 pixels, on an embedded GPU -- an NVIDIA Jetson TX2 with 256 CUDA cores. The results show that it is possible to achieve a 4x speed-up in frame rates, from 3.59 FPS to 15.24 FPS, using 416x416 and 128x128…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution