Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in   Image Classification

Yulin Wang; Kangchen Lv; Rui Huang; Shiji Song; Le Yang; Gao Huang

arXiv:2010.05300·cs.CV·October 13, 2020·28 cites

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification

Yulin Wang, Kangchen Lv, Rui Huang, Shiji Song, Le Yang, Gao Huang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a dynamic, reinforcement learning-based framework that selectively processes image regions to reduce computational costs in CNN-based image classification, maintaining accuracy while improving efficiency.

Contribution

It presents a flexible, general approach that adapts inference by selecting relevant image regions, compatible with various lightweight CNN architectures.

Findings

01

Reduces MobileNet-V3 latency by 20% on iPhone XS Max

02

Improves computational efficiency across multiple CNN models

03

Maintains accuracy while decreasing redundant computation

Abstract

The accuracy of deep convolutional neural networks (CNNs) generally improves when fueled with high resolution images. However, this often comes at a high computational cost and high memory footprint. Inspired by the fact that not all regions in an image are task-relevant, we propose a novel framework that performs efficient image classification by processing a sequence of relatively small inputs, which are strategically selected from the original image with reinforcement learning. Such a dynamic decision process naturally facilitates adaptive inference at test time, i.e., it can be terminated once the model is sufficiently confident about its prediction and thus avoids further redundant computation. Notably, our framework is general and flexible as it is compatible with most of the state-of-the-art light-weighted CNNs (such as MobileNets, EfficientNets and RegNets), which can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blackfeather-wang/GFNet-Pytorch
pytorchOfficial

Videos

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications