AntiDote: Attention-based Dynamic Optimization for Neural Network   Runtime Efficiency

Fuxun Yu; Chenchen Liu; Di Wang; Yanzhi Wang; Xiang Chen

arXiv:2008.06543·cs.CV·August 18, 2020

AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency

Fuxun Yu, Chenchen Liu, Di Wang, Yanzhi Wang, Xiang Chen

PDF

Open Access

TL;DR

AntiDote introduces a dynamic CNN optimization framework leveraging attention mechanisms to adaptively prune features during training and testing, significantly reducing FLOPs while maintaining accuracy.

Contribution

This work presents a novel dynamic optimization framework for CNNs that considers input-dependent feature importance, outperforming static pruning methods.

Findings

01

Achieves 37.4% to 54.5% FLOPs reduction

02

Maintains high accuracy with aggressive feature pruning

03

Demonstrates effectiveness across various test networks

Abstract

Convolutional Neural Networks (CNNs) achieved great cognitive performance at the expense of considerable computation load. To relieve the computation load, many optimization works are developed to reduce the model redundancy by identifying and removing insignificant model components, such as weight sparsity and filter pruning. However, these works only evaluate model components' static significance with internal parameter information, ignoring their dynamic interaction with external inputs. With per-input feature activation, the model component significance can dynamically change, and thus the static methods can only achieve sub-optimal results. Therefore, we propose a dynamic CNN optimization framework in this work. Based on the neural network attention mechanism, we propose a comprehensive dynamic optimization framework including (1) testing-phase channel and column feature map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsPruning