AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency
Fuxun Yu, Chenchen Liu, Di Wang, Yanzhi Wang, Xiang Chen

TL;DR
AntiDote introduces a dynamic CNN optimization framework leveraging attention mechanisms to adaptively prune features during training and testing, significantly reducing FLOPs while maintaining accuracy.
Contribution
This work presents a novel dynamic optimization framework for CNNs that considers input-dependent feature importance, outperforming static pruning methods.
Findings
Achieves 37.4% to 54.5% FLOPs reduction
Maintains high accuracy with aggressive feature pruning
Demonstrates effectiveness across various test networks
Abstract
Convolutional Neural Networks (CNNs) achieved great cognitive performance at the expense of considerable computation load. To relieve the computation load, many optimization works are developed to reduce the model redundancy by identifying and removing insignificant model components, such as weight sparsity and filter pruning. However, these works only evaluate model components' static significance with internal parameter information, ignoring their dynamic interaction with external inputs. With per-input feature activation, the model component significance can dynamically change, and thus the static methods can only achieve sub-optimal results. Therefore, we propose a dynamic CNN optimization framework in this work. Based on the neural network attention mechanism, we propose a comprehensive dynamic optimization framework including (1) testing-phase channel and column feature map…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
MethodsPruning
