Optimal channel selection with discrete QCQP
Yeonwoo Jeong, Deokjae Lee, Gaon An, Changyong Son, Hyun Oh Song

TL;DR
This paper introduces a novel channel pruning method using discrete QCQP that optimally selects channels in neural networks, ensuring resource constraints are met and inactive weights are avoided, leading to improved pruning performance.
Contribution
The paper proposes a discrete QCQP-based channel selection method that guarantees resource constraint satisfaction and prevents inactive weights, extending to non-sequential connections and inference time modeling.
Findings
Outperforms existing channel pruning methods on CIFAR-10 and ImageNet.
Guarantees resource constraints are tightly met in FLOPs, memory, and network size.
Accurately models inference time for better resource management.
Abstract
Reducing the high computational cost of large convolutional neural networks is crucial when deploying the networks to resource-constrained environments. We first show the greedy approach of recent channel pruning methods ignores the inherent quadratic coupling between channels in the neighboring layers and cannot safely remove inactive weights during the pruning procedure. Furthermore, due to these inactive weights, the greedy methods cannot guarantee to satisfy the given resource constraints and deviate with the true objective. In this regard, we propose a novel channel selection method that optimally selects channels via discrete QCQP, which provably prevents any inactive weights and guarantees to meet the resource constraints tightly in terms of FLOPs, memory usage, and network size. We also propose a quadratic model that accurately estimates the actual inference time of the pruned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques
MethodsPruning
