Locally Free Weight Sharing for Network Width Search
Xiu Su, Shan You, Tao Huang, Fei Wang, Chen Qian, Changshui Zhang,, Chang Xu

TL;DR
This paper introduces CafeNet, a novel locally free weight sharing strategy for neural network width search, which improves performance evaluation and search efficiency by allowing more flexible weight sharing and reducing the search space.
Contribution
The paper proposes a locally free weight sharing method (CafeNet) and FLOPs-sensitive bins to enhance network width search accuracy and efficiency, outperforming existing methods.
Findings
Improved performance on ImageNet, CIFAR-10, CelebA, and MS COCO datasets.
Boosted EfficientNet-B0 accuracy by 0.41%.
Demonstrated superiority over state-of-the-art baselines.
Abstract
Searching for network width is an effective way to slim deep neural networks with hardware budgets. With this aim, a one-shot supernet is usually leveraged as a performance evaluator to rank the performance \wrt~different width. Nevertheless, current methods mainly follow a manually fixed weight sharing pattern, which is limited to distinguish the performance gap of different width. In this paper, to better evaluate each width, we propose a locally free weight sharing strategy (CafeNet) accordingly. In CafeNet, weights are more freely shared, and each width is jointly indicated by its base channels and free channels, where free channels are supposed to loCAte FrEely in a local zone to better represent each width. Besides, we propose to further reduce the search space by leveraging our introduced FLOPs-sensitive bins. As a result, our CafeNet can be trained stochastically and get…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
