BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
Changlin Li, Tao Tang, Guangrun Wang, Jiefeng Peng, Bing Wang, Xiaodan, Liang, Xiaojun Chang

TL;DR
BossNAS introduces a self-supervised neural architecture search method that effectively explores hybrid CNN-transformer architectures by block-wise training, achieving state-of-the-art accuracy on ImageNet with improved architecture rating correlations.
Contribution
The paper proposes a novel unsupervised NAS approach with ensemble bootstrapping for block-wise training, enabling efficient search in hybrid CNN-transformer spaces.
Findings
Achieves 82.5% ImageNet accuracy with BossNet-T.
Surpasses EfficientNet in accuracy with similar compute.
Outperforms state-of-the-art NAS methods in architecture rating correlation.
Abstract
A myriad of recent breakthroughs in hand-crafted neural architectures for visual recognition have highlighted the urgent need to explore hybrid architectures consisting of diversified building blocks. Meanwhile, neural architecture search methods are surging with an expectation to reduce human efforts. However, whether NAS methods can efficiently and effectively handle diversified search spaces with disparate candidates (e.g. CNNs and transformers) is still an open question. In this work, we present Block-wisely Self-supervised Neural Architecture Search (BossNAS), an unsupervised NAS method that addresses the problem of inaccurate architecture rating caused by large weight-sharing space and biased supervision in previous methods. More specifically, we factorize the search space into blocks and utilize a novel self-supervised training scheme, named ensemble bootstrapping, to train each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsDepthwise Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · Squeeze-and-Excitation Block · Inverted Residual Block · Dense Connections · Dropout · Convolution
