Joint Neural Architecture Search and Quantization
Yukang Chen, Gaofeng Meng, Qian Zhang, Xinbang Zhang, Liangchen Song,, Shiming Xiang, Chunhong Pan

TL;DR
This paper introduces JASQ, a unified framework that jointly searches for neural network architectures and quantization policies using evolutionary algorithms, resulting in more efficient models with higher accuracy for mobile devices.
Contribution
It presents the first integrated approach combining neural architecture search and quantization policy optimization into a single framework, improving model performance and compression.
Findings
Outperforms methods that search only for architecture or quantization.
Achieves higher accuracy than float models, e.g., 1.02% higher on MobileNet-v1.
Produces compact models like JASQNet with 2.97% error rate on CIFAR-10.
Abstract
Designing neural architectures is a fundamental step in deep learning applications. As a partner technique, model compression on neural networks has been widely investigated to gear the needs that the deep learning algorithms could be run with the limited computation resources on mobile devices. Currently, both the tasks of architecture design and model compression require expertise tricks and tedious trials. In this paper, we integrate these two tasks into one unified framework, which enables the joint architecture search with quantization (compression) policies for neural networks. This method is named JASQ. Here our goal is to automatically find a compact neural network model with high performance that is suitable for mobile devices. Technically, a multi-objective evolutionary search algorithm is introduced to search the models under the balance between model size and performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Advanced Image and Video Retrieval Techniques
