G-ICSO-NAS: Shifting Gears between Gradient and Swarm for Robust Neural Architecture Search
Xingbang Du, Enzhi Zhang, Rui Zhong, Yang Cao, Masaharu Munetomo

TL;DR
G-ICSO-NAS introduces a hybrid neural architecture search framework combining swarm optimization and gradient methods, achieving high accuracy with minimal computational cost.
Contribution
It proposes a three-stage optimization strategy that effectively combines global swarm-based search with efficient gradient-based fine-tuning.
Findings
Achieves 97.46% accuracy on CIFAR-10 with 0.15 GPU-Days.
Records 83.1% on CIFAR-100 and 75.02% on ImageNet.
Outperforms state-of-the-art on NAS-Bench-201 benchmark.
Abstract
Neural Architecture Search (NAS) has become a pivotal technique in automated machine learning. Evolutionary Algorithm (EA)-based methods demonstrate superior search quality but suffer from prohibitive computational costs, while gradient-based approaches like DARTS offer high efficiency but are prone to premature convergence and performance collapse. To bridge this gap, we propose G-ICSO-NAS, a hybrid framework implementing a three-stage optimization strategy. The Warm-up Phase pre-trains supernet weights () via differentiable methods while architecture parameters () remain frozen. The Exploration Phase adopts a hybrid co-optimization mechanism: an Improved Competitive Swarm Optimizer (ICSO) with diversity-aware fitness navigates the architecture space to update , while gradient descent concurrently updates . The Stability Phase employs fine-grained gradient-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
