Conditional super-network weights
Kevin Alexander Laube, Andreas Zell

TL;DR
This paper introduces conditional weights into super-networks for neural architecture search, improving architecture selection accuracy with minimal additional resource overhead, especially in sequential network designs.
Contribution
It extends super-networks with conditional weights based on operation combinations, enhancing their ability to model dependencies and improve NAS performance.
Findings
Improved architecture selection in NAS-Bench 201 and NAS-Bench-Macro.
Nearly negligible resource overhead for sequential networks.
Enhanced modeling of operation dependencies in super-networks.
Abstract
Modern Neural Architecture Search methods have repeatedly broken state-of-the-art results for several disciplines. The super-network, a central component of many such methods, enables quick estimates of accuracy or loss statistics for any architecture in the search space. They incorporate the network weights of all candidate architectures and can thus approximate specific ones by applying the respective operations. However, this design ignores potential dependencies between consecutive operations. We extend super-networks with conditional weights that depend on combinations of choices and analyze their effect. Experiments in NAS-Bench 201 and NAS-Bench-Macro-based search spaces show improvements in the architecture selection and that the resource overhead is nearly negligible for sequential network designs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM
