einspace: Searching for Neural Architectures from Fundamental Operations
Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou,, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

TL;DR
This paper introduces einspace, a versatile and expressive search space for neural architecture search based on a probabilistic grammar, enabling the discovery of diverse and potentially transformative neural architectures.
Contribution
We propose einspace, a novel search space built from a probabilistic grammar that supports diverse, complex architectures including convolutions and attention, advancing NAS capabilities.
Findings
Found competitive architectures from scratch using einspace.
Achieved significant improvements when initializing with strong baselines.
Demonstrated versatility across diverse NAS datasets.
Abstract
Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shifts, we need a novel expressive search space design which is built from more fundamental operations. To this end, we introduce einspace, a search space based on a parameterised probabilistic context-free grammar. Our space is versatile, supporting architectures of various sizes and complexities, while also containing diverse network operations which allow it to model convolutions, attention components and more. It contains many existing competitive architectures, and provides flexibility for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms
