Towards Code-Oriented LM Embeddings for Surrogate-Assisted Neural Architecture Search
Pranav Somu, Advay Balakrishnan, Stepan Kravtsov, Aaron McDaniel, Jason Zutty

TL;DR
This paper introduces COLE, a low-cost, code-based embedding method leveraging language models to improve neural architecture search efficiency without extensive fine-tuning.
Contribution
It proposes a novel code-oriented embedding strategy using off-the-shelf language models for surrogate modeling in NAS, reducing overhead and enhancing search performance.
Findings
Raw code inputs outperform other text encodings in predictive accuracy.
COLE improves surrogate-assisted search efficiency in NAS-Bench-201.
Replacing structural encodings with COLE reduces evaluation budget by 34%.
Abstract
Developing effective surrogates (performance predictors) for Neural Architecture Search (NAS) typically requires expensive fine-tuning or the engineering of complex representations. We propose a low-cost embedding strategy that leverages the inductive bias of Language Models (LMs) to eliminate these overheads. By representing architectures as PyTorch class definition text, we demonstrate that off-the-shelf LMs act as competitive feature extractors without NAS-specialized fine-tuning. The final predictor is constructed by passing the extracted Code-Oriented LM Embeddings (COLE) through a lightweight regression head. We also investigate strategies to improve embedding quality and utilization. Our experiments on the NAS-Bench-201 and einspace search spaces reveal that raw code inputs yield higher predictive performance than other text-based encodings (e.g., ONNX-to-text encodings) when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
