Towards Code-Oriented LM Embeddings for Surrogate-Assisted Neural Architecture Search

Pranav Somu; Advay Balakrishnan; Stepan Kravtsov; Aaron McDaniel; Jason Zutty

arXiv:2605.15649·cs.LG·May 18, 2026

Towards Code-Oriented LM Embeddings for Surrogate-Assisted Neural Architecture Search

Pranav Somu, Advay Balakrishnan, Stepan Kravtsov, Aaron McDaniel, Jason Zutty

PDF

TL;DR

This paper introduces COLE, a low-cost, code-based embedding method leveraging language models to improve neural architecture search efficiency without extensive fine-tuning.

Contribution

It proposes a novel code-oriented embedding strategy using off-the-shelf language models for surrogate modeling in NAS, reducing overhead and enhancing search performance.

Findings

01

Raw code inputs outperform other text encodings in predictive accuracy.

02

COLE improves surrogate-assisted search efficiency in NAS-Bench-201.

03

Replacing structural encodings with COLE reduces evaluation budget by 34%.

Abstract

Developing effective surrogates (performance predictors) for Neural Architecture Search (NAS) typically requires expensive fine-tuning or the engineering of complex representations. We propose a low-cost embedding strategy that leverages the inductive bias of Language Models (LMs) to eliminate these overheads. By representing architectures as PyTorch class definition text, we demonstrate that off-the-shelf LMs act as competitive feature extractors without NAS-specialized fine-tuning. The final predictor is constructed by passing the extracted Code-Oriented LM Embeddings (COLE) through a lightweight regression head. We also investigate strategies to improve embedding quality and utilization. Our experiments on the NAS-Bench-201 and einspace search spaces reveal that raw code inputs yield higher predictive performance than other text-based encodings (e.g., ONNX-to-text encodings) when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.