Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design
Raghuvir Duvvuri, Chandini Vysyaraju, Avi Goyal, Dmitry Ignatov, Radu Timofte

TL;DR
This paper advances LLM-based neural network architecture generation for computer vision by systematically studying prompt examples and introducing a fast deduplication validation method, enabling efficient and diverse architecture search.
Contribution
It introduces Few-Shot Architecture Prompting with optimal example count and a lightweight hash validation technique, improving diversity, speed, and evaluation rigor in LLM-driven architecture design.
Findings
Using 3 examples in prompting balances diversity and focus.
Whitespace-Normalized Hash Validation speeds up deduplication by 100x.
Generated 1,900 unique architectures across seven benchmarks.
Abstract
Automated neural network architecture design remains a significant challenge in computer vision. Task diversity and computational constraints require both effective architectures and efficient search methods. Large Language Models (LLMs) present a promising alternative to computationally intensive Neural Architecture Search (NAS), but their application to architecture generation in computer vision has not been systematically studied, particularly regarding prompt engineering and validation strategies. Building on the task-agnostic NNGPT/LEMUR framework, this work introduces and validates two key contributions for computer vision. First, we present Few-Shot Architecture Prompting (FSAP), the first systematic study of the number of supporting examples (n = 1, 2, 3, 4, 5, 6) for LLM-based architecture generation. We find that using n = 3 examples best balances architectural diversity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
