LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis
Tao Zhang, Rui Ma, Shuotao Xu, Yongqiang Xiong, Peng Cheng

TL;DR
LUMINA is an LLM-guided framework that significantly improves GPU architecture exploration efficiency and quality by automating design space analysis and optimization, reducing exploration steps and outperforming traditional methods.
Contribution
The paper introduces LUMINA, a novel LLM-driven GPU design exploration framework that automates bottleneck analysis and enhances search efficiency in large design spaces.
Findings
LUMINA finds 6 better GPU designs in 20 steps within a 4.7 million sample space.
LUMINA achieves 17.5x higher exploration efficiency than traditional methods.
LUMINA produces 32.9% better Pareto hypervolume compared to ML baselines.
Abstract
GPU design space exploration (DSE) for modern AI workloads, such as Large-Language Model (LLM) inference, is challenging because of GPUs' vast, multi-modal design spaces, high simulation costs, and complex design optimization objectives (e.g. performance, power and area trade-offs). Existing automated DSE methods are often prohibitively expensive, either requiring an excessive number of exploration samples or depending on intricate, manually crafted analyses of interdependent critical paths guided by human heuristics. We present LUMINA, an LLM-driven GPU architecture exploration framework that leverage AI to enhance the DSE efficiency and efficacy for GPUs. LUMINA extracts architectural knowledge from simulator code and performs sensitivity studies to automatically compose DSE rules,which are auto-corrected during exploration. A core component of LUMINA is a DSE Benchmark that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Advanced Neural Network Applications
