LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow

Kaiyan Chang; Wenlong Zhu; Shengwen Liang; Huawei Li; Ying Wang

arXiv:2508.17826·cs.AR·August 26, 2025

LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow

Kaiyan Chang, Wenlong Zhu, Shengwen Liang, Huawei Li, Ying Wang

PDF

TL;DR

LLMulator is a novel framework that uses large language models and reinforcement learning to accurately predict performance of dataflow accelerators across diverse architectures and input-dependent control flows.

Contribution

It introduces a hardware- and application-aware numeric modeling approach with input-adaptive calibration and data augmentation for improved generalization.

Findings

01

Reduces cycle prediction error by 9.7% with dynamic calibration.

02

Converges to 11.2% error after few iterations.

03

Significantly improves prediction accuracy across architectures.

Abstract

Accurate and fast performance prediction for dataflow-based accelerators is vital for efficient hardware design and design space exploration, yet existing methods struggle to generalize across architectures, applications, and input-dependent control flows. We present LLMulator, a progressive numeric modeling framework leveraging the program semantic knowledge of pre-trained large language models (LLMs) for robust, hardware- and application-aware prediction. Our numeric model treats performance values as categorical token sequences, enabling range-agnostic estimates and confidence-aware predictions for unseen applications. To handle input-dependent control flows, we introduce a reinforcement learning-based dynamic calibration method, reducing cycle prediction error by 9.7% over static models and converging to 11.2% error after a few iterations. For cross-hardware generalization, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.