HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning

Qingyun Zou; Feng Yu; Hongshi Tan; Yao Chen; Bingsheng He; WengFai Wong

arXiv:2605.13536·cs.LG·May 14, 2026

HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning

Qingyun Zou, Feng Yu, Hongshi Tan, Yao Chen, Bingsheng He, WengFai Wong

PDF

TL;DR

HLS-Seek is a reinforcement learning framework for high-level synthesis that uses a proxy reward model and uncertainty-aware techniques to optimize hardware design quality efficiently.

Contribution

It introduces a QoR-aware NL-to-HLS framework with a proxy reward system and uncertainty-aware Monte Carlo dropout switching, enabling faster and more effective optimization.

Findings

01

Achieves 81.5% syntax correctness pass@1 on HLS-eval.

02

Surpasses GPT-5.1 and other models in QoR metrics.

03

Attains the lowest latency on 16 out of 30 kernels.

Abstract

High-Level Synthesis (HLS) compiles algorithmic C/C++ descriptions into hardware, with Quality of Results (QoR) -- latency and resource utilization -- critically governed by pragma configurations and code structure. Existing LLM-based HLS approaches train for functional correctness but ignore QoR entirely. We observe that reinforcement learning (RL) for HLS does not require absolute synthesis results -- only relative comparisons between candidates. Based on this insight, we propose \textbf{HLS-Seek}, a QoR-aware NL-to-HLS framework that replaces expensive synthesis-in-the-loop RL with a comparative proxy reward model achieving 99.53\% Pareto-dominance accuracy. To prevent reward hacking, we introduce \textit{uncertainty-aware Monte Carlo (MC) dropout switching} that selectively invokes real Vitis HLS synthesis for low-confidence candidates and online updates the proxy, creating a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.