Vision-Guided Iterative Refinement for Frontend Code Generation
Hannah Sansford, Derek H. C. Law, Wei Liu, Abhishek Tripathi, Niresh Agarwal, Gerrit J. J. van den Burg

TL;DR
This paper introduces an automated vision-language critic that guides iterative refinement of frontend web code, significantly improving quality over single-pass generation and exploring parameter-efficient fine-tuning to internalize critique benefits.
Contribution
It presents a fully automated critic-in-the-loop framework using vision-language models for visual feedback, and investigates fine-tuning methods to replicate critic improvements.
Findings
Up to 17.8% performance increase with three refinement cycles.
Fine-tuning with LoRA achieves 25% of critic-in-the-loop gains.
Automated visual critique outperforms single-pass code generation.
Abstract
Code generation with large language models often relies on multi-stage human-in-the-loop refinement, which is effective but very costly - particularly in domains such as frontend web development where the solution quality depends on rendered visual output. We present a fully automated critic-in-the-loop framework in which a vision-language model serves as a visual critic that provides structured feedback on rendered webpages to guide iterative refinement of generated code. Across real-world user requests from the WebDev Arena dataset, this approach yields consistent improvements in solution quality, achieving up to 17.8% increase in performance over three refinement cycles. Next, we investigate parameter-efficient fine-tuning using LoRA to understand whether the improvements provided by the critic can be internalized by the code-generating LLM. Fine-tuning achieves 25% of the gains from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
