Vision-Guided Iterative Refinement for Frontend Code Generation

Hannah Sansford; Derek H. C. Law; Wei Liu; Abhishek Tripathi; Niresh Agarwal; Gerrit J. J. van den Burg

arXiv:2604.05839·cs.AI·April 8, 2026

Vision-Guided Iterative Refinement for Frontend Code Generation

Hannah Sansford, Derek H. C. Law, Wei Liu, Abhishek Tripathi, Niresh Agarwal, Gerrit J. J. van den Burg

PDF

TL;DR

This paper introduces an automated vision-language critic that guides iterative refinement of frontend web code, significantly improving quality over single-pass generation and exploring parameter-efficient fine-tuning to internalize critique benefits.

Contribution

It presents a fully automated critic-in-the-loop framework using vision-language models for visual feedback, and investigates fine-tuning methods to replicate critic improvements.

Findings

01

Up to 17.8% performance increase with three refinement cycles.

02

Fine-tuning with LoRA achieves 25% of critic-in-the-loop gains.

03

Automated visual critique outperforms single-pass code generation.

Abstract

Code generation with large language models often relies on multi-stage human-in-the-loop refinement, which is effective but very costly - particularly in domains such as frontend web development where the solution quality depends on rendered visual output. We present a fully automated critic-in-the-loop framework in which a vision-language model serves as a visual critic that provides structured feedback on rendered webpages to guide iterative refinement of generated code. Across real-world user requests from the WebDev Arena dataset, this approach yields consistent improvements in solution quality, achieving up to 17.8% increase in performance over three refinement cycles. Next, we investigate parameter-efficient fine-tuning using LoRA to understand whether the improvements provided by the critic can be internalized by the code-generating LLM. Fine-tuning achieves 25% of the gains from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.