TL;DR
UIBenchKit is an open-source toolkit that standardizes the evaluation of design-to-code models, facilitating fair comparison and accelerating research in web automation.
Contribution
It introduces a unified, plug-and-play platform for benchmarking design-to-code methods with consistent metrics and environment setup.
Findings
Benchmarking reveals strengths and weaknesses of current methods.
The toolkit enables systematic comparison across multiple metrics.
The benchmarking study yields findings that point to directions for future research.
Abstract
Recent years have seen substantial progress in automated design-to-code generation, with many methods proposed for generating HTML and CSS from webpage screenshots. However, the absence of a standardized evaluation platform makes it difficult to compare these methods fairly, limiting both practical adoption and systematic research progress. To bridge this gap, we introduce UIBenchKit, an open-source, integrated toolkit designed to unify the evaluation of design-to-code tasks. UIBenchKit abstracts the complexities of environment setup, model inference, and code rendering, offering researchers a plug-and-play architecture to compare various methods under consistent settings. In addition, it offers an analytical interface for comparison across multiple metrics. Using UIBenchKit, we conduct a benchmarking study of existing tools and derive several findings that highlight directions for…
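The plug-and-play idea in the abstract can be illustrated with a minimal sketch. It is not UIBenchKit's actual API: every name below (`Benchmark`, `Sample`, `register_method`, `register_metric`, `text_similarity`) is hypothetical. The sketch assumes a method is a function from a screenshot path to an HTML string and a metric is a function that scores generated HTML against a reference. Rendering is omitted, so the example metric compares raw HTML text instead of rendered pages.

```python
from dataclasses import dataclass, field
from difflib import SequenceMatcher
from statistics import mean
from typing import Callable, Dict, List

# Hypothetical type aliases (assumptions, not taken from the toolkit):
# a method maps a screenshot path to generated HTML,
# a metric scores generated HTML against reference HTML.
Method = Callable[[str], str]
Metric = Callable[[str, str], float]


@dataclass
class Sample:
    """One benchmark item: an input screenshot and its ground-truth HTML."""
    screenshot_path: str
    reference_html: str


@dataclass
class Benchmark:
    """Registry that runs every registered method against every metric."""
    methods: Dict[str, Method] = field(default_factory=dict)
    metrics: Dict[str, Metric] = field(default_factory=dict)

    def register_method(self, name: str, fn: Method) -> None:
        self.methods[name] = fn

    def register_metric(self, name: str, fn: Metric) -> None:
        self.metrics[name] = fn

    def run(self, samples: List[Sample]) -> Dict[str, Dict[str, float]]:
        """Return the mean score per method and metric over all samples."""
        results: Dict[str, Dict[str, float]] = {}
        for method_name, method in self.methods.items():
            # Run inference once per sample, then reuse outputs for all metrics.
            outputs = [method(s.screenshot_path) for s in samples]
            results[method_name] = {
                metric_name: mean(
                    metric(out, s.reference_html)
                    for out, s in zip(outputs, samples)
                )
                for metric_name, metric in self.metrics.items()
            }
        return results


def text_similarity(generated: str, reference: str) -> float:
    """Stand-in metric: character-level similarity of the two HTML strings."""
    return SequenceMatcher(None, generated, reference).ratio()


if __name__ == "__main__":
    bench = Benchmark()
    # A stub "model" that ignores the screenshot and returns fixed HTML.
    bench.register_method("stub", lambda path: "<html><body>Hello</body></html>")
    bench.register_metric("text_similarity", text_similarity)
    samples = [Sample("page1.png", "<html><body>Hello</body></html>")]
    print(bench.run(samples))
```

Keeping methods and metrics as plain callables behind a shared registry is one way to ensure every method is scored on the same samples with the same metrics. The real toolkit may organize this differently.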
