Automatically Generating UI Code from Screenshot: A   Divide-and-Conquer-Based Approach

Yuxuan Wan; Chaozheng Wang; Yi Dong; Wenxuan Wang; Shuqing Li; Yintong; Huo; Michael R. Lyu

arXiv:2406.16386·cs.SE·April 28, 2025

Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Yuxuan Wan, Chaozheng Wang, Yi Dong, Wenxuan Wang, Shuqing Li, Yintong, Huo, Michael R. Lyu

PDF

TL;DR

This paper introduces DCGen, a divide-and-conquer approach that segments website screenshots to improve automatic UI code generation, significantly enhancing accuracy and speed over existing methods.

Contribution

DCGen is the first segment-aware MLLM-based method for translating screenshots into UI code, addressing issues like element omission and distortion.

Findings

01

Up to 15% improvement in visual similarity

02

Up to 8% improvement in code similarity

03

Faster implementation of webpages by developers

Abstract

Websites are critical in today's digital world, with over 1.11 billion currently active and approximately 252,000 new sites launched daily. Converting website layout design into functional UI code is a time-consuming yet indispensable step of website development. Manual methods of converting visual designs into functional code present significant challenges, especially for non-experts. To explore automatic design-to-code solutions, we first conduct a motivating study on GPT-4o and identify three types of issues in generating UI code: element omission, element distortion, and element misarrangement. We further reveal that a focus on smaller visual segments can help multimodal large language models (MLLMs) mitigate these failures in the generation process. In this paper, we propose DCGen, a divide-and-conquer-based approach to automate the translation of webpage design to UI code. DCGen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus