Figma2Code: Automating Multimodal Design to Code in the Wild

Yi Gui; Jiawan Zhang; Yina Wang; Tianran Ma; Yao Wan; Shilin He; Dongping Chen; Zhou Zhao; Wenbin Jiang; Xuanhua Shi; Hai Jin; Philip S Yu

arXiv:2604.13648·cs.SE·April 16, 2026

Figma2Code: Automating Multimodal Design to Code in the Wild

Yi Gui, Jiawan Zhang, Yina Wang, Tianran Ma, Yao Wan, Shilin He, Dongping Chen, Zhou Zhao, Wenbin Jiang, Xuanhua Shi, Hai Jin, Philip S Yu

PDF

1 Video

TL;DR

Figma2Code introduces a multimodal approach to convert Figma design files into UI code, leveraging rich metadata to improve automation in real-world development workflows.

Contribution

The paper presents a new multimodal dataset and benchmarks for design-to-code tasks using Figma files, advancing beyond image-only approaches.

Findings

01

Proprietary models excel in visual fidelity but struggle with layout responsiveness.

02

Models tend to directly map primitive visual attributes from metadata.

03

A curated dataset of 213 high-quality Figma cases was created for benchmarking.

Abstract

Front-end development constitutes a substantial portion of software engineering, yet converting design mockups into production-ready User Interface (UI) code remains tedious and costly. While recent work has explored automating this process with Multimodal Large Language Models (MLLMs), existing approaches typically rely solely on design images. As a result, they must infer complex UI details from images alone, often leading to degraded results. In real-world development workflows, however, design mockups are usually delivered as Figma files, a widely used tool for front-end design, that embed rich multimodal information (e.g., metadata and assets) essential for generating high-quality UI. To bridge this gap, we introduce Figma2Code, a new task that advances design-to-code into a multimodal setting and aims to automate design-to-code in the wild. Specifically, we collect paired design…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Figma2Code: Automating Multimodal Design to Code in the Wild· slideslive