Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale
Gang Li, Gilles Baechler, Manuel Tragut, Yang Li

TL;DR
This paper introduces the CLAY pipeline, a deep learning method for denoising and annotating mobile UI layouts, significantly improving dataset quality and reducing manual labeling efforts for UI research.
Contribution
The paper presents a novel deep learning pipeline that automatically cleans and annotates mobile UI layouts, creating the large CLAY dataset and outperforming heuristic methods.
Findings
High accuracy in detecting invalid layout nodes (82.7% F1)
Effective recognition of object types (85.9% F1)
Significant improvement over heuristic baselines
Abstract
The layout of a mobile screen is a critical data source for UI design research and semantic understanding of the screen. However, UI layouts in existing datasets are often noisy, have mismatches with their visual representation, or consists of generic or app-specific types that are difficult to analyze and model. In this paper, we propose the CLAY pipeline that uses a deep learning approach for denoising UI layouts, allowing us to automatically improve existing mobile UI layout datasets at scale. Our pipeline takes both the screenshot and the raw UI layout, and annotates the raw layout by removing incorrect nodes and assigning a semantically meaningful type to each node. To experiment with our data-cleaning pipeline, we create the CLAY dataset of 59,555 human-annotated screen layouts, based on screenshots and raw layouts from Rico, a public mobile UI corpus. Our deep models achieve high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInteractive and Immersive Displays · Visual Attention and Saliency Detection · Innovative Human-Technology Interaction
