Psychologically-Inspired, Unsupervised Inference of Perceptual Groups of GUI Widgets from GUI Images
Mulong Xie, Zhenchang Xing, Sidong Feng, Chunyang Chen, Liming Zhu,, Xiwei Xu

TL;DR
This paper introduces an unsupervised, image-based method inspired by Gestalt principles to automatically segment GUI widgets into perceptual groups, enhancing visual understanding without relying on GUI implementation details or training data.
Contribution
The paper presents a novel unsupervised approach for perceptual grouping of GUI widgets from images, independent of GUI code and training data, inspired by psychological principles.
Findings
Outperforms existing heuristics-based methods on a large dataset
Requires only GUI images, no implementation details or training data
Improves automation in GUI design and analysis tasks
Abstract
Graphical User Interface (GUI) is not merely a collection of individual and unrelated widgets, but rather partitions discrete widgets into groups by various visual cues, thus forming higher-order perceptual units such as tab, menu, card or list. The ability to automatically segment a GUI into perceptual groups of widgets constitutes a fundamental component of visual intelligence to automate GUI design, implementation and automation tasks. Although humans can partition a GUI into meaningful perceptual groups of widgets in a highly reliable way, perceptual grouping is still an open challenge for computational approaches. Existing methods rely on ad-hoc heuristics or supervised machine learning that is dependent on specific GUI implementations and runtime information. Research in psychology and biological vision has formulated a set of principles (i.e., Gestalt theory of perception) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Gaze Tracking and Assistive Technology
