GUing: A Mobile GUI Search Engine using a Vision-Language Model
Jialiang Wei, Anne-Lise Courbis, Thomas Lambolais, Binbin Xu, Pierre, Louis Bernard, G\'erard Dray, Walid Maalej

TL;DR
This paper introduces GUing, a vision-language model-based GUI search engine trained on a large dataset of app screenshots and captions, enabling more accurate text-to-GUI retrieval and related tasks.
Contribution
The paper presents GUing, the first vision-language model specifically trained for GUI retrieval, utilizing a large dataset of labeled app screenshots for improved accuracy.
Findings
Outperforms previous methods in text-to-GUI retrieval with up to 0.69 Recall@10
Achieves 0.91 HIT@10 in GUI retrieval tasks
Demonstrates effectiveness in GUI classification and sketch-to-GUI retrieval
Abstract
Graphical User Interfaces (GUIs) are central to app development projects. App developers may use the GUIs of other apps as a means of requirements refinement and rapid prototyping or as a source of inspiration for designing and improving their own apps. Recent research has thus suggested retrieving relevant GUI designs that match a certain text query from screenshot datasets acquired through crowdsourced or automated exploration of GUIs. However, such text-to-GUI retrieval approaches only leverage the textual information of the GUI elements, neglecting visual information such as icons or background images. In addition, retrieved screenshots are not steered by app developers and lack app features that require particular input data. To overcome these limitations, this paper proposes GUing, a GUI search engine based on a vision-language model called GUIClip, which we trained specifically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
