ActionBert: Leveraging User Actions for Semantic Understanding of User Interfaces
Zecheng He, Srinivas Sunkara, Xiaoxue Zang, Ying Xu, Lijuan Liu, Nevan, Wichers, Gabriel Schubiner, Ruby Lee, Jindong Chen, Blaise Ag\"uera y, Arcas

TL;DR
ActionBert is a pre-trained model that leverages user actions, visual cues, and domain-specific features to improve semantic understanding of user interfaces, enhancing tasks like icon classification and component retrieval.
Contribution
It introduces ActionBert, a novel pre-training approach that utilizes user interaction traces and multi-modal features for comprehensive UI understanding.
Findings
Outperforms baseline models by up to 15.5% across tasks
Effectively captures UI semantics from user actions and domain features
Improves generalization in diverse UI understanding tasks
Abstract
As mobile devices are becoming ubiquitous, regularly interacting with a variety of user interfaces (UIs) is a common aspect of daily life for many people. To improve the accessibility of these devices and to enable their usage in a variety of settings, building models that can assist users and accomplish tasks through the UI is vitally important. However, there are several challenges to achieve this. First, UI components of similar appearance can have different functionalities, making understanding their function more important than just analyzing their appearance. Second, domain-specific features like Document Object Model (DOM) in web pages and View Hierarchy (VH) in mobile applications provide important signals about the semantics of UI elements, but these features are not in a natural language format. Third, owing to a large diversity in UIs and absence of standard DOM or VH…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsWeb Data Mining and Analysis · Recommender Systems and Techniques · Digital Accessibility for Disabilities
