From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents
Dezhi Ran, Zhi Gong, Yuzhe Guo, Mengzhou Wu, Yuan Cao, Haochuan Lu, Hengyu Zhang, Xia Zeng, Gang Cao, Liangchao Yao, Yuetang Deng, Wei Yang, Tao Xie

TL;DR
UIFormer is a novel framework that optimizes UI representations for LLM agents, significantly reducing token usage and improving efficiency without sacrificing performance, applicable across Android and Web platforms.
Contribution
UIFormer introduces a constraint-based, structured decomposition approach for automated UI transformation program synthesis, addressing key challenges in efficiency and semantic correctness.
Findings
Achieves 48.7% to 55.8% token reduction in UI representations.
Maintains or improves agent performance across benchmarks.
Validated in real-world deployment at WeChat.
Abstract
While Large Language Model (LLM) agents show great potential for automated UI navigation such as automated UI testing and AI assistants, their efficiency has been largely overlooked. Our motivating study reveals that inefficient UI representation creates a critical performance bottleneck. However, UI representation optimization, formulated as the task of automatically generating programs that transform UI representations, faces two unique challenges. First, the lack of Boolean oracles, which traditional program synthesis uses to decisively validate semantic correctness, poses a fundamental challenge to co-optimization of token efficiency and completeness. Second, the need to process large, complex UI trees as input while generating long, compositional transformation programs, making the search space vast and error-prone. Toward addressing the preceding limitations, we present UIFormer,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Software Testing and Debugging Techniques · Artificial Intelligence in Games
