AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

Minghao Chen; Xinyi Hu; Zhou Yu; Yufei Yin

arXiv:2605.21082·cs.AI·May 21, 2026

AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

Minghao Chen, Xinyi Hu, Zhou Yu, Yufei Yin

PDF

TL;DR

AutoRPA automatically converts LLM-driven GUI interaction strategies into efficient, reusable RPA functions, significantly reducing token usage and improving runtime performance in repetitive tasks.

Contribution

AutoRPA introduces a novel framework that distills ReAct-style GUI agents into robust RPA functions using a translator-builder pipeline and hybrid repair strategies.

Findings

01

RPA functions reduce token usage by 82% to 96%.

02

AutoRPA improves runtime efficiency in GUI tasks.

03

Generated RPA functions successfully solve similar tasks.

Abstract

Large Language Model (LLM) based agents have demonstrated proficiency in multi-step interactions with graphical user interfaces (GUIs). While most research focuses on improving single-task performance, practical scenarios often involve repetitive GUI tasks for which invoking LLM reasoning repeatedly, i.e., the ReAct paradigm, is inefficient. Prior to LLMs, traditional Robotic Process Automation (RPA) offers runtime efficiency but demands significant manual effort to develop and maintain. To bridge this gap, we propose AutoRPA, a framework that automatically distills the decision logic of ReAct-style agents into robust RPA functions. AutoRPA introduces two core innovations: (1) A translator-builder pipeline, where a translator agent converts hard-coded ReAct actions into soft-coded procedures, and a builder agent synthesizes robust RPA functions via retrieval-augmented generation over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.