SmartFlow: Robotic Process Automation using LLMs
Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam, Shroff

TL;DR
SmartFlow is an AI-based RPA system that leverages large language models and computer vision to understand and automate complex GUI-based tasks across diverse applications, improving adaptability and reducing human intervention.
Contribution
We introduce SmartFlow, a novel RPA system combining LLMs and deep learning for visual understanding, enabling adaptable automation without manual configuration.
Findings
SmartFlow demonstrates robustness across diverse application layouts.
The system effectively automates complex, screen-based business processes.
Our dataset supports research in visual understanding for RPA.
Abstract
Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through drag-and-drop or automation frameworks such as Selenium to create navigation workflows, rather than visual understanding of screen elements. In this context, we present SmartFlow, an AI-based RPA system that uses pre-trained large language models (LLMs) coupled with deep-learning based image understanding. Our system can adapt to new scenarios, including changes in the user interface and variations in input data, without the need for human intervention. SmartFlow uses computer vision and natural language processing to perceive visible elements on the graphical user interface (GUI) and convert them into a textual representation. This information is then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Process Automation Applications · FinTech, Crowdfunding, Digital Finance
Methodstravel james · Sparse Evolutionary Training
