Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An

TL;DR
Synapse introduces a novel approach for computer control agents using trajectory-based exemplars, memory, and state abstraction to enhance generalization and performance in complex, long-horizon tasks with limited context.
Contribution
It proposes a new prompting method with trajectory exemplars, state abstraction, and memory retrieval, significantly improving generalization and success rates in computer control tasks.
Findings
Achieves 99.2% success rate on MiniWoB++
First ICL method to solve the book-flight task in MiniWoB++
56% improvement over previous state-of-the-art in Mind2Web
Abstract
Building agents with large language models (LLMs) for computer control is a burgeoning research area, where the agent receives computer states and performs actions to complete complex tasks. Previous computer agents have demonstrated the benefits of in-context learning (ICL); however, their performance is hindered by several issues. First, the limited context length of LLMs and complex computer states restrict the number of exemplars, as a single webpage can consume the entire context. Second, the exemplars in current methods, such as high-level plans and multi-choice questions, cannot represent complete trajectories, leading to suboptimal performance in long-horizon tasks. Third, existing computer agents rely on task-specific exemplars and overlook the similarity among tasks, resulting in poor generalization to novel tasks. To address these challenges, we introduce Synapse, a computer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Data Stream Mining Techniques · Data Quality and Management
MethodsFocus
