Devil's Advocate: Anticipatory Reflection for LLM Agents
Haoyu Wang, Tao Li, Zhiwei Deng, Dan Roth, Yang Li

TL;DR
This paper presents a novel introspective approach for LLM agents that improves their task-solving consistency and adaptability by decomposing tasks, anticipating failures, and reviewing actions, leading to higher success rates and efficiency.
Contribution
Introduces a three-fold introspective methodology for LLM agents, enhancing their ability to anticipate failures, align actions, and refine strategies in complex tasks.
Findings
23.5% success rate in WebArena tasks, outperforming zero-shot methods by 3.5%.
Reduces plan revisions and trials by 45%.
Demonstrates improved robustness and efficiency in web environment tasks.
Abstract
In this work, we introduce a novel approach that equips LLM agents with introspection, enhancing consistency and adaptability in solving complex tasks. Our approach prompts LLM agents to decompose a given task into manageable subtasks (i.e., to make a plan), and to continuously introspect upon the suitability and results of their actions. %; and when necessary, to explore ``the road not taken.'' We implement a three-fold introspective intervention: 1) anticipatory reflection on potential failures and alternative remedy before action execution, 2) post-action alignment with subtask objectives and backtracking with remedy to ensure utmost effort in plan execution, and 3) comprehensive review upon plan completion for future strategy refinement. By deploying and experimenting with this methodology -- a zero-shot approach -- within WebArena for practical tasks in web environments, our agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Rights Management and Security · Auction Theory and Applications · Blockchain Technology Applications and Security
