Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad, Deepak Akkil, Prasenjit Dey, Ashish Jagmohan, and Aditya Vempaty, Ravi Kokku

TL;DR
This paper introduces Agent-E, a novel web agent with architectural improvements that outperform state-of-the-art agents on benchmarks, and derives foundational design principles for developing effective agentic systems.
Contribution
The paper presents Agent-E, a web agent with hierarchical architecture, advanced observation techniques, and change observation, along with general design principles for agent development.
Findings
Agent-E outperforms SOTA web agents by 10-30% on WebVoyager benchmark.
Hierarchical architecture and observation denoising improve agent performance.
Design principles include domain-specific skills and self-improvement mechanisms.
Abstract
AI Agents are changing the way work gets done, both in consumer and enterprise domains. However, the design patterns and architectures to build highly capable agents or multi-agent systems are still developing, and the understanding of the implication of various design choices and algorithms is still evolving. In this paper, we present our work on building a novel web agent, Agent-E \footnote{Our code is available at \url{https://github.com/EmergenceAI/Agent-E}}. Agent-E introduces numerous architectural improvements over prior state-of-the-art web agents such as hierarchical architecture, flexible DOM distillation and denoising method, and the concept of \textit{change observation} to guide the agent towards more accurate performance. We first present the results of an evaluation of Agent-E on WebVoyager benchmark dataset and show that Agent-E beats other SOTA text and multi-modal web…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
