Agent-E: From Autonomous Web Navigation to Foundational Design   Principles in Agentic Systems

Tamer Abuelsaad; Deepak Akkil; Prasenjit Dey; Ashish Jagmohan; and Aditya Vempaty; Ravi Kokku

arXiv:2407.13032·cs.AI·July 19, 2024·2 cites

Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems

Tamer Abuelsaad, Deepak Akkil, Prasenjit Dey, Ashish Jagmohan, and Aditya Vempaty, Ravi Kokku

PDF

Open Access 2 Repos

TL;DR

This paper introduces Agent-E, a novel web agent with architectural improvements that outperform state-of-the-art agents on benchmarks, and derives foundational design principles for developing effective agentic systems.

Contribution

The paper presents Agent-E, a web agent with hierarchical architecture, advanced observation techniques, and change observation, along with general design principles for agent development.

Findings

01

Agent-E outperforms SOTA web agents by 10-30% on WebVoyager benchmark.

02

Hierarchical architecture and observation denoising improve agent performance.

03

Design principles include domain-specific skills and self-improvement mechanisms.

Abstract

AI Agents are changing the way work gets done, both in consumer and enterprise domains. However, the design patterns and architectures to build highly capable agents or multi-agent systems are still developing, and the understanding of the implication of various design choices and algorithms is still evolving. In this paper, we present our work on building a novel web agent, Agent-E \footnote{Our code is available at \url{https://github.com/EmergenceAI/Agent-E}}. Agent-E introduces numerous architectural improvements over prior state-of-the-art web agents such as hierarchical architecture, flexible DOM distillation and denoising method, and the concept of \textit{change observation} to guide the agent towards more accurate performance. We first present the results of an evaluation of Agent-E on WebVoyager benchmark dataset and show that Agent-E beats other SOTA text and multi-modal web…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation