Build the web for agents, not agents for the web
Xing Han Lù, Gaurav Kamath, Marius Mosbach, Siva Reddy

TL;DR
This position paper proposes the Agentic Web Interface (AWI): a web interface designed specifically for AI agents, so that interfaces fit agent needs instead of forcing agents to adapt to interfaces built for humans.
Contribution
The paper introduces the concept of an AWI and six guiding principles for designing web interfaces for autonomous agents, addressing limitations of current human-centric designs.
Findings
Proposes AWI as a new web interface paradigm for agents
Establishes six principles for AWI design focusing on safety and efficiency
Aims to enable more reliable and transparent web agent interactions
Abstract
Recent advancements in Large Language Models (LLMs) and multimodal counterparts have spurred significant interest in developing web agents -- AI systems capable of autonomously navigating and completing tasks within web environments. While holding tremendous promise for automating complex web interactions, current approaches face substantial challenges due to the fundamental mismatch between human-designed interfaces and LLM capabilities. Current methods struggle with the inherent complexity of web inputs, whether processing massive DOM trees, relying on screenshots augmented with additional information, or bypassing the user interface entirely through API interactions. This position paper advocates for a paradigm shift in web agent research: rather than forcing web agents to adapt to interfaces designed for humans, we should develop a new interaction paradigm specifically optimized for…
Taxonomy
Topics: Speech and dialogue systems · Natural Language Processing Techniques · Multi-Agent Systems and Negotiation
