Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs
Ruichen Zhang, Mufan Qiu, Zhen Tan, Mohan Zhang, Vincent Lu, Jie Peng,, Kaidi Xu, Leandro Z. Agudelo, Peter Qian, Tianlong Chen

TL;DR
This paper introduces AgentSymbiotic, an iterative framework that enhances web browsing agents by leveraging the complementary strengths of large and small LLMs through data synthesis, distillation, and a hybrid privacy mode, achieving state-of-the-art results.
Contribution
The paper proposes a novel symbiotic framework combining large and small LLMs for web agents, with innovative distillation and data synthesis strategies to improve performance.
Findings
Large LLMs excel at trajectory generation for distillation.
Small LLMs' divergence promotes exploration of new trajectories.
AgentSymbiotic achieves state-of-the-art results on WEBARENA.
Abstract
Web browsing agents powered by large language models (LLMs) have shown tremendous potential in automating complex web-based tasks. Existing approaches typically rely on large LLMs (e.g., GPT-4o) to explore web environments and generate trajectory data, which is then used either for demonstration retrieval (for large LLMs) or to distill small LLMs (e.g., Llama3) in a process that remains decoupled from the exploration. In this paper, we propose AgentSymbiotic, an iterative framework that couples data synthesis with task-performance, yielding a "symbiotic improvement" for both large and small LLMs. Our study uncovers a complementary dynamic between LLM types: while large LLMs excel at generating high-quality trajectories for distillation, the distilled small LLMs-owing to their distinct reasoning capabilities-often choose actions that diverge from those of their larger counterparts. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Topic Modeling · Spam and Phishing Detection
