Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
Yifei Zhou, Qianlan Yang, Kaixiang Lin, Min Bai, Xiong Zhou, Yu-Xiong, Wang, Sergey Levine, Erran Li

TL;DR
This paper introduces Proposer-Agent-Evaluator (PAE), a system enabling foundation model agents to autonomously discover and practice skills through autonomous task proposal and reinforcement learning, achieving state-of-the-art results in vision-based web navigation.
Contribution
The work presents the first autonomous skill discovery system combining task proposal, RL, and success evaluation for foundation model agents in real-world environments.
Findings
Achieved state-of-the-art performance on vision-based web navigation benchmarks.
Demonstrated effective autonomous skill discovery without human-annotated instructions.
Validated system on real-world and self-hosted websites with strong results.
Abstract
The vision of a broadly capable and goal-directed agent, such as an Internet-browsing agent in the digital world and a household humanoid in the physical world, has rapidly advanced, thanks to the generalization capability of foundation models. Such a generalist agent needs to have a large and diverse skill repertoire, such as finding directions between two travel locations and buying specific items from the Internet. If each skill needs to be specified manually through a fixed set of human-annotated instructions, the agent's skill repertoire will necessarily be limited due to the quantity and diversity of human-annotated instructions. In this work, we address this challenge by proposing Proposer-Agent-Evaluator, an effective learning system that enables foundation model agents to autonomously discover and practice skills in the wild. At the heart of PAE is a context-aware task proposer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Multi-Agent Systems and Negotiation · Semantic Web and Ontologies
MethodsEmirates Airlines Office in Dubai · Sparse Evolutionary Training
