WebXSkill: Skill Learning for Autonomous Web Agents

Zhaoyang Wang; Qianhui Wu; Xuchao Zhang; Chaoyun Zhang; Wenlin Yao; Fazle Elahi Faisal; Baolin Peng; Si Qin; Suman Nath; Qingwei Lin; Chetan Bansal; Dongmei Zhang; Saravan Rajmohan; Jianfeng Gao; Huaxiu Yao

arXiv:2604.13318·cs.AI·April 16, 2026

WebXSkill: Skill Learning for Autonomous Web Agents

Zhaoyang Wang, Qianhui Wu, Xuchao Zhang, Chaoyun Zhang, Wenlin Yao, Fazle Elahi Faisal, Baolin Peng, Si Qin, Suman Nath, Qingwei Lin, Chetan Bansal, Dongmei Zhang, Saravan Rajmohan, Jianfeng Gao, Huaxiu Yao

PDF

1 Repo

TL;DR

WebXSkill introduces executable, parameterized skills for autonomous web agents, bridging the gap between natural language guidance and code execution, significantly improving task success rates.

Contribution

It presents a novel framework that extracts, organizes, and deploys executable skills with step-level guidance, enhancing web agent performance.

Findings

01

Improves task success rate by up to 12.9 points on WebVoyager.

02

Extracts reusable action subsequences from synthetic trajectories.

03

Provides both automated and guided execution modes.

Abstract

Autonomous web agents powered by large language models (LLMs) have shown promise in completing complex browser tasks, yet they still struggle with long-horizon workflows. A key bottleneck is the grounding gap in existing skill formulations: textual workflow skills provide natural language guidance but cannot be directly executed, while code-based skills are executable but opaque to the agent, offering no step-level understanding for error recovery or adaptation. We introduce WebXSkill, a framework that bridges this gap with executable skills, each pairing a parameterized action program with step-level natural language guidance, enabling both direct execution and agent-driven adaptation. WebXSkill operates in three stages: skill extraction mines reusable action subsequences from readily available synthetic agent trajectories and abstracts them into parameterized skills, skill…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aiming-lab/WebXSkill
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.