Safe and Scalable Web Agent Learning via Recreated Websites

Hyungjoo Chae; Jungsoo Park; Alan Ritter

arXiv:2603.10505·cs.CL·March 12, 2026

Safe and Scalable Web Agent Learning via Recreated Websites

Hyungjoo Chae, Jungsoo Park, Alan Ritter

PDF

Open Access

TL;DR

This paper introduces VeriEnv, a framework that creates safe, verifiable synthetic web environments from real websites, enabling scalable, self-evolving training for web agents without real-world risks.

Contribution

VeriEnv automatically clones real websites into executable environments, allowing agents to learn with verifiable rewards and scale training safely and efficiently.

Findings

01

Agents trained with VeriEnv generalize well to unseen websites.

02

Self-evolving training improves site-specific mastery.

03

Scaling environments enhances agent performance.

Abstract

Training autonomous web agents is fundamentally limited by the environments they learn from: real-world websites are unsafe to explore, hard to reset, and rarely provide verifiable feedback. We propose VeriEnv, a framework that treats language models as environment creators, automatically cloning real-world websites into fully executable, verifiable synthetic environments. By exposing controlled internal access via a Python SDK, VeriEnv enables agents to self-generate tasks with deterministic, programmatically verifiable rewards, eliminating reliance on heuristic or LLM-based judges. This design decouples agent learning from unsafe real-world interaction while enabling scalable self-evolution through environment expansion. Through experiments on web agent benchmarks, we show that agents trained with VeriEnv generalize to unseen websites, achieve site-specific mastery through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Topic Modeling