HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Yurun Chen; Xavier Hu; Yuhan Liu; Keting Yin; Juncheng Li; Zhuosheng Zhang; Shengyu Zhang

arXiv:2508.04010·cs.CL·August 7, 2025

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Yurun Chen, Xavier Hu, Yuhan Liu, Keting Yin, Juncheng Li, Zhuosheng Zhang, Shengyu Zhang

PDF

TL;DR

HarmonyGuard is a multi-agent framework that enhances web agent safety and utility by adaptive policy extraction and dual-objective optimization, addressing the limitations of prior single-objective, single-turn approaches.

Contribution

It introduces a collaborative multi-agent system with adaptive policy enhancement and real-time dual-objective optimization for safer and more effective web agents.

Findings

01

Policy compliance improved by up to 38%

02

Task completion increased by up to 20%

03

Achieves over 90% policy compliance across tasks

Abstract

Large language models enable agents to autonomously perform tasks in open web environments. However, as hidden threats within the web evolve, web agents face the challenge of balancing task performance with emerging risks during long-sequence operations. Although this challenge is critical, current research remains limited to single-objective optimization or single-turn scenarios, lacking the capability for collaborative optimization of both safety and utility in web environments. To address this gap, we propose HarmonyGuard, a multi-agent collaborative framework that leverages policy enhancement and objective optimization to jointly improve both utility and safety. HarmonyGuard features a multi-agent architecture characterized by two fundamental capabilities: (1) Adaptive Policy Enhancement: We introduce the Policy Agent within HarmonyGuard, which automatically extracts and maintains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.