Poly-Guard: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset
Mintong Kang, Zhaorun Chen, Chejian Xu, Jiawei Zhang, Chengquan Guo, Minzhou Pan, Ivan Revilla, Yu Sun, Bo Li

TL;DR
Poly-Guard introduces a comprehensive multi-domain safety dataset grounded in real safety policies, enabling better evaluation and development of guardrail models for diverse applications.
Contribution
It provides the first large-scale, policy-grounded multi-domain guardrail dataset with diverse formats and adversarial challenges, addressing gaps in existing benchmarks.
Findings
Models show limited domain-specific safety coverage.
Model safety performance varies across risk categories.
All models are vulnerable to adversarial attacks.
Abstract
As LLMs become widespread across diverse applications, concerns about the security and safety of LLM interactions have intensified. Numerous guardrail models and benchmarks have been developed to ensure LLM content safety. However, existing guardrail benchmarks are often built upon ad hoc risk taxonomies that lack a principled grounding in standardized safety policies, limiting their alignment with real-world operational requirements. Moreover, they tend to overlook domain-specific risks, while the same risk category can carry different implications across different domains. To bridge these gaps, we introduce Poly-Guard, the first massive multi-domain safety policy-grounded guardrail dataset. Poly-Guard offers: (1) broad domain coverage across eight safety-critical domains, such as finance, law, and codeGen; (2) policy-grounded risk construction based on authentic, domain-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation Safety and Impact Analysis · Infrastructure Maintenance and Monitoring · Geophysical Methods and Applications
