OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning

Boyu Zhu; Xiaofei Wen; Wenjie Jacky Mo; Tinghui Zhu; Yanan Xie; Peng Qi; Muhao Chen

arXiv:2512.02306·cs.AI·December 3, 2025

OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning

Boyu Zhu, Xiaofei Wen, Wenjie Jacky Mo, Tinghui Zhu, Yanan Xie, Peng Qi, Muhao Chen

PDF

Open Access

TL;DR

OmniGuard introduces a unified omni-modal safety framework with deliberate reasoning, trained on a large diverse dataset, to improve robustness and effectiveness in safeguarding across all modalities in human-AI interactions.

Contribution

The paper presents OmniGuard, the first omni-modal guardrail system capable of safeguarding across all modalities with deliberate reasoning, supported by a large, annotated omni-modal safety dataset.

Findings

01

Achieves strong safety performance across 15 benchmarks.

02

Demonstrates robustness in diverse multimodal safety scenarios.

03

Provides a unified framework for omni-modal risk mitigation.

Abstract

Omni-modal Large Language Models (OLLMs) that process text, images, videos, and audio introduce new challenges for safety and value guardrails in human-AI interaction. Prior guardrail research largely targets unimodal settings and typically frames safeguarding as binary classification, which limits robustness across diverse modalities and tasks. To address this gap, we propose OmniGuard, the first family of omni-modal guardrails that performs safeguarding across all modalities with deliberate reasoning ability. To support the training of OMNIGUARD, we curate a large, comprehensive omni-modal safety dataset comprising over 210K diverse samples, with inputs that cover all modalities through both unimodal and cross-modal samples. Each sample is annotated with structured safety labels and carefully curated safety critiques from expert models through targeted distillation. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Human-Automation Interaction and Safety