Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models

Zixiang Peng; Yongxiu Xu; Qinyi Zhang; Jiexun Shen; Yifan Zhang; Hongbo Xu; Yubin Wang; Gaopeng Gou

arXiv:2604.00547·cs.AI·April 2, 2026

Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models

Zixiang Peng, Yongxiu Xu, Qinyi Zhang, Jiexun Shen, Yifan Zhang, Hongbo Xu, Yubin Wang, Gaopeng Gou

PDF

TL;DR

This paper introduces Uni-SafeBench, a comprehensive safety benchmark for Unified Multimodal Large Models, revealing that unification improves capabilities but significantly degrades inherent safety, especially in open-source models.

Contribution

The paper presents Uni-SafeBench and Uni-Judger, new tools for evaluating safety in UMLMs, and provides systematic analysis of safety trade-offs in unified models.

Findings

01

Unification enhances model capabilities but degrades safety.

02

Open-source UMLMs have lower safety performance than specialized models.

03

The benchmark covers six safety categories across seven task types.

Abstract

Unified Multimodal Large Models (UMLMs) integrate understanding and generation capabilities within a single architecture. While this architectural unification, driven by the deep fusion of multimodal features, enhances model performance, it also introduces important yet underexplored safety challenges. Existing safety benchmarks predominantly focus on isolated understanding or generation tasks, failing to evaluate the holistic safety of UMLMs when handling diverse tasks under a unified framework. To address this, we introduce Uni-SafeBench, a comprehensive benchmark featuring a taxonomy of six major safety categories across seven task types. To ensure rigorous assessment, we develop Uni-Judger, a framework that effectively decouples contextual safety from intrinsic safety. Based on comprehensive evaluations across Uni-SafeBench, we uncover that while the unification process enhances…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.