SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models

Hanqing Wang; Yuan Tian; Mingyu Liu; Zhenhao Zhang; Xiangyang Zhu

arXiv:2508.06142·cs.CV·January 6, 2026

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models

Hanqing Wang, Yuan Tian, Mingyu Liu, Zhenhao Zhang, Xiangyang Zhu

PDF

Open Access 1 Video

TL;DR

SDEval introduces a dynamic evaluation framework for Multimodal Large Language Models, enabling controllable safety benchmark adjustments to better assess safety risks and limitations amid evolving MLLM capabilities.

Contribution

It is the first framework to dynamically adjust safety benchmarks for MLLMs, addressing data contamination and outdated evaluations.

Findings

01

SDEval significantly influences safety evaluation results.

02

It mitigates data contamination issues.

03

Exposes safety limitations of current MLLMs.

Abstract

In the rapidly evolving landscape of Multimodal Large Language Models (MLLMs), the safety concerns of their outputs have earned significant attention. Although numerous datasets have been proposed, they may become outdated with MLLM advancements and are susceptible to data contamination issues. To address these problems, we propose \textbf{SDEval}, the \textit{first} safety dynamic evaluation framework to controllably adjust the distribution and complexity of safety benchmarks. Specifically, SDEval mainly adopts three dynamic strategies: text, image, and text-image dynamics to generate new samples from original benchmarks. We first explore the individual effects of text and image dynamics on model safety. Then, we find that injecting text dynamics into images can further impact safety, and conversely, injecting image dynamics into text also leads to safety risks. SDEval is general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Reliability and Analysis Research