Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Youliang Yuan; Wenxiang Jiao; Yuejin Xie; Chihao Shen; Menghan Tian; Wenxuan Wang; Jen-tse Huang; Pinjia He

arXiv:2505.17455·cs.CL·October 21, 2025

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Youliang Yuan, Wenxiang Jiao, Yuejin Xie, Chihao Shen, Menghan Tian, Wenxuan Wang, Jen-tse Huang, Pinjia He

PDF

TL;DR

This paper introduces PaSBench, a benchmark for evaluating the proactive safety capabilities of multimodal language models across various safety-critical scenarios, revealing significant limitations in current models' ability to predict risks.

Contribution

It establishes the first proactive safety benchmark for multimodal models, providing systematic evaluation and analysis of their limitations in risk detection.

Findings

01

Top models achieve around 70% accuracy but miss 45-55% of risks

02

Failure analysis shows issues in proactive reasoning, not knowledge deficits

03

Dataset and benchmark promote development of safer, proactive AI systems

Abstract

Human safety awareness gaps often prevent the timely recognition of everyday risks. In solving this problem, a proactive safety artificial intelligence (AI) system would work better than a reactive one. Instead of just reacting to users' questions, it would actively watch people's behavior and their environment to detect potential dangers in advance. Our Proactive Safety Bench (PaSBench) evaluates this capability through 416 multimodal scenarios (128 image sequences, 288 text logs) spanning 5 safety-critical domains. Evaluation of 36 advanced models reveals fundamental limitations: Top performers like Gemini-2.5-pro achieve 71% image and 64% text accuracy, but miss 45-55% risks in repeated trials. Through failure analysis, we identify unstable proactive reasoning rather than knowledge deficits as the primary limitation. This work establishes (1) a proactive safety benchmark, (2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.