ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Haibo Tong; Feifei Zhao; Linghao Feng; Ruoyu Wu; Ruolin Chen; Lu Jia; Zhou Zhao; Jindong Li; Tenglong Li; Erliang Lin; Shuai Yang; Enmeng Lu; Yinqian Sun; Qian Zhang; Zizhe Ruan; Jinyu Fan; Zeyang Yue; Ping Wu; Huangrui Li; Chengyi Sun; Yi Zeng

arXiv:2602.14135·cs.AI·March 2, 2026

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Haibo Tong, Feifei Zhao, Linghao Feng, Ruoyu Wu, Ruolin Chen, Lu Jia, Zhou Zhao, Jindong Li, Tenglong Li, Erliang Lin, Shuai Yang, Enmeng Lu, Yinqian Sun, Qian Zhang, Zizhe Ruan, Jinyu Fan, Zeyang Yue, Ping Wu, Huangrui Li, Chengyi Sun, Yi Zeng

PDF

Open Access 1 Datasets

TL;DR

The paper introduces ForesightSafety Bench, a comprehensive AI safety evaluation framework covering 94 risk dimensions, systematically assessing over twenty advanced models to identify safety vulnerabilities in frontier AI systems.

Contribution

It proposes a hierarchical, evolving safety benchmark with extensive risk dimensions and evaluates multiple large models to reveal key safety vulnerabilities in cutting-edge AI.

Findings

01

Widespread safety vulnerabilities across multiple AI safety pillars

02

Identification of key risk patterns and capability boundaries in advanced models

03

Benchmark covers 94 risk dimensions and includes extensive structured data

Abstract

Rapidly evolving AI exhibits increasingly strong autonomy and goal-directed capabilities, accompanied by derivative systemic risks that are more unpredictable, difficult to control, and potentially irreversible. However, current AI safety evaluation systems suffer from critical limitations such as restricted risk dimensions and failed frontier risk detection. The lagging safety benchmarks and alignment technologies can hardly address the complex challenges posed by cutting-edge AI models. To bridge this gap, we propose the "ForesightSafety Bench" AI Safety Evaluation Framework, beginning with 7 major Fundamental Safety pillars and progressively extends to advanced Embodied AI Safety, AI4Science Safety, Social and Environmental AI risks, Catastrophic and Existential Risks, as well as 8 critical industrial safety domains, forming a total of 94 refined risk dimensions. To date, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Beijing-AISI/ForesightSafety-Bench
dataset· 77 dl
77 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)