DiCriTest: Testing Scenario Generation for Decision-Making Agents Considering Diversity and Criticality
Qitong Chu, Yufeng Yue, Danya Yao, and Huaxin Pei

TL;DR
DiCriTest is a novel framework that enhances the generation of diverse and critical testing scenarios for decision-making agents by coordinating parameter and behavior spaces, improving safety verification in dynamic environments.
Contribution
It introduces a dual-space guided approach combining hierarchical parameter space analysis and behavior space feedback to better balance diversity and criticality in scenario generation.
Findings
Improves critical scenario generation by 56.23% on average.
Achieves greater diversity with new parameter-behavior metrics.
Outperforms state-of-the-art baselines in experiments.
Abstract
The growing deployment of decision-making agents in dynamic environments increases the demand for safety verification. While critical testing scenario generation has emerged as an appealing verification methodology, effectively balancing diversity and criticality remains a key challenge for existing methods, particularly due to local optima entrapment in high-dimensional scenario spaces. To address this limitation, we propose a dual-space guided testing framework that coordinates scenario parameter space and agent behavior space, aiming to generate testing scenarios considering diversity and criticality. Specifically, in the scenario parameter space, a hierarchical representation framework combines dimensionality reduction and multi-dimensional subspace evaluation to efficiently localize diverse and critical subspaces. This guides dynamic coordination between two generation modes: local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
