Intelligent Resilience Testing for Decision-Making Agents with Dual-Mode Surrogate Adaptation

Jingxuan Yang; Weichao Xu; Yuchen Shi; Yi Zhang; Shuo Feng; Huaxin Pei

arXiv:2512.09372·eess.SY·December 11, 2025

Open Access

TL;DR

This paper introduces IRTest, an adaptive online testing framework for decision-making agents. It narrows the gap between a surrogate model and the real agent under test through neural fine-tuning and importance sampling, improving robustness and generalizability.

Contribution

The paper presents IRTest, a novel online adaptive testing framework combining neural fine-tuning and importance sampling to improve surrogate-based testing of decision agents.

Findings

01 IRTest improves failure discovery efficiency.

02 IRTest enhances testing robustness across systems.

03 IRTest demonstrates strong generalizability in experiments.

Abstract

Testing and evaluating decision-making agents remains challenging due to unknown system architectures, limited access to internal states, and the vastness of high-dimensional scenario spaces. Existing testing approaches often rely on surrogate models of decision-making agents to generate large-scale scenario libraries; however, discrepancies between surrogate models and real decision-making agents significantly limit their generalizability and practical applicability. To address this challenge, this paper proposes intelligent resilience testing (IRTest), a unified online adaptive testing framework designed to rapidly adjust to diverse decision-making agents. IRTest initializes with an offline-trained surrogate prediction model and progressively reduces the surrogate-to-real gap during testing through two complementary adaptation mechanisms: (i) online neural fine-tuning in data-rich…
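
As a rough illustration of why importance sampling (named in the TL;DR) helps when failures are rare, the sketch below estimates the failure rate of a toy one-dimensional "agent". It is not the IRTest algorithm: the Gaussian scenario distribution, the fixed failure threshold, the shifted proposal, and every function name are assumptions made for this example. The shifted proposal only stands in for the kind of guidance a surrogate model might provide about where failures are likely.

```python
import math
import random


def normal_pdf(x: float, mean: float, std: float) -> float:
    """Density of a normal distribution, used for the likelihood ratio."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def toy_agent_fails(x: float, threshold: float = 3.0) -> bool:
    """Hypothetical black-box agent: fails when the scenario value is extreme."""
    return x > threshold


def importance_sampling_failure_rate(
    n_samples: int, proposal_mean: float, seed: int = 0
) -> float:
    """Estimate P(failure) under the nominal N(0, 1) scenario distribution.

    Scenarios are drawn from a proposal N(proposal_mean, 1) that concentrates
    on the suspected failure region; each failure is reweighted by the ratio
    nominal density / proposal density so the estimate stays unbiased.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        x = rng.gauss(proposal_mean, 1.0)
        if toy_agent_fails(x):
            weight = normal_pdf(x, 0.0, 1.0) / normal_pdf(x, proposal_mean, 1.0)
            total += weight
    return total / n_samples


# Exact tail probability of N(0, 1) beyond 3, for comparison only.
exact = 0.5 * math.erfc(3.0 / math.sqrt(2.0))
estimate = importance_sampling_failure_rate(20_000, proposal_mean=3.0)

if __name__ == "__main__":
    print(f"exact failure rate:    {exact:.6e}")
    print(f"importance estimate:   {estimate:.6e}")
```

In this toy setting, sampling directly from the nominal distribution would see a failure only about once per several hundred scenarios, whereas the shifted proposal hits the failure region in roughly half of its draws. The quality of the estimate depends on how well the proposal matches the true failure region, which is where a mismatch between a surrogate and the real agent would hurt.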

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Software Testing and Debugging Techniques · Explainable Artificial Intelligence (XAI)