Fault-Tolerant Design and Multi-Objective Model Checking for Real-Time Deep Reinforcement Learning Systems

Guoxin Su; Thomas Robinson; Hoa Khanh Dam; Li Liu; David S. Rosenblum

arXiv:2603.23113·cs.SE·March 25, 2026

Fault-Tolerant Design and Multi-Objective Model Checking for Real-Time Deep Reinforcement Learning Systems

Guoxin Su, Thomas Robinson, Hoa Khanh Dam, Li Liu, David S. Rosenblum

PDF

Open Access

TL;DR

This paper introduces a formal framework combining timed automata and Markov decision processes to enhance fault tolerance and multi-objective optimization in real-time deep reinforcement learning systems, addressing safety and performance.

Contribution

It presents a novel formal approach for designing real-time switching mechanisms and a convex query technique for multi-objective model checking in DRL systems.

Findings

01

The framework effectively balances safety and performance objectives.

02

MOPMC tool demonstrates high scalability and efficiency.

03

Formal methods improve dependability in real-time DRL applications.

Abstract

Deep reinforcement learning (DRL) has emerged as a powerful paradigm for solving complex decision-making problems. However, DRL-based systems still face significant dependability challenges particularly in real-time environments due to the simulation-to-reality gap, out-of-distribution observations, and the critical impact of latency. Latency-induced faults, in particular, can lead to unsafe or unstable behaviour, yet existing fault-tolerance approaches to DRL systems lack formal methods to rigorously analyse and optimise performance and safety simultaneously in real-time settings. To address this, we propose a formal framework for designing and analysing real-time switching mechanisms between DRL agents and alternative controllers. Our approach leverages Timed Automata (TAs) for explicit switch logic design, which is then syntactically converted to a Markov Decision Process (MDP) for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Machine Learning and Algorithms