Near-Optimal Design for Fault-Tolerant Systems with Homogeneous Components under Incomplete Information
Jalal Arabneydi, Amir G. Aghdam

TL;DR
This paper develops a near-optimal control strategy for fault-tolerant systems with homogeneous components, balancing detection and repair costs under probabilistic failure, verified through numerical simulations.
Contribution
It introduces a Bellman equation-based approach to design near-optimal fault-tolerance strategies for systems with incomplete information.
Findings
Proposed solution effectively balances detection and repair costs.
Numerical simulations confirm near-optimal performance.
Method applicable to systems with probabilistic component failures.
Abstract
In this paper, we study a fault-tolerant control for systems consisting of multiple homogeneous components such as parallel processing machines. This type of system is often more robust to uncertainty compared to those with a single component. The state of each component is either in the operating mode or faulty. At any time instant, each component may independently become faulty according to a Bernoulli probability distribution. If a component is faulty, it remains so until it is fixed. The objective is to design a fault-tolerant system by sequentially choosing one of the following three options: (a) do nothing at zero cost; b) detect the number of faulty components at the cost of inspection, and c) fix the system at the cost of repairing faulty components. A Bellman equation is developed to identify a near-optimal solution for the problem. The efficacy of the proposed solution is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Control Systems Optimization · Optimization and Search Problems
