An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum
Suvi De Silva, Alfreds Lapkovskis, Alaa Saleh, Sasu Tarkoma, Praveen Kumar Donta

TL;DR
AURORA is a lightweight, uncertainty-aware micro-agent framework that improves diagnosis and mitigation of grey failures in the computing continuum by leveraging causal analysis and confidence-based decision making.
Contribution
It introduces a novel micro-agent architecture integrating causal inference, uncertainty estimation, and a dual-gated mechanism for safe, efficient fault diagnosis and repair.
Findings
AURORA achieves 0% destructive actions in experiments.
It maintains 62.0% repair accuracy.
It has a mean time to repair of 3ms.
Abstract
Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of causal awareness or acting under high epistemic uncertainty, risking destructive interventions. This paper presents an uncertainty-aware resilience micro-agent for causal observability (AURORA), a lightweight framework for diagnosing and mitigating grey failures in edge-tier environments. The framework employs parallel micro-agents that integrate the free-energy principle, causal do-calculus, and localized causal state-graphs to support counterfactual root-cause analysis within each fault's Markov blanket. Restricting inference to causally relevant variables reduces computational overhead while preserving diagnostic fidelity. AURORA further introduces a dual-gated execution mechanism that authorizes remediation only when causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
