An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

Suvi De Silva; Alfreds Lapkovskis; Alaa Saleh; Sasu Tarkoma; Praveen Kumar Donta

arXiv:2605.10718·cs.DC·May 12, 2026

An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

Suvi De Silva, Alfreds Lapkovskis, Alaa Saleh, Sasu Tarkoma, Praveen Kumar Donta

PDF

TL;DR

AURORA is a lightweight, uncertainty-aware micro-agent framework that improves diagnosis and mitigation of grey failures in the computing continuum by leveraging causal analysis and confidence-based decision making.

Contribution

It introduces a novel micro-agent architecture integrating causal inference, uncertainty estimation, and a dual-gated mechanism for safe, efficient fault diagnosis and repair.

Findings

01

AURORA achieves 0% destructive actions in experiments.

02

It maintains 62.0% repair accuracy.

03

It has a mean time to repair of 3ms.

Abstract

Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of causal awareness or acting under high epistemic uncertainty, risking destructive interventions. This paper presents an uncertainty-aware resilience micro-agent for causal observability (AURORA), a lightweight framework for diagnosing and mitigating grey failures in edge-tier environments. The framework employs parallel micro-agents that integrate the free-energy principle, causal do-calculus, and localized causal state-graphs to support counterfactual root-cause analysis within each fault's Markov blanket. Restricting inference to causally relevant variables reduces computational overhead while preserving diagnostic fidelity. AURORA further introduces a dual-gated execution mechanism that authorizes remediation only when causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.