Scalable Explainability-as-a-Service (XaaS) for Edge AI Systems

Samaresh Kumar Singh; Joyjit Roy

arXiv:2602.04120·cs.LG·April 28, 2026

TL;DR

This paper introduces XaaS, a scalable distributed architecture that decouples explanation generation from inference in edge AI, reducing latency and redundancy while maintaining explanation quality across diverse IoT applications.

Contribution

The paper proposes a novel XaaS architecture with a distributed explanation cache, verification protocol, and adaptive engine, enabling efficient, scalable explainability for edge AI systems.

Findings

01. XaaS reduces explanation latency by 38% in real-world edge applications.

02. The architecture maintains high explanation fidelity across heterogeneous devices.

03. Decoupling inference from explanation generation improves scalability and resource efficiency.

Abstract

Although Explainable AI (XAI) has advanced significantly, its integration into edge and IoT systems is typically ad hoc and inefficient. Most current methods are "coupled": they generate explanations simultaneously with model inference. As a result, they incur redundant computation, high latency, and poor scalability when deployed across heterogeneous edge devices. In this work, we propose Explainability-as-a-Service (XaaS), a distributed architecture that treats explainability as a first-class system service rather than a model-specific feature. The key innovation of XaaS is that it decouples inference from explanation generation, allowing edge devices to request, cache, and verify explanations subject to resource and latency constraints. To achieve this, we introduce three main innovations: (1) A distributed…
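The decoupling described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the class `ExplanationService`, the functions `occlusion_attribution`, `digest`, and `verify`, the rounding-based cache key, and the SHA-256 integrity check are all hypothetical stand-ins chosen for illustration. The sketch only shows the general pattern of requesting an explanation separately from inference, reusing a cached result for near-identical inputs, and checking that a cached explanation has not been altered.

```python
import hashlib
import json
from typing import Callable, Dict, List, Tuple

Model = Callable[[List[float]], float]


def occlusion_attribution(model: Model, x: List[float]) -> List[float]:
    """Score each feature by how much the output changes when it is zeroed."""
    base = model(x)
    scores = []
    for i in range(len(x)):
        occluded = x[:i] + [0.0] + x[i + 1:]
        scores.append(base - model(occluded))
    return scores


def digest(model_id: str, scores: List[float]) -> str:
    """Hash an explanation together with the model it belongs to (assumed scheme)."""
    payload = json.dumps({"model": model_id, "scores": scores}, sort_keys=True)
    return hashlib.sha256(payload.encode()).hexdigest()


def verify(model_id: str, explanation: dict) -> bool:
    """Check that a received explanation still matches its recorded digest."""
    return explanation["digest"] == digest(model_id, explanation["scores"])


class ExplanationService:
    """Hypothetical service: explanations are produced on request, not during inference."""

    def __init__(self, models: Dict[str, Model], precision: int = 1) -> None:
        self.models = models
        self.precision = precision  # rounding used for cache keys (an assumption)
        self.cache: Dict[Tuple[str, Tuple[float, ...]], dict] = {}
        self.computed = 0  # how many explanations were actually generated

    def _key(self, model_id: str, x: List[float]) -> Tuple[str, Tuple[float, ...]]:
        return (model_id, tuple(round(v, self.precision) for v in x))

    def explain(self, model_id: str, x: List[float]) -> dict:
        key = self._key(model_id, x)
        if key not in self.cache:
            # Explain the rounded input so that all inputs sharing a key
            # receive the same explanation (a simplification of this sketch).
            scores = occlusion_attribution(self.models[model_id], list(key[1]))
            self.computed += 1
            self.cache[key] = {"scores": scores, "digest": digest(model_id, scores)}
        return self.cache[key]


def linear_model(x: List[float]) -> float:
    """Toy model standing in for an edge-deployed predictor."""
    weights = [2.0, -1.0, 0.5]
    return sum(w * v for w, v in zip(weights, x))


if __name__ == "__main__":
    service = ExplanationService({"toy": linear_model})
    prediction = linear_model([1.0, 2.0, 4.0])   # inference path, no explanation cost
    first = service.explain("toy", [1.0, 2.0, 4.0])
    second = service.explain("toy", [1.01, 2.02, 3.98])  # served from the cache
    print(prediction, first["scores"], service.computed, verify("toy", second))
```

In this sketch the inference call never pays for attribution, and the second request is answered from the cache because both inputs round to the same key. How coarse the key should be is a trade-off between reuse and explanation fidelity that the sketch does not attempt to resolve.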
