Application-level observability for adaptive Edge to Cloud continuum systems
Kaddour Sidi, Daniel Balouek (IMT Atlantique - DAPI, STACK), Baptiste Jonglez (LS2N, Inria, STACK)

TL;DR
This paper presents an application-level observability framework for Edge-to-Cloud systems that enables real-time monitoring and autonomous adaptation to meet performance goals in dynamic environments.
Contribution
It introduces an integrated observability framework combining instrumentation and feedback mechanisms for adaptive Edge-to-Cloud applications.
Findings
Improved scalability and fault tolerance.
Enhanced responsiveness to workload changes.
Effective maintenance of performance objectives.
Abstract
Modern Edge-to-Cloud (E2C) systems require fine-grained observability to ensure adaptive behavior and compliance with performance objectives across heterogeneous and dynamic environments. This work introduces an application-level observability framework that integrates developer-driven instrumentation and SLO-aware feedback for autonomous adaptation. By combining OpenTelemetry, Prometheus, K3s, and Chaos Mesh, the framework enables real-time monitoring and adaptive control across the continuum. A video processing use case demonstrates how application-level metrics guide automatic adjustments to maintain target frame rate, latency, and detection accuracy under variable workloads and injected faults. Preliminary results highlight improved scalability, fault tolerance, and responsiveness, providing a practical foundation for adaptive, SLO-compliant E2C applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-Time Systems Scheduling · Software System Performance and Reliability · Distributed systems and fault tolerance
