Offloading tracing for real-time systems using a scalable cloud infrastructure
David Jannis Schmidt, Grigory Fridman, Florian von Zabiensky

TL;DR
This paper introduces a scalable cloud-based architecture for real-time system tracing that enables long-term, collaborative analysis and improves efficiency over traditional local tools.
Contribution
It presents a novel microservices and edge computing architecture for offloading trace processing to the cloud, supporting scalable, long-term, and collaborative analysis of real-time systems.
Findings
Handles many parallel tracing sessions efficiently
Per-session throughput decreases with system load
Overall throughput increases with more sessions
Abstract
Real-time embedded systems require precise timing and fault detection to ensure correct behavior. Traditional tracing tools often rely on local desktops with limited processing and storage capabilities, which hampers large-scale analysis. This paper presents a scalable, cloud-based architecture for software tracing in real-time systems based on microservices and edge computing. Our approach shifts the trace processing workload from the developer's machine to the cloud, using a dedicated tracing component that captures trace data and forwards it to a scalable backend via WebSockets and Apache Kafka. This enables long-term monitoring and collaborative analysis of target executions, e.g., to detect and investigate sporadic errors. We demonstrate how this architecture supports scalable analysis of parallel tracing sessions and lays the foundation for future integration of rule-based testing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
