Architecture Dependent Temporal Observability Under Deployment Interference in Edge Inference Systems
Akul Swami, Nikhil Chougule

TL;DR
This paper demonstrates that deployment interference can corrupt both inference timing and its measurement infrastructure in edge systems, revealing failure modes hidden by software-only timing analysis.
Contribution
It empirically shows that external timing observability can fail independently of software reports, affecting the accuracy of latency measurements in edge inference systems.
Findings
External timing measurements reveal failures not visible in software logs.
Different inference architectures exhibit distinct latency distribution patterns.
Storage stress can cause complete timing measurement failures despite normal software logs.
Abstract
Edge inference systems are typically evaluated with software-reported latency collected under controlled conditions. We argue, and demonstrate empirically, that deployment interference can corrupt not only the inference timing being measured but the timing observability infrastructure that measures it, and that the two failures can occur independently. We pair software-reported timing with externally observable GPIO intervals captured by a Saleae Logic Pro 8 logic analyzer on an NVIDIA Jetson Orin Nano, running MobileNetV2 under two inference architectures (TensorRT FP16 GPU and ONNX Runtime CPU) across baseline, light memory pressure, and storage writeback stress. Across 35 paired capture runs (3500 samples) plus 3 storage-stress runs where external pairing failed (300 software-only samples), we observe three findings the software-only view does not surface. (1) The two architectures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
