Nahida: In-Band Distributed Tracing with eBPF
Wanqi Yang, Pengfei Chen, Kai Liu, Huxing Zhang

TL;DR
Nahida is a non-invasive, eBPF-based distributed tracing system that accurately tracks requests in microservices environments with minimal overhead, improving fault diagnosis without intrusive instrumentation.
Contribution
Introduces Nahida, a novel eBPF-based tracing system that overcomes limitations of existing methods by providing accurate, non-invasive, kernel-level request tracking across diverse microservices.
Findings
Tracks over 92% of requests with stable accuracy
Handles multi-threaded applications effectively
Introduces negligible overhead of 1.55%-2.1% in latency
Abstract
Microservices are commonly used in modern cloud-native applications to achieve agility. However, the complexity of service dependencies in large-scale microservices systems can lead to anomaly propagation, making fault troubleshooting a challenge. To address this issue, distributed tracing systems have been proposed to trace complete request execution paths, enabling developers to troubleshoot anomalous services. However, existing distributed tracing systems have limitations such as invasive instrumentation, trace loss, or inaccurate trace correlation. To overcome these limitations, we propose a new tracing system based on eBPF (extended Berkeley Packet Filter), named Nahida, that can track complete requests in the kernel without intrusion, regardless of programming language or implementation. Our evaluation results show that Nahida can track over 92% of requests with stable accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · IoT and Edge/Fog Computing · Cloud Computing and Resource Management
