HBTM: A Heartbeat-based Behavior Detection Mechanism for POSIX Threads and OpenMP Applications
Weidong Wang, Chunhua Liao, Liqiang Wang, Daniel J. Quinlan, Wei Lu

TL;DR
This paper introduces HBTM, a heartbeat-based detection mechanism for POSIX threads and OpenMP applications, capable of identifying thread behaviors with low latency and minimal overhead on large-scale systems.
Contribution
It proposes a unified, efficient heartbeat-based detection mechanism with centralized and decentralized implementations for runtime behavior monitoring.
Findings
Supports detection of thread failures, busy waiting, and exits.
Achieves near 1% overhead in performance tests.
Provides short latency detection.
Abstract
Extreme-scale computing involves hundreds of millions of threads with multi-level parallelism running on large-scale hierarchical and heterogeneous hardware. In POSIX threads and OpenMP applications, some key behaviors occurring in runtime such as thread failure, busy waiting, and exit need to be accurately and timely detected. However, for the most of these applications, there are lack of unified and efficient detection mechanisms to do this. In this paper, a heartbeat-based behavior detection mechanism for POSIX threads (Pthreads) and OpenMP applications (HBTM) is proposed. In the design, two types of implementations are conducted, centralized and decentralized respectively. In both implementations, unified API has been designed to guarantee the generality of the mechanism. Meanwhile, a ring-based detection algorithm is designed to ease the burden of the centra thread at runtime. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Parallel Computing and Optimization Techniques
