Automatic Detection of Performance Anomalies in Task-Parallel Programs
Andi Drebes, Karine Heydemann, Antoniu Pop, Albert Cohen, Nathalie, Drach

TL;DR
This paper discusses ongoing work to extend the Aftermath tool for automatic detection of performance anomalies in task-parallel programs, aiding developers in identifying bottlenecks efficiently.
Contribution
The paper introduces two extensions to the Aftermath tool that automate the detection of performance issues in task-parallel applications, improving debugging efficiency.
Findings
Enhanced detection of performance bottlenecks
Improved visualization for task-level analysis
Guided debugging process for developers
Abstract
To efficiently exploit the resources of new many-core architectures, integrating dozens or even hundreds of cores per chip, parallel programming models have evolved to expose massive amounts of parallelism, often in the form of fine-grained tasks. Task-parallel languages, such as OpenStream, X10, Habanero Java and C or StarSs, simplify the development of applications for new architectures, but tuning task-parallel applications remains a major challenge. Performance bottlenecks can occur at any level of the implementation, from the algorithmic level (e.g., lack of parallelism or over-synchronization), to interactions with the operating and runtime systems (e.g., data placement on NUMA architectures), to inefficient use of the hardware (e.g., frequent cache misses or misaligned memory accesses); detecting such issues and determining the exact cause is a difficult task. In previous work,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
