Performance Characterization of Multi-threaded Graph Processing Applications on Intel Many-Integrated-Core Architecture
Lei Jiang, Langshi Chen, Judy Qiu

TL;DR
This paper empirically evaluates the performance of multi-threaded graph processing applications on Intel Xeon Phi KNL architecture, comparing it with CPUs and GPUs, and provides insights into architectural impacts and optimization opportunities.
Contribution
It offers a comprehensive performance characterization of KNL for graph processing, highlighting architectural effects and proposing future optimization directions.
Findings
KNL can significantly accelerate multi-threaded graph applications.
Different applications and datasets require tailored thread counts for optimal performance.
Underutilization of AVX512 SIMD units and NUMA effects influence performance.
Abstract
Intel Xeon Phi many-integrated-core (MIC) architectures usher in a new era of terascale integration. Among emerging killer applications, parallel graph processing has been a critical technique to analyze connected data. In this paper, we empirically evaluate various computing platforms including an Intel Xeon E5 CPU, a Nvidia Geforce GTX1070 GPU and an Xeon Phi 7210 processor codenamed Knights Landing (KNL) in the domain of parallel graph processing. We show that the KNL gains encouraging performance when processing graphs, so that it can become a promising solution to accelerating multi-threaded graph applications. We further characterize the impact of KNL architectural enhancements on the performance of a state-of-the art graph framework.We have four key observations: 1 Different graph applications require distinctive numbers of threads to reach the peak performance. For the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
