Performance Characterization of Multi-threaded Graph Processing   Applications on Intel Many-Integrated-Core Architecture

Lei Jiang; Langshi Chen; Judy Qiu

arXiv:1708.04701·cs.DC·February 26, 2019

Performance Characterization of Multi-threaded Graph Processing Applications on Intel Many-Integrated-Core Architecture

Lei Jiang, Langshi Chen, Judy Qiu

PDF

TL;DR

This paper empirically evaluates the performance of multi-threaded graph processing applications on Intel Xeon Phi KNL architecture, comparing it with CPUs and GPUs, and provides insights into architectural impacts and optimization opportunities.

Contribution

It offers a comprehensive performance characterization of KNL for graph processing, highlighting architectural effects and proposing future optimization directions.

Findings

01

KNL can significantly accelerate multi-threaded graph applications.

02

Different applications and datasets require tailored thread counts for optimal performance.

03

Underutilization of AVX512 SIMD units and NUMA effects influence performance.

Abstract

Intel Xeon Phi many-integrated-core (MIC) architectures usher in a new era of terascale integration. Among emerging killer applications, parallel graph processing has been a critical technique to analyze connected data. In this paper, we empirically evaluate various computing platforms including an Intel Xeon E5 CPU, a Nvidia Geforce GTX1070 GPU and an Xeon Phi 7210 processor codenamed Knights Landing (KNL) in the domain of parallel graph processing. We show that the KNL gains encouraging performance when processing graphs, so that it can become a promising solution to accelerating multi-threaded graph applications. We further characterize the impact of KNL architectural enhancements on the performance of a state-of-the art graph framework.We have four key observations: 1 Different graph applications require distinctive numbers of threads to reach the peak performance. For the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.