Study and evaluation of an Irregular Graph Algorithm on Multicore and GPU Processor Architectures
Varun Nagpal

TL;DR
This paper evaluates the performance of an irregular graph algorithm, Triad Census, on multicore and GPU architectures, demonstrating significant speedups and comparing results with supercomputers, highlighting the potential of lower-cost platforms.
Contribution
It provides the first performance evaluation and comparison of Triad Census on multicore and GPGPU platforms, previously only accelerated on supercomputers.
Findings
Maximum speedup of 56x on multicore and 58.4x on GPU.
Performance on multicore surpasses previous supercomputer results.
GPGPU performance is comparable to multicore but slower than supercomputers.
Abstract
One area of Computing applications which poses significant challenge of performance scalability on Chip Multiprocessors(CMP's) are Irregular applications. Such applications have very little computation and unpredictable memory access patterns making them memory-bound in contrast to compute-bound applications. Since the gap between processor and memory performance continues to exist, difficulty to hide and decrease this gap is one of the important factors which results in poor performance of these applications on CMP's. The goal of this thesis is to overcome many such challenges posed during performance acceleration of an irregular graph algorithm called Triad Census. We accelerated the Triad Census algorithm on two significantly different Chip Multiprocessors: Dual-socket Intel Xeon Multicore (8 hardware threads per socket) and 240-processor core NVIDIA Tesla C1060 GPGPU(128 hardware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Network Packet Processing and Optimization · Interconnection Networks and Systems
