
TL;DR
This paper introduces a GPU-based parallel algorithm for counting triangles in large graphs, significantly outperforming CPU methods and enabling rapid analysis of massive networks.
Contribution
The paper presents a novel CUDA GPU algorithm for triangle counting, with detailed implementation and high-performance results.
Findings
8 to 15 times speedup over CPU implementation
Capable of counting 3.8 billion triangles in under 10 seconds
Efficiently processes large graphs with 89 million edges
Abstract
The clustering coefficient and the transitivity ratio are concepts often used in network analysis, which creates a need for fast practical algorithms for counting triangles in large graphs. Previous research in this area focused on sequential algorithms, MapReduce parallelization, and fast approximations. In this paper we propose a parallel triangle counting algorithm for CUDA GPU. We describe the implementation details necessary to achieve high performance and present the experimental evaluation of our approach. Our algorithm achieves 8 to 15 times speedup over the CPU implementation and is capable of finding 3.8 billion triangles in an 89 million edges graph in less than 10 seconds on the Nvidia Tesla C2050 GPU.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
