Efficient Triangle Counting in Large Graphs via Degree-based Vertex Partitioning
Mihail N. Kolountzakis, Gary L. Miller, Richard Peng, Charalampos E., Tsourakakis

TL;DR
This paper introduces a fast, space-efficient triangle counting algorithm for large graphs that combines sampling and degree-based partitioning, adaptable to streaming models, with proven accuracy and practical effectiveness.
Contribution
It presents a novel triangle counting algorithm that integrates sampling with degree-based vertex partitioning, optimized for large-scale and streaming graph data.
Findings
Achieves $O(m + rac{m^{3/2} riangle ext{log}n}{t ext{epsilon}^2})$ runtime
Uses space $O(m^{1/2} ext{log}n + rac{m^{3/2} riangle ext{log}n}{t ext{epsilon}^2})$ in streaming model
Demonstrates excellent results on networks with millions of edges
Abstract
The number of triangles is a computationally expensive graph statistic which is frequently used in complex network analysis (e.g., transitivity ratio), in various random graph models (e.g., exponential random graph model) and in important real world applications such as spam detection, uncovering of the hidden thematic structure of the Web and link recommendation. Counting triangles in graphs with millions and billions of edges requires algorithms which run fast, use small amount of space, provide accurate estimates of the number of triangles and preferably are parallelizable. In this paper we present an efficient triangle counting algorithm which can be adapted to the semistreaming model. The key idea of our algorithm is to combine the sampling algorithm of Tsourakakis et al. and the partitioning of the set of vertices into a high degree and a low degree subset respectively as in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
