Efficient Algorithms for Approximate Triangle Counting
Mostafa Haghir Chehreghani

TL;DR
This paper introduces new randomized algorithms for approximate triangle counting in graphs, offering efficient sampling methods with strong error bounds and applicability to streaming data.
Contribution
The paper presents two novel sampling techniques, $q$-optimal and edge sampling, that improve the efficiency of approximate triangle counting algorithms.
Findings
Algorithms run in $O(sm)$ and $O(sn)$ time with error guarantees.
Provides an $1 extpm \epsilon$ approximation using known bounds on triangle incident edges.
Supports streaming data with minimal passes and space usage.
Abstract
Counting the number of triangles in a graph has many important applications in network analysis. Several frequently computed metrics like the clustering coefficient and the transitivity ratio need to count the number of triangles in the network. Furthermore, triangles are one of the most important graph classes considered in network mining. In this paper, we present a new randomized algorithm for approximate triangle counting. The algorithm can be adopted with different sampling methods and give effective triangle counting methods. In particular, we present two sampling methods, called the \textit{-optimal sampling} and the \textit{edge sampling}, which respectively give and time algorithms with nice error bounds ( and are respectively the number of edges and vertices in the graph and is the number of samples). Among others, we show, for example, that if an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Data Management and Algorithms
