Covering a Graph with Dense Subgraph Families, via Triangle-Rich Sets

Sabyasachi Basu; Daniel Paul-Pena; Kun Qian; C. Seshadhri; Edward W; Huang; Karthik Subbian

arXiv:2407.16850·cs.SI·July 25, 2024

Covering a Graph with Dense Subgraph Families, via Triangle-Rich Sets

Sabyasachi Basu, Daniel Paul-Pena, Kun Qian, C. Seshadhri, Edward W, Huang, Karthik Subbian

PDF

1 Repo

TL;DR

This paper introduces RTRExtractor, an efficient algorithm for discovering large collections of dense, triangle-rich subgraphs in massive graphs, enabling meaningful graph coverage and community detection.

Contribution

It presents a new mathematical formulation of dense subgraph families using triangle-rich sets and provides a provable, scalable algorithm for their discovery.

Findings

01

RTRExtractor processes graphs with hundreds of millions of edges within minutes.

02

It achieves high coverage with dense subgraphs, covering a quarter of vertices in large datasets.

03

The output correlates with meaningful vertex groups in citation networks.

Abstract

Graphs are a fundamental data structure used to represent relationships in domains as diverse as the social sciences, bioinformatics, cybersecurity, the Internet, and more. One of the central observations in network science is that real-world graphs are globally sparse, yet contains numerous "pockets" of high edge density. A fundamental task in graph mining is to discover these dense subgraphs. Most common formulations of the problem involve finding a single (or a few) "optimally" dense subsets. But in most real applications, one does not care for the optimality. Instead, we want to find a large collection of dense subsets that covers a significant fraction of the input graph. We give a mathematical formulation of this problem, using a new definition of regularly triangle-rich (RTR) families. These families capture the notion of dense subgraphs that contain many triangles and have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-science/amazon-RTRExtractor
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.