Mining Large Quasi-cliques with Quality Guarantees from Vertex Neighborhoods
Aritra Konar, and Nicholas D. Sidiropoulos

TL;DR
This paper shows that large, dense subgraphs called quasi-cliques can be efficiently mined from real-world graphs by analyzing vertex neighborhoods, leveraging properties like heavy-tailed degree distributions and high clustering coefficients.
Contribution
It introduces a simple neighborhood-based method for extracting large quasi-cliques, supported by empirical evidence from real-world graphs showing its effectiveness.
Findings
Vertex neighborhoods often contain large maximal cliques.
The best neighborhood density compares favorably to dedicated density-maximization algorithms.
Refinement techniques can grow larger quasi-cliques in graphs with small clustering coefficients.
Abstract
Mining dense subgraphs is an important primitive across a spectrum of graph-mining tasks. In this work, we formally establish that two recurring characteristics of real-world graphs, namely heavy-tailed degree distributions and large clustering coefficients, imply the existence of substantially large vertex neighborhoods with high edge-density. This observation suggests a very simple approach for extracting large quasi-cliques: simply scan the vertex neighborhoods, compute the clustering coefficient of each vertex, and output the best such subgraph. The implementation of such a method requires counting the triangles in a graph, which is a well-studied problem in graph mining. When empirically tested across a number of real-world graphs, this approach reveals a surprise: vertex neighborhoods include maximal cliques of non-trivial sizes, and the density of the best neighborhood often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
