Mining Large Quasi-cliques with Quality Guarantees from Vertex   Neighborhoods

Aritra Konar; and Nicholas D. Sidiropoulos

arXiv:2008.07996·cs.SI·August 19, 2020

Mining Large Quasi-cliques with Quality Guarantees from Vertex Neighborhoods

Aritra Konar, and Nicholas D. Sidiropoulos

PDF

TL;DR

This paper shows that large, dense subgraphs called quasi-cliques can be efficiently mined from real-world graphs by analyzing vertex neighborhoods, leveraging properties like heavy-tailed degree distributions and high clustering coefficients.

Contribution

It introduces a simple neighborhood-based method for extracting large quasi-cliques, supported by empirical evidence from real-world graphs showing its effectiveness.

Findings

01

Vertex neighborhoods often contain large maximal cliques.

02

The best neighborhood density compares favorably to dedicated density-maximization algorithms.

03

Refinement techniques can grow larger quasi-cliques in graphs with small clustering coefficients.

Abstract

Mining dense subgraphs is an important primitive across a spectrum of graph-mining tasks. In this work, we formally establish that two recurring characteristics of real-world graphs, namely heavy-tailed degree distributions and large clustering coefficients, imply the existence of substantially large vertex neighborhoods with high edge-density. This observation suggests a very simple approach for extracting large quasi-cliques: simply scan the vertex neighborhoods, compute the clustering coefficient of each vertex, and output the best such subgraph. The implementation of such a method requires counting the triangles in a graph, which is a well-studied problem in graph mining. When empirically tested across a number of real-world graphs, this approach reveals a surprise: vertex neighborhoods include maximal cliques of non-trivial sizes, and the density of the best neighborhood often…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.