Mining Attribute-structure Correlated Patterns in Large Attributed   Graphs

Arlei Silva; Wagner Meira Jr.; Mohammed J. Zaki

arXiv:1201.6568·cs.DB·February 1, 2012

Mining Attribute-structure Correlated Patterns in Large Attributed Graphs

Arlei Silva, Wagner Meira Jr., Mohammed J. Zaki

PDF

Open Access

TL;DR

This paper introduces a method for mining dense subgraphs in large attributed graphs that are strongly correlated with specific attribute sets, using statistical significance and efficient algorithms to reveal meaningful patterns.

Contribution

It presents a novel approach combining statistical significance measures with an efficient algorithm for structural correlation pattern mining in large attributed graphs.

Findings

01

Identifies relevant attribute-structure correlation patterns in real-world graphs.

02

Provides a scalable algorithm capable of analyzing large attributed networks.

03

Demonstrates the method's effectiveness on collaboration, music, and citation networks.

Abstract

In this work, we study the correlation between attribute sets and the occurrence of dense subgraphs in large attributed graphs, a task we call structural correlation pattern mining. A structural correlation pattern is a dense subgraph induced by a particular attribute set. Existing methods are not able to extract relevant knowledge regarding how vertex attributes interact with dense subgraphs. Structural correlation pattern mining combines aspects of frequent itemset and quasi-clique mining problems. We propose statistical significance measures that compare the structural correlation of attribute sets against their expected values using null models. Moreover, we evaluate the interestingness of structural correlation patterns in terms of size and density. An efficient algorithm that combines search and pruning strategies in the identification of the most relevant structural correlation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Imbalanced Data Classification Techniques