Clustering with Lattices in the Analysis of Graph Patterns
Edgar H. de Graaf, Joost N. Kok, Walter A. Kosters

TL;DR
This paper introduces Lattice2SAR, a framework that leverages lattice structures from graph mining to improve clustering and analysis of frequent subgraph patterns, especially in molecular datasets.
Contribution
The work presents a novel method for integrating lattice information into clustering to efficiently analyze frequent subgraph patterns in molecules.
Findings
Reduces data access by using lattice information for clustering.
Enables direct identification of patterns in molecules.
Improves analysis efficiency of frequent subgraph data.
Abstract
Mining frequent subgraphs is an area of research where we have a given set of graphs (each graph can be seen as a transaction), and we search for (connected) subgraphs contained in many of these graphs. In this work we will discuss techniques used in our framework Lattice2SAR for mining and analysing frequent subgraph data and their corresponding lattice information. Lattice information is provided by the graph mining algorithm gSpan; it contains all supergraph-subgraph relations of the frequent subgraph patterns -- and their supports. Lattice2SAR is in particular used in the analysis of frequent graph patterns where the graphs are molecules and the frequent subgraphs are fragments. In the analysis of fragments one is interested in the molecules where patterns occur. This data can be very extensive and in this paper we focus on a technique of making it better available by using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Advanced Database Systems and Queries · Rough Sets and Fuzzy Logic
