A large-scale and fault-tolerant approach of subgraph mining using density-based partitioning

Sabeur Aridhi; Laurent d'Orazio; Mondher Maddouri; Engelbert Mephu Nguifo

arXiv:1212.0017 · cs.DB · August 24, 2016 · 24 cites

Open Access

TL;DR

This paper presents a scalable, fault-tolerant subgraph mining method using density-based partitioning within the MapReduce framework, significantly reducing execution time on large graph databases.

Contribution

It introduces a novel density-based partitioning technique for subgraph mining that enhances scalability and fault tolerance in large distributed environments.

Findings

01 Decreases execution time significantly

02 Scales subgraph discovery to large databases

03 Balances computational load effectively

Abstract

Graph mining approaches have recently become very popular, especially in domains such as bioinformatics, chemoinformatics and social networks. In this scope, one of the most challenging tasks is frequent subgraph discovery. Interest in this task has been driven by the tremendously increasing size of existing graph databases, which has raised the problem of designing efficient and scalable approaches for frequent subgraph discovery on large clusters. However, failures are the norm rather than the exception in large clusters. In this context, the MapReduce framework was designed so that node failures are handled automatically by the framework. In this paper, we propose a large-scale and fault-tolerant approach to subgraph mining by means of a density-based partitioning technique, using MapReduce. Our partitioning aims to balance the computation load across a collection of machines.…
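
The abstract does not say how density is measured or how partitions are built, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes the standard density of a simple undirected graph, 2|E| / (|V|(|V| - 1)), and a hypothetical round-robin assignment over the density-sorted database, so that each partition receives a similar mix of sparse and dense graphs. The names `density` and `partition_by_density` are illustrative and not taken from the paper, and the MapReduce mining step is not shown.

```python
from typing import List, Set, Tuple

# A graph is a pair (vertices, undirected edges); this representation is an assumption.
Graph = Tuple[Set[int], Set[Tuple[int, int]]]


def density(graph: Graph) -> float:
    """Density of a simple undirected graph: 2|E| / (|V| (|V| - 1))."""
    vertices, edges = graph
    n = len(vertices)
    if n < 2:
        return 0.0  # density is undefined for fewer than two vertices; treat as 0
    return 2.0 * len(edges) / (n * (n - 1))


def partition_by_density(graphs: List[Graph], num_partitions: int) -> List[List[int]]:
    """Return lists of graph indices, one list per partition.

    Hypothetical strategy: sort graphs by density, then deal them out
    round-robin so every partition gets a comparable density profile.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    order = sorted(range(len(graphs)), key=lambda i: density(graphs[i]))
    partitions: List[List[int]] = [[] for _ in range(num_partitions)]
    for rank, index in enumerate(order):
        partitions[rank % num_partitions].append(index)
    return partitions


# Tiny example database of four graphs with distinct densities.
example_graphs: List[Graph] = [
    ({1, 2, 3}, {(1, 2), (2, 3), (1, 3)}),  # triangle, density 1.0
    ({1, 2, 3}, {(1, 2), (2, 3)}),          # path, density 2/3
    ({1, 2, 3, 4}, {(1, 2)}),               # one edge on four vertices, density 1/6
    ({1, 2, 3}, set()),                     # no edges, density 0.0
]
example_partitions = partition_by_density(example_graphs, 2)
```

In a MapReduce setting, each partition produced this way would be handed to one map task. Whether the paper balances partitions in this manner is not stated in the excerpt.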

Taxonomy

Topics: Advanced Database Systems and Queries · Cloud Computing and Resource Management · Graph Theory and Algorithms