Mandolin: A Knowledge Discovery Framework for the Web of Data
Tommaso Soru, Diego Esteves, Edgard Marx, Axel-Cyrille Ngonga Ngomo

TL;DR
Mandolin is a comprehensive framework that automates knowledge discovery on RDF datasets by integrating rule mining, grounding, and inference, demonstrating scalability and competitive link prediction performance.
Contribution
It introduces Mandolin, the first complete workflow for knowledge discovery on RDF data, combining multiple techniques into a scalable, unified system.
Findings
Scales well on large RDF datasets
Achieves comparable link prediction accuracy
Integrates multiple knowledge discovery steps
Abstract
Markov Logic Networks join probabilistic modeling with first-order logic and have been shown to integrate well with the Semantic Web foundations. While several approaches have been devised to tackle the subproblems of rule mining, grounding, and inference, no comprehensive workflow has been proposed so far. In this paper, we fill this gap by introducing a framework called Mandolin, which implements a workflow for knowledge discovery specifically on RDF datasets. Our framework imports knowledge from referenced graphs, creates similarity relationships among similar literals, and relies on state-of-the-art techniques for rule mining, grounding, and inference computation. We show that our best configuration scales well and achieves at least comparable results with respect to other statistical-relational-learning algorithms on link prediction.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies · Advanced Graph Neural Networks
