Unlocking Advanced Graph Machine Learning Insights through Knowledge Completion on Neo4j Graph Database
Rosario Napoli, Antonio Celesti, Massimo Villari, Maria Fazio

TL;DR
This paper introduces a novel architecture integrating Knowledge Completion into Graph Database-based Machine Learning, revealing hidden knowledge to improve data analysis and model performance on complex interconnected datasets.
Contribution
It proposes a scalable transitive relationship model with decay functions to enable deterministic knowledge flows, enhancing GML applications with knowledge completion capabilities.
Findings
Reshapes dataset topology and dynamics
Improves GML model performance
Highlights importance of hidden knowledge
Abstract
Graph Machine Learning (GML) with Graph Databases (GDBs) has gained significant relevance in recent years, due to its ability to handle complex interconnected data and apply ML techniques using Graph Data Science (GDS). However, a critical gap exists in the current way GDB-GML applications analyze data, especially in terms of Knowledge Completion (KC) in Knowledge Graphs (KGs). In particular, current architectures ignore KC, working on datasets that appear incomplete or fragmented, despite they actually contain valuable hidden knowledge. This limitation may cause wrong interpretations when these data are used as input for GML models. This paper proposes an innovative architecture that integrates a KC phase into GDB-GML applications, demonstrating how revealing hidden knowledge can heavily impact datasets' behavior and metrics. For this purpose, we introduce scalable transitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
