PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Nicolas Hubert, Pierre Monnin, Mathieu d'Aquin, Davy Monticolo,, Armelle Brun

TL;DR
PyGraft is a Python tool that generates customizable, logically consistent synthetic schemas and knowledge graphs to enhance benchmarking and evaluation of graph-based machine learning models across diverse domains.
Contribution
The paper introduces PyGraft, a novel tool for creating highly customizable, domain-agnostic synthetic schemas and KGs with logical consistency, addressing limitations of existing datasets for benchmarking.
Findings
Generates schemas with RDFS and OWL constructs
Produces KGs that emulate real-world characteristics and scale
Ensures logical consistency with a DL reasoner
Abstract
Knowledge graphs (KGs) have emerged as a prominent data representation and management paradigm. Being usually underpinned by a schema (e.g., an ontology), KGs capture not only factual information but also contextual knowledge. In some tasks, a few KGs established themselves as standard benchmarks. However, recent works outline that relying on a limited collection of datasets is not sufficient to assess the generalization capability of an approach. In some data-sensitive fields such as education or medicine, access to public datasets is even more limited. To remedy the aforementioned issues, we release PyGraft, a Python-based tool that generates highly customized, domain-agnostic schemas and KGs. The synthesized schemas encompass various RDFS and OWL constructs, while the synthesized KGs emulate the characteristics and scale of real-world KGs. Logical consistency of the generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Data Quality and Management · Advanced Graph Neural Networks
