TL;DR
RDFGraphGen is an open-source tool that generates synthetic RDF graphs based on SHACL shapes, enabling customizable and scalable datasets for testing RDF applications across domains.
Contribution
It introduces a domain-agnostic RDF graph generator that uses SHACL constraints for flexible, realistic, and scalable synthetic data creation.
Findings
Scalable generation of small, medium, and large RDF graphs
Supports configurable graph structures and value constraints
Includes predefined values for schema.org classes
Abstract
Developing and testing modern RDF-based applications often requires access to RDF datasets with certain characteristics. Unfortunately, it is very difficult to publicly find domain-specific knowledge graphs that conform to a particular set of characteristics. Hence, in this paper we propose RDFGraphGen, an open-source RDF graph generator that uses characteristics provided in the form of SHACL (Shapes Constraint Language) shapes to generate synthetic RDF graphs. RDFGraphGen is domain-agnostic, with configurable graph structure, value constraints, and distributions. It also comes with a number of predefined values for popular schema.org classes and properties, for more realistic graphs. Our results show that RDFGraphGen is scalable and can generate small, medium, and large RDF graphs in any domain.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
