Detecting Data Type Inconsistencies in a Property Graph Database
Joshua R. Porter, Michael N. Young, Aleks Y. M. Ontman

TL;DR
This paper introduces a tool for detecting data type inconsistencies in property graph databases, enabling rapid analysis and ensuring data integrity during large-scale data imports.
Contribution
The paper presents a novel tool that efficiently identifies data type inconsistencies in schema-less property graph databases, aiding debugging and data quality enforcement.
Findings
Enabled debugging of complex ETL processes
Achieved 100% data type consistency in large knowledge graphs
Provided detailed reports on property data types
Abstract
Some property graph databases do not have a fixed schema, which can result in data type inconsistencies for properties on nodes and relationships, especially when importing data into a running database. Here we present a tool which can rapidly produce a detailed report on every property in the graph. When executed on a large knowledge graph, it allowed us to debug a complex ETL process and enforce 100% data type consistency.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Semantic Web and Ontologies · Advanced Database Systems and Queries
