The Fifth Graph Normal Form (5GNF): A Trait-Based Framework for Metadata Normalization in Property Graphs
Yahya Sa'd, Vojtech Merunka, Renzo Angles

TL;DR
The paper proposes 5GNF, a trait-based normalization framework for property graphs that reduces redundancy, enhances semantic clarity, and improves maintainability by externalizing recurring metadata as shared traits.
Contribution
It introduces the 5GNF normalization framework, formalizes trait functional dependencies, and provides an algorithm for extracting reusable traits in property graphs.
Findings
Reduces redundant metadata in property graphs.
Maintains competitive query performance after normalization.
Improves semantic clarity and reusability of graph schemas.
Abstract
Graph databases are widely used in systems that manage rich metadata, yet current modelling practices often embed descriptive attributes directly in nodes, leading to redundancy and inconsistent semantics. This paper introduces the Fifth Graph Normal Form (5GNF), a trait-based normalization framework for property graphs that represents recurring metadata as canonical Trait Nodes connected through HAS_TRAIT relationships. We formalize trait functional dependencies (tFDs) and present the TraitExtraction5GNF algorithm for identifying and extracting reusable traits. The approach is implemented in Neo4j and evaluated using the widely used Northwind dataset, which contains substantial duplication in location and shipping metadata. The normalization process externalizes recurring metadata into shared traits, removes thousands of redundant attribute instances, reduces schema complexity, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Semantic Web and Ontologies · Advanced Graph Neural Networks
