Exploring Structural Complexity in Normative RAG with Graph-based approaches: A case study on the ETSI Standards
Aiman Al Masoud, Marco Arazzi, Simone Germani, Antonino Nocera

TL;DR
This paper evaluates graph-based Retrieval-Augmented Generation techniques tailored for processing complex normative standards, demonstrating that incorporating structural information improves retrieval performance in standards like ETSI.
Contribution
It introduces a specialized graph RAG methodology for normative documents and empirically compares various indexing strategies to enhance retrieval accuracy.
Findings
Structural and lexical embedding improves retrieval performance.
Graph RAG approaches outperform vanilla retrieval methods.
Evaluation on ETSI standards shows scalable benefits.
Abstract
Industrial standards and normative documents exhibit intricate hierarchical structures, domain-specific lexicons, and extensive cross-referential dependencies, which making it challenging to process them directly by Large Language Models (LLMs). While Retrieval-Augmented Generation (RAG) provides a computationally efficient alternative to LLM fine-tuning, standard "vanilla" vector-based retrieval may fail to capture the latent structural and relational features intrinsic in normative documents. With the objective of shedding light on the most promising technique for building high-performance RAG solutions for normative, standards, and regulatory documents, this paper investigates the efficacy of Graph RAG architectures, which represent information as interconnected nodes, thus moving from simple semantic similarity toward a more robust, relation-aware retrieval mechanism. Despite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
