Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms
Francesco Granata, Francesco Poggi, Misael Mongiov\`i

TL;DR
This paper introduces ELERAG, an improved retrieval-augmented generation system for educational question-answering in Italian, integrating entity linking and hybrid re-ranking to enhance factual accuracy in domain-specific contexts.
Contribution
The study presents a novel hybrid RAG architecture that combines entity linking and re-ranking strategies, demonstrating improved factual accuracy over standard methods in educational domains.
Findings
ELERAG outperforms baseline models in domain-specific datasets.
Cross-Encoder methods excel in general-domain datasets.
Hybrid strategies effectively address domain mismatch issues.
Abstract
In the era of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) architectures are gaining significant attention for their ability to ground language generation in reliable knowledge sources. Despite their effectiveness, RAG systems based solely on semantic similarity often fail to ensure factual accuracy in specialized domains, where terminological ambiguity can affect retrieval relevance. This study proposes ELERAG, an enhanced RAG architecture that integrates a factual signal derived from Entity Linking to improve the accuracy of educational question-answering systems in Italian. The system includes a Wikidata-based Entity Linking module and implements a hybrid re-ranking strategy based on Reciprocal Rank Fusion (RRF). To validate our approach, we compared it against standard baselines and state-of-the-art methods, including a Weighted-Score Re-ranking, a standalone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
