KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education?
Tianshi Zheng, Weihan Li, Jiaxin Bai, Weiqi Wang, Yangqiu Song

TL;DR
This paper evaluates the robustness of Retrieval-Augmented Generation systems in K-12 education when textbook knowledge shifts, revealing significant performance drops and challenges in integrating contextual and parametric knowledge.
Contribution
It introduces KnowShiftQA, a novel dataset simulating textbook knowledge shifts to systematically assess RAG system robustness in educational contexts.
Findings
Most RAG systems experience performance decline with knowledge discrepancies.
Questions requiring integration of textbook and LLM knowledge are particularly challenging.
The study highlights the need for more robust RAG systems in dynamic knowledge environments.
Abstract
Retrieval-Augmented Generation (RAG) systems show remarkable potential as question answering tools in the K-12 Education domain, where knowledge is typically queried within the restricted scope of authoritative textbooks. However, discrepancies between these textbooks and the parametric knowledge inherent in Large Language Models (LLMs) can undermine the effectiveness of RAG systems. To systematically investigate RAG system robustness against such knowledge discrepancies, we introduce KnowShiftQA. This novel question answering dataset simulates these discrepancies by applying deliberate hypothetical knowledge updates to both answers and source documents, reflecting how textbook knowledge can shift. KnowShiftQA comprises 3,005 questions across five subjects, designed with a comprehensive question typology focusing on context utilization and knowledge integration. Our extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Educational Strategies and Epistemologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · Dense Connections · Byte Pair Encoding · Residual Connection · Multi-Head Attention · Weight Decay · WordPiece · Softmax
