PustakAI: Curriculum-Aligned and Interactive Textbooks Using Large Language Models
Shivam Sharma (1), Riya Naik (1), Tejas Gawas (1), Heramb Patil (1), Kunal Korgaonkar (1) ((1) CSIS Department, BITS Pilani K K Birla Goa Campus, India)

TL;DR
This paper introduces PustakAI, a curriculum-aligned question-answering dataset for Indian school textbooks, evaluating various large language models' effectiveness in educational contexts.
Contribution
It presents a novel dataset aligned with the NCERT curriculum and analyzes the performance of multiple open-source and high-end LLMs in educational applications.
Findings
Open-source LLMs show limitations in curriculum-specific tasks.
High-end LLMs perform better but still face challenges in accuracy and pedagogical relevance.
Prompting techniques influence model alignment with curriculum demands.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities in understanding and generating human-like content. This has revolutionized various sectors such as healthcare, software development, and education. In education, LLMs offer potential for personalized and interactive learning experiences, especially in regions with limited teaching resources. However, adapting these models effectively to curriculum-specific content, such as the National Council of Educational Research and Training (NCERT) syllabus in India, presents unique challenges in terms of accuracy, alignment, and pedagogical relevance. In this paper, we present the framework "PustakAI"\footnote{Pustak means `book' in many Indian languages.} for the design and evaluation of a novel question-answering dataset "NCERT-QA" aligned with the NCERT curriculum for English and Science subjects of grades 6 to 8. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Computational and Text Analysis Methods
