CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization
Zheyan Qu, Lu Yin, Zitong Yu, Wenbo Wang, Xing zhang

TL;DR
CourseGPT-zh is a specialized educational large language model developed through knowledge distillation and prompt optimization, enabling effective, customizable, and low-cost course-specific NLP applications with improved response quality.
Contribution
The paper introduces a novel framework for course-specific LLM training using knowledge distillation and discrete prompt optimization, enhancing response quality and domain specialization.
Findings
CourseGPT-zh outperforms comparable open-source models in specialized knowledge QA.
Prompt optimization effectively improves ChatGPT response quality.
The framework enables low-cost, customizable deployment of educational LLMs.
Abstract
Large language models (LLMs) have demonstrated astonishing capabilities in natural language processing (NLP) tasks, sparking interest in their application to professional domains with higher specialized requirements. However, restricted access to closed-source LLMs via APIs and the difficulty in collecting massive high-quality datasets pose obstacles to the development of large language models in education fields of various courses. Given these challenges, we propose CourseGPT-zh, a course-oriented education LLM that supports customization and low-cost deployment. To address the comprehensiveness and diversity requirements of course-specific corpora, we design a high-quality question-answering corpus distillation framework incorporating prompt optimization, which effectively mines textbook knowledge and enhances its diversity. Moreover, considering the alignment of LLM responses with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Educational Technology and Assessment · Topic Modeling
