TL;DR
This paper introduces EduAlign, a framework that uses a multi-dimensional reward model to fine-tune large language models, making them more helpful, personalized, and creative as educational tutors.
Contribution
The paper presents a novel multi-dimensional reward model and a fine-tuning process to align LLMs with pedagogical principles, enhancing educational effectiveness.
Findings
Fine-tuned models show improved alignment with educational principles.
The reward model reliably scores LLM outputs on helpfulness, personalization, and creativity.
Experimental results demonstrate significant improvements over baseline models.
Abstract
The integration of large language models (LLMs) into education presents unprecedented opportunities for scalable personalized learning. However, standard LLMs often function as generic information providers, lacking alignment with fundamental pedagogical principles such as helpfulness, student-centered personalization, and creativity cultivation. To bridge this gap, we propose EduAlign, a novel framework designed to guide LLMs toward becoming more effective and responsible educational assistants. EduAlign consists of two main stages. In the first stage, we curate a dataset of 8k educational interactions and annotate them-both manually and automatically-along three key educational dimensions: Helpfulness, Personalization, and Creativity (HPC). These annotations are used to train HPC-RM, a multi-dimensional reward model capable of accurately scoring LLM outputs according to these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗sii-research/InnoSpark-72B-0710model· 17 dl· ♡ 617 dl♡ 6
- 🤗sii-research/InnoSpark-7B-0715model· 6 dl· ♡ 26 dl♡ 2
- 🤗sii-research/InnoSpark-0.5B-0717model· 6 dl· ♡ 16 dl♡ 1
- 🤗sii-research/InnoSpark-HPC-RM-32Bmodel· 8 dl· ♡ 28 dl♡ 2
- 🤗sii-research/InnoSpark-R-72B-0701model· 7 dl· ♡ 37 dl♡ 3
- 🤗sii-research/InnoSpark-72B-1124model· 5 dl5 dl
- 🤗sii-research/InnoSpark-72B-1224model· 3 dl· ♡ 23 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
