Aligning Teacher with Student Preferences for Tailored Training Data Generation
Yantao Liu, Zhao Zhang, Zijun Yao, Shulin Cao, Lei Hou and, Juanzi Li

TL;DR
ARTE is a novel framework that aligns teacher models with student preferences to generate personalized training data, improving knowledge distillation for deploying lightweight LLMs on edge devices.
Contribution
This paper introduces ARTE, a new method for tailoring training data by aligning teacher models with student preferences, enhancing model distillation and personalization.
Findings
ARTE outperforms existing instruction-tuning datasets on academic benchmarks.
Aligned teacher models generate more personalized and effective training examples.
The framework generalizes well across different tasks and student profiles.
Abstract
Large Language Models (LLMs) have shown significant promise as copilots in various tasks. Local deployment of LLMs on edge devices is necessary when handling privacy-sensitive data or latency-sensitive tasks. The computational constraints of such devices make direct deployment of powerful large-scale LLMs impractical, necessitating the Knowledge Distillation from large-scale models to lightweight models. Lots of work has been done to elicit diversity and quality training examples from LLMs, but little attention has been paid to aligning teacher instructional content based on student preferences, akin to "responsive teaching" in pedagogy. Thus, we propose ARTE, dubbed Aligning TeacheR with StudenT PreferencEs, a framework that aligns the teacher model with student preferences to generate tailored training examples for Knowledge Distillation. Specifically, we elicit draft questions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Assessment and Improvement
MethodsSoftmax · Attention Is All You Need · ALIGN · Knowledge Distillation
