Aligning Teacher with Student Preferences for Tailored Training Data   Generation

Yantao Liu; Zhao Zhang; Zijun Yao; Shulin Cao; Lei Hou and; Juanzi Li

arXiv:2406.19227·cs.CL·June 28, 2024

Aligning Teacher with Student Preferences for Tailored Training Data Generation

Yantao Liu, Zhao Zhang, Zijun Yao, Shulin Cao, Lei Hou and, Juanzi Li

PDF

Open Access

TL;DR

ARTE is a novel framework that aligns teacher models with student preferences to generate personalized training data, improving knowledge distillation for deploying lightweight LLMs on edge devices.

Contribution

This paper introduces ARTE, a new method for tailoring training data by aligning teacher models with student preferences, enhancing model distillation and personalization.

Findings

01

ARTE outperforms existing instruction-tuning datasets on academic benchmarks.

02

Aligned teacher models generate more personalized and effective training examples.

03

The framework generalizes well across different tasks and student profiles.

Abstract

Large Language Models (LLMs) have shown significant promise as copilots in various tasks. Local deployment of LLMs on edge devices is necessary when handling privacy-sensitive data or latency-sensitive tasks. The computational constraints of such devices make direct deployment of powerful large-scale LLMs impractical, necessitating the Knowledge Distillation from large-scale models to lightweight models. Lots of work has been done to elicit diversity and quality training examples from LLMs, but little attention has been paid to aligning teacher instructional content based on student preferences, akin to "responsive teaching" in pedagogy. Thus, we propose ARTE, dubbed Aligning TeacheR with StudenT PreferencEs, a framework that aligns the teacher model with student preferences to generate tailored training examples for Knowledge Distillation. Specifically, we elicit draft questions and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Assessment and Improvement

MethodsSoftmax · Attention Is All You Need · ALIGN · Knowledge Distillation