Letting Tutor Personas "Speak Up" for LLMs: Learning Steering Vectors from Dialogue via Preference Optimization
Jaewook Lee, Alexander Scarlatos, Simon Woodhead, Andrew Lan

TL;DR
This paper introduces a method to encode diverse tutor personas into large language models by learning steering vectors from dialogue data, enabling more personalized and interpretable tutoring interactions without explicit prompts.
Contribution
It proposes a novel approach using preference optimization to learn activation-space directions that control tutor-specific behaviors in LLMs, capturing pedagogical diversity.
Findings
Steering vectors improve semantic alignment with ground-truth tutor responses.
The method increases preference-based evaluation scores.
Learned directions reveal interpretable differences in tutoring styles.
Abstract
With the emergence of large language models (LLMs) as a powerful class of generative artificial intelligence (AI), their use in tutoring has become increasingly prominent. Prior works on LLM-based tutoring typically learn a single tutor policy and do not capture the diversity of tutoring styles. In real-world tutor-student interactions, pedagogical intent is realized through adaptive instructional strategies, with tutors varying the level of scaffolding, instructional directiveness, feedback, and affective support in response to learners' needs. These differences can all impact dialogue dynamics and student engagement. In this paper, we explore how tutor personas embedded in human tutor-student dialogues can be used to guide LLM behavior without relying on explicitly prompted instructions. We modify Bidirectional Preference Optimization (BiPO) to learn a steering vector, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Intelligent Tutoring Systems and Adaptive Learning · AI in Service Interactions
