Learning to Learn from Language Feedback with Social Meta-Learning
Jonathan Cook, Diego Antognini, Martin Klissarov, Claudiu Musat, Edward Grefenstette

TL;DR
This paper introduces a social meta-learning approach to train large language models to proactively solicit and learn from language feedback, improving their adaptability and problem-solving in conversational contexts.
Contribution
It presents a novel finetuning methodology inspired by social meta-learning, enabling LLMs to better utilize feedback and handle ambiguous, multi-turn tasks across domains.
Findings
Models trained with SML ask for less unnecessary information.
SML-trained models outperform baselines in multi-turn problem solving.
Improved generalization across different problem domains.
Abstract
Large language models (LLMs) often struggle to learn from corrective feedback within a conversational context. They are rarely proactive in soliciting this feedback, even when faced with ambiguity, which can make their dialogues feel static, one-sided, and lacking the adaptive qualities of human conversation. To address these limitations, we draw inspiration from social meta-learning (SML) in humans - the process of learning how to learn from others. We formulate SML as a finetuning methodology, training LLMs to solicit and learn from language feedback in simulated pedagogical dialogues, where static tasks are converted into interactive social learning problems. SML effectively teaches models to use conversation to solve problems they are unable to solve in a single turn. This capability generalises across domains; SML on math problems produces models that better use feedback to solve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Innovative Teaching and Learning Methods
