Teaching to Learn: Sequential Teaching of Agents with Inner States
Mustafa Mert Celikok, Pierre-Alexandre Murena, Samuel Kaski

TL;DR
This paper extends sequential machine teaching to learners with changing inner states, proposing an optimal control approach to improve teaching strategies for meta-learning and distinguishing manipulative from educational teaching.
Contribution
It introduces a multi-agent framework with learners' inner states, and develops an optimal control method for teaching learners that adapt their learning algorithms.
Findings
Modeling learners with inner states affects future learning performance.
Optimal control approach improves teaching strategies for meta-learning.
Distinguishes manipulative from educational teaching methods.
Abstract
In sequential machine teaching, a teacher's objective is to provide the optimal sequence of inputs to sequential learners in order to guide them towards the best model. In this paper we extend this setting from current static one-data-set analyses to learners which change their learning algorithm or latent state to improve during learning, and to generalize to new datasets. We introduce a multi-agent formulation in which learners' inner state may change with the teaching interaction, which affects the learning performance in future tasks. In order to teach such learners, we propose an optimal control approach that takes the future performance of the learner after teaching into account. This provides tools for modelling learners having inner states, and machine teaching of meta-learning algorithms. Furthermore, we distinguish manipulative teaching, which can be done by effectively hiding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Optimization and Search Problems
