The intrinsic motivation of reinforcement and imitation learning for sequential tasks
Sao Mai Nguyen

TL;DR
This paper introduces a unified intrinsic motivation framework for reinforcement and imitation learning, enabling autonomous curriculum design and active tutoring requests to improve learning efficiency in sequential tasks.
Contribution
It proposes a novel formulation of intrinsic motivation based on empirical progress, allowing learners to actively select learning strategies and request tutoring, enhancing robustness and speed.
Findings
Learners can autonomously choose between exploration and imitation.
Active tutoring requests improve learning speed and robustness.
Framework supports emergence of symbolic task representations.
Abstract
This work in the field of developmental cognitive robotics aims to devise a new domain bridging between reinforcement learning and imitation learning, with a model of the intrinsic motivation for learning agents to learn with guidance from tutors multiple tasks, including sequential tasks. The main contribution has been to propose a common formulation of intrinsic motivation based on empirical progress for a learning agent to choose automatically its learning curriculum by actively choosing its learning strategy for simple or sequential tasks: which task to learn, between autonomous exploration or imitation learning, between low-level actions or task decomposition, between several tutors. The originality is to design a learner that benefits not only passively from data provided by tutors, but to actively choose when to request tutoring and what and whom to ask. The learner is thus more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
