The intrinsic motivation of reinforcement and imitation learning for   sequential tasks

Sao Mai Nguyen

arXiv:2412.20573·cs.AI·December 31, 2024

The intrinsic motivation of reinforcement and imitation learning for sequential tasks

Sao Mai Nguyen

PDF

Open Access

TL;DR

This paper introduces a unified intrinsic motivation framework for reinforcement and imitation learning, enabling autonomous curriculum design and active tutoring requests to improve learning efficiency in sequential tasks.

Contribution

It proposes a novel formulation of intrinsic motivation based on empirical progress, allowing learners to actively select learning strategies and request tutoring, enhancing robustness and speed.

Findings

01

Learners can autonomously choose between exploration and imitation.

02

Active tutoring requests improve learning speed and robustness.

03

Framework supports emergence of symbolic task representations.

Abstract

This work in the field of developmental cognitive robotics aims to devise a new domain bridging between reinforcement learning and imitation learning, with a model of the intrinsic motivation for learning agents to learn with guidance from tutors multiple tasks, including sequential tasks. The main contribution has been to propose a common formulation of intrinsic motivation based on empirical progress for a learning agent to choose automatically its learning curriculum by actively choosing its learning strategy for simple or sequential tasks: which task to learn, between autonomous exploration or imitation learning, between low-level actions or task decomposition, between several tutors. The originality is to design a learner that benefits not only passively from data provided by tutors, but to actively choose when to request tutoring and what and whom to ask. The learner is thus more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics