Learning to Solve Tasks with Exploring Prior Behaviours
Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Celiktutan

TL;DR
This paper introduces IRDEC, a method enabling agents to learn necessary prior behaviors through exploration, improving task-solving in sparse reward scenarios without extra demonstrations of those behaviors.
Contribution
The paper presents IRDEC, a novel approach that allows agents to acquire prior behaviors via intrinsic rewards, enhancing their ability to solve tasks with varied initial conditions.
Findings
IRDEC outperforms baselines on navigation tasks.
IRDEC effectively learns prior behaviors without additional demonstrations.
The method improves success rates in robotic manipulation with sparse rewards.
Abstract
Demonstrations are widely used in Deep Reinforcement Learning (DRL) for facilitating solving tasks with sparse rewards. However, the tasks in real-world scenarios can often have varied initial conditions from the demonstration, which would require additional prior behaviours. For example, consider we are given the demonstration for the task of \emph{picking up an object from an open drawer}, but the drawer is closed in the training. Without acquiring the prior behaviours of opening the drawer, the robot is unlikely to solve the task. To address this, in this paper we propose an Intrinsic Rewards Driven Example-based Control \textbf{(IRDEC)}. Our method can endow agents with the ability to explore and acquire the required prior behaviours and then connect to the task-specific behaviours in the demonstration to solve sparse-reward tasks without requiring additional demonstration of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning
