Reward Learning using Structural Motifs in Inverse Reinforcement   Learning

Raeid Saqur

arXiv:2209.13489·cs.LG·September 28, 2022·1 cites

Reward Learning using Structural Motifs in Inverse Reinforcement Learning

Raeid Saqur

PDF

Open Access

TL;DR

This paper introduces SMIRL, a novel IRL method that leverages structural motifs like finite-state automata to improve reward learning in complex, long-horizon tasks, outperforming existing methods in various environments.

Contribution

The paper proposes SMIRL, which incorporates task structure via automata into IRL, enhancing learning efficiency and success in complex tasks compared to baseline methods.

Findings

01

Successfully learns complex tasks where baselines fail

02

Outperforms baselines in sample efficiency on simpler tasks

03

Shows promising results in continuous domains with compositional rewards

Abstract

The Inverse Reinforcement Learning (\textit{IRL}) problem has seen rapid evolution in the past few years, with important applications in domains like robotics, cognition, and health. In this work, we explore the inefficacy of current IRL methods in learning an agent's reward function from expert trajectories depicting long-horizon, complex sequential tasks. We hypothesize that imbuing IRL models with structural motifs capturing underlying tasks can enable and enhance their performance. Subsequently, we propose a novel IRL method, SMIRL, that first learns the (approximate) structure of a task as a finite-state-automaton (FSA), then uses the structural motif to solve the IRL problem. We test our model on both discrete grid world and high-dimensional continuous domain environments. We empirically show that our proposed approach successfully learns all four complex tasks, where two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning in Materials Science · Machine Learning and Algorithms

MethodsTest