SHIRE: Enhancing Sample Efficiency using Human Intuition in   REinforcement Learning

Amogh Joshi; Adarsh Kumar Kosta; Kaushik Roy

arXiv:2409.09990·cs.LG·April 29, 2025

SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning

Amogh Joshi, Adarsh Kumar Kosta, Kaushik Roy

PDF

Open Access

TL;DR

SHIRE integrates human intuition into deep reinforcement learning via probabilistic graphical models, significantly improving sample efficiency and policy explainability in robotic tasks with minimal overhead.

Contribution

This work introduces SHIRE, a novel framework that encodes human intuition into RL training, enhancing sample efficiency and explainability in robotic applications.

Findings

01

Achieved 25-78% sample efficiency gains across evaluated environments.

02

Enhanced policy explainability through encoded elementary behaviors.

03

Demonstrated real-world applicability with a practical demonstration.

Abstract

The ability of neural networks to perform robotic perception and control tasks such as depth and optical flow estimation, simultaneous localization and mapping (SLAM), and automatic control has led to their widespread adoption in recent years. Deep Reinforcement Learning has been used extensively in these settings, as it does not have the unsustainable training costs associated with supervised learning. However, DeepRL suffers from poor sample efficiency, i.e., it requires a large number of environmental interactions to converge to an acceptable solution. Modern RL algorithms such as Deep Q Learning and Soft Actor-Critic attempt to remedy this shortcoming but can not provide the explainability required in applications such as autonomous robotics. Humans intuitively understand the long-time-horizon sequential tasks common in robotics. Properly using such intuition can make RL policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics