SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
Amogh Joshi, Adarsh Kumar Kosta, Kaushik Roy

TL;DR
SHIRE integrates human intuition into deep reinforcement learning via probabilistic graphical models, significantly improving sample efficiency and policy explainability in robotic tasks with minimal overhead.
Contribution
This work introduces SHIRE, a novel framework that encodes human intuition into RL training, enhancing sample efficiency and explainability in robotic applications.
Findings
Achieved 25-78% sample efficiency gains across evaluated environments.
Enhanced policy explainability through encoded elementary behaviors.
Demonstrated real-world applicability with a practical demonstration.
Abstract
The ability of neural networks to perform robotic perception and control tasks such as depth and optical flow estimation, simultaneous localization and mapping (SLAM), and automatic control has led to their widespread adoption in recent years. Deep Reinforcement Learning has been used extensively in these settings, as it does not have the unsustainable training costs associated with supervised learning. However, DeepRL suffers from poor sample efficiency, i.e., it requires a large number of environmental interactions to converge to an acceptable solution. Modern RL algorithms such as Deep Q Learning and Soft Actor-Critic attempt to remedy this shortcoming but can not provide the explainability required in applications such as autonomous robotics. Humans intuitively understand the long-time-horizon sequential tasks common in robotics. Properly using such intuition can make RL policies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
