TL;DR
This paper introduces a deep inverse reinforcement learning approach that combines exteroceptive and proprioceptive data to model terrain traversability for legged robots, leading to more energy-efficient navigation policies.
Contribution
It proposes a novel method integrating robot-specific inertial features into reward modeling, improving terrain traversability predictions and energy efficiency in legged robots.
Findings
Enhanced reward modeling with inertial features improves terrain assessment.
Energy-aware policies outperform demonstration in energy consumption.
Method validated on MIT Mini-Cheetah robot and simulator.
Abstract
This work reports on developing a deep inverse reinforcement learning method for legged robots terrain traversability modeling that incorporates both exteroceptive and proprioceptive sensory data. Existing works use robot-agnostic exteroceptive environmental features or handcrafted kinematic features; instead, we propose to also learn robot-specific inertial features from proprioceptive sensory data for reward approximation in a single deep neural network. Incorporating the inertial features can improve the model fidelity and provide a reward that depends on the robot's state during deployment. We train the reward network using the Maximum Entropy Deep Inverse Reinforcement Learning (MEDIRL) algorithm and propose simultaneously minimizing a trajectory ranking loss to deal with the suboptimality of legged robot demonstrations. The demonstrated trajectories are ranked by locomotion energy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
