Feasibility-aware Imitation Learning from Observation with Multimodal Feedback
Kei Takahashi, Hikaru Sasaki, Takamitsu Matsubara

TL;DR
This paper introduces FABCO, a feasibility-aware imitation learning method that uses multimodal feedback and robot dynamics models to learn robust robot control policies from observation, overcoming demonstrator-robot differences.
Contribution
FABCO is the first approach to integrate feasibility estimation with multimodal feedback for imitation learning from observation, improving policy robustness and feasibility.
Findings
FABCO improves imitation learning performance by over 3.2 times.
Feasibility feedback reduces infeasible demonstrations and enhances policy stability.
Multimodal feedback from visual and haptic senses promotes feasible demonstrations.
Abstract
Imitation learning frameworks that learn robot control policies from demonstrators' motions via hand-mounted demonstration interfaces have attracted increasing attention. However, due to differences in physical characteristics between demonstrators and robots, this approach faces two limitations: i) the demonstration data do not include robot actions, and ii) the demonstrated motions may be infeasible for robots. These limitations make policy learning difficult. To address them, we propose Feasibility-Aware Behavior Cloning from Observation (FABCO). FABCO integrates behavior cloning from observation, which complements robot actions using robot dynamics models, with feasibility estimation. In feasibility estimation, the demonstrated motions are evaluated using a robot-dynamics model, learned from the robot's execution data, to assess reproducibility under the robot's dynamics. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI
