An Incremental Inverse Reinforcement Learning Approach for Motion Planning with Separated Path and Velocity Preferences
Armin Avaei, Linda van der Spaa, Luka Peternel, Jens Kober

TL;DR
This paper presents an incremental inverse reinforcement learning method that separately learns and optimizes path and velocity preferences for robotic motion planning, enabling robots to adapt to individual user preferences efficiently.
Contribution
It introduces a novel approach to separately learn and optimize path and velocity preferences from demonstrations for improved robot trajectory planning.
Findings
The method successfully generalizes preferences to new scenarios.
Non-expert users can teach preferences with few feedback iterations.
Implemented on a 7-DoF robot arm demonstrating practical applicability.
Abstract
Humans often demonstrate diverse behaviors due to their personal preferences, for instance, related to their individual execution style or personal margin for safety. In this paper, we consider the problem of integrating both path and velocity preferences into trajectory planning for robotic manipulators. We first learn reward functions that represent the user path and velocity preferences from kinesthetic demonstration. We then optimize the trajectory in two steps: first the path and then the velocity, to produce trajectories that adhere to both task requirements and user preferences. We design a set of parameterized features that capture the fundamental preferences in a pick-and-place type of object-transportation task, both in shape and timing of the motion. We demonstrate that our method is capable of generalizing such preferences to new scenarios. We implement our algorithm on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
