Efficient Model Learning for Human-Robot Collaborative Tasks

Stefanos Nikolaidis; Keren Gu; Ramya Ramakrishnan; and Julie Shah

arXiv:1405.6341·cs.RO·June 15, 2017

Efficient Model Learning for Human-Robot Collaborative Tasks

Stefanos Nikolaidis, Keren Gu, Ramya Ramakrishnan, and Julie Shah

PDF

TL;DR

This paper introduces an automatic framework for learning human models from joint-action demonstrations, enabling robots to adapt their policies for collaborative tasks with new users using inverse reinforcement learning and POMDPs.

Contribution

It presents a novel, fully automatic method combining clustering, inverse reinforcement learning, and POMDPs to personalize and robustify robot policies in human-robot collaboration.

Findings

01

The framework accurately infers human types both offline and online.

02

Robust policies are generated that adapt to new users' preferences.

03

Experimental validation shows effective human-robot collaboration.

Abstract

We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These demonstrated sequences are also used by the robot to learn a reward function that is representative for each type, through the employment of an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user that was not included in the training set, and can compute a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.