Enhancing Robot Learning through Learned Human-Attention Feature Maps

Daniel Scheuchenstuhl; Stefan Ulmer; Felix Resch; Luigi Berducci; Radu; Grosu

arXiv:2308.15327·cs.RO·August 30, 2023

Enhancing Robot Learning through Learned Human-Attention Feature Maps

Daniel Scheuchenstuhl, Stefan Ulmer, Felix Resch, Luigi Berducci, Radu, Grosu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to improve robot learning by modeling human attention and integrating it as auxiliary feature maps, enhancing robustness and learning speed in visual tasks.

Contribution

The paper presents a novel approach to emulate human attention and incorporate it into robot learning, improving performance in object detection and imitation learning tasks.

Findings

01

Improved robustness to out-of-distribution samples

02

Faster learning in low-data regimes

03

Effective integration of human attention into robot perception

Abstract

Robust and efficient learning remains a challenging problem in robotics, in particular with complex visual inputs. Inspired by human attention mechanism, with which we quickly process complex visual scenes and react to changes in the environment, we think that embedding auxiliary information about focus point into robot learning would enhance efficiency and robustness of the learning process. In this paper, we propose a novel approach to model and emulate the human attention with an approximate prediction model. We then leverage this output and feed it as a structured auxiliary feature map into downstream learning tasks. We validate this idea by learning a prediction model from human-gaze recordings of manual driving in the real world. We test our approach on two learning tasks - object detection and imitation learning. Our experiments demonstrate that the inclusion of predicted human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cps-tuwien/learning_human_attention
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection · Multimodal Machine Learning Applications

MethodsFocus