Deep Active Visual Attention for Real-time Robot Motion Generation:   Emergence of Tool-body Assimilation and Adaptive Tool-use

Hyogo Hiruma; Hiroshi Ito; Hiroki Mori; and Tetsuya Ogata

arXiv:2206.14530·cs.RO·June 30, 2022

Deep Active Visual Attention for Real-time Robot Motion Generation: Emergence of Tool-body Assimilation and Adaptive Tool-use

Hyogo Hiruma, Hiroshi Ito, Hiroki Mori, and Tetsuya Ogata

PDF

Open Access

TL;DR

This paper introduces a novel robot motion generation model with active top-down visual attention inspired by human cognition, enabling flexible perception and tool-body assimilation during tool-use tasks.

Contribution

It proposes a state-driven attention module that actively modifies perception targets based on task states, mimicking biological tool-body assimilation.

Findings

01

Improved flexibility in visual perception during tool use

02

Stable attention and motion with untrained tools

03

Resilience to distractions during tasks

Abstract

Sufficiently perceiving the environment is a critical factor in robot motion generation. Although the introduction of deep visual processing models have contributed in extending this ability, existing methods lack in the ability to actively modify what to perceive; humans perform internally during visual cognitive processes. This paper addresses the issue by proposing a novel robot motion generation model, inspired by a human cognitive structure. The model incorporates a state-driven active top-down visual attention module, which acquires attentions that can actively change targets based on task states. We term such attentions as role-based attentions, since the acquired attention directed to targets that shared a coherent role throughout the motion. The model was trained on a robot tool-use task, in which the role-based attentions perceived the robot grippers and tool as identical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Robot Manipulation and Learning · Face Recognition and Perception