Deep Active Visual Attention for Real-time Robot Motion Generation: Emergence of Tool-body Assimilation and Adaptive Tool-use
Hyogo Hiruma, Hiroshi Ito, Hiroki Mori, and Tetsuya Ogata

TL;DR
This paper introduces a novel robot motion generation model with active top-down visual attention inspired by human cognition, enabling flexible perception and tool-body assimilation during tool-use tasks.
Contribution
It proposes a state-driven attention module that actively modifies perception targets based on task states, mimicking biological tool-body assimilation.
Findings
Improved flexibility in visual perception during tool use
Stable attention and motion with untrained tools
Resilience to distractions during tasks
Abstract
Sufficiently perceiving the environment is a critical factor in robot motion generation. Although the introduction of deep visual processing models have contributed in extending this ability, existing methods lack in the ability to actively modify what to perceive; humans perform internally during visual cognitive processes. This paper addresses the issue by proposing a novel robot motion generation model, inspired by a human cognitive structure. The model incorporates a state-driven active top-down visual attention module, which acquires attentions that can actively change targets based on task states. We term such attentions as role-based attentions, since the acquired attention directed to targets that shared a coherent role throughout the motion. The model was trained on a robot tool-use task, in which the role-based attentions perceived the robot grippers and tool as identical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Robot Manipulation and Learning · Face Recognition and Perception
