Imitation and Mirror Systems in Robots through Deep Modality Blending Networks
M. Y. Seker, A. Ahmetoglu, Y. Nagai, M. Asada, E. Oztop, E. Ugur

TL;DR
This paper introduces deep modality blending networks (DMBN), a novel deep learning approach that creates a shared multi-modal latent space enabling robots to understand, imitate, and generate multi-sensory action trajectories, inspired by biological mirror systems.
Contribution
The paper presents a new deep learning architecture, DMBN, that blends multi-modal signals into a common latent space to facilitate action understanding and imitation in robots, inspired by mirror neuron systems.
Findings
DMBN can generate multi-modal trajectories conditioned on desired sensory/motor inputs.
The system enables anatomical and effect-based imitation from different perspectives.
DMBN demonstrates robust retrieval and prediction capabilities with partial multi-modal data.
Abstract
Learning to interact with the environment not only empowers the agent with manipulation capability but also generates information to facilitate building of action understanding and imitation capabilities. This seems to be a strategy adopted by biological systems, in particular primates, as evidenced by the existence of mirror neurons that seem to be involved in multi-modal action understanding. How to benefit from the interaction experience of the robots to enable understanding actions and goals of other agents is still a challenging question. In this study, we propose a novel method, deep modality blending networks (DMBN), that creates a common latent space from multi-modal experience of a robot by blending multi-modal signals with a stochastic weighting mechanism. We show for the first time that deep learning, when combined with a novel modality blending scheme, can facilitate action…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAction Observation and Synchronization · Robot Manipulation and Learning · Human Pose and Action Recognition
