Learning Multimodal Attention for Manipulating Deformable Objects with Changing States
Namiko Saito, Mayu Tatsumi, Ayuna Kubo, Kanata Suzuki, Hiroshi Ito, Shigeki Sugano, Tetsuya Ogata

TL;DR
This paper introduces a multimodal attention-based neural network enabling robots to perceive and manipulate deformable objects with changing states, demonstrated through a cooking task involving eggs.
Contribution
The authors develop a predictive recurrent neural network with an attention mechanism for real-time perception and motion generation in dynamic, deformable object manipulation tasks.
Findings
Robot successfully cooked eggs with unknown ingredients.
Model adapted stirring method based on egg state.
Achieved human-like manipulation skills.
Abstract
To support humans in their daily lives, robots are required to autonomously learn, adapt to objects and environments, and perform the appropriate actions. We tackled on the task of cooking scrambled eggs using real ingredients, in which the robot needs to perceive the states of the egg and adjust stirring movement in real time, while the egg is heated and the state changes continuously. In previous works, handling changing objects was found to be challenging because sensory information includes dynamical, both important or noisy information, and the modality which should be focused on changes every time, making it difficult to realize both perception and motion generation in real time. We propose a predictive recurrent neural network with an attention mechanism that can weigh the sensor input, distinguishing how important and reliable each modality is, that realize quick and efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Automated Systems · Robotic Locomotion and Control
