Scalable predictive processing framework for multitask caregiving robots
Hayato Idei, Tamon Miyake, Tetsuya Ogata, and Yuichi Yamashita

TL;DR
This paper introduces a hierarchical predictive processing neural network inspired by the human brain, enabling multitask caregiving robots to learn and adapt to diverse tasks with robustness and minimal task-specific engineering.
Contribution
The paper presents a scalable, multimodal predictive processing framework that directly integrates high-dimensional sensory inputs for flexible multitask caregiving robot control.
Findings
Hierarchical latent dynamics regulate task transitions and infer occluded states.
Model demonstrates robustness to degraded visual inputs.
Asymmetric interference observed in multitask learning, with limited cross-task influence.
Abstract
The rapid aging of societies is intensifying demand for autonomous care robots; however, most existing systems are task-specific and rely on handcrafted preprocessing, limiting their ability to generalize across diverse scenarios. A prevailing theory in cognitive neuroscience proposes that the human brain operates through hierarchical predictive processing, which underlies flexible cognition and behavior by integrating multimodal sensory signals. Inspired by this principle, we introduce a hierarchical multimodal recurrent neural network grounded in predictive processing under the free-energy principle, capable of directly integrating over 30,000-dimensional visuo-proprioceptive inputs without dimensionality reduction. The model was able to learn two representative caregiving tasks, rigid-body repositioning and flexible-towel wiping, without task-specific feature engineering. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
