Scalable predictive processing framework for multitask caregiving robots

Hayato Idei; Tamon Miyake; Tetsuya Ogata; and Yuichi Yamashita

arXiv:2510.25053·cs.RO·October 30, 2025

Scalable predictive processing framework for multitask caregiving robots

Hayato Idei, Tamon Miyake, Tetsuya Ogata, and Yuichi Yamashita

PDF

TL;DR

This paper introduces a hierarchical predictive processing neural network inspired by the human brain, enabling multitask caregiving robots to learn and adapt to diverse tasks with robustness and minimal task-specific engineering.

Contribution

The paper presents a scalable, multimodal predictive processing framework that directly integrates high-dimensional sensory inputs for flexible multitask caregiving robot control.

Findings

01

Hierarchical latent dynamics regulate task transitions and infer occluded states.

02

Model demonstrates robustness to degraded visual inputs.

03

Asymmetric interference observed in multitask learning, with limited cross-task influence.

Abstract

The rapid aging of societies is intensifying demand for autonomous care robots; however, most existing systems are task-specific and rely on handcrafted preprocessing, limiting their ability to generalize across diverse scenarios. A prevailing theory in cognitive neuroscience proposes that the human brain operates through hierarchical predictive processing, which underlies flexible cognition and behavior by integrating multimodal sensory signals. Inspired by this principle, we introduce a hierarchical multimodal recurrent neural network grounded in predictive processing under the free-energy principle, capable of directly integrating over 30,000-dimensional visuo-proprioceptive inputs without dimensionality reduction. The model was able to learn two representative caregiving tasks, rigid-body repositioning and flexible-towel wiping, without task-specific feature engineering. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.