TL;DR
This paper introduces a multimodal active inference controller for industrial robots that integrates raw images and proprioception, improving control robustness and adaptability without retraining.
Contribution
It extends active inference control to high-dimensional multimodal inputs using a linearly coupled variational autoencoder, enabling scalable and robust robot control.
Findings
Enhanced tracking accuracy in goal-directed reaching
High robustness to noise and environmental changes
No need for relearning models or retuning parameters
Abstract
Active inference, a theoretical construct inspired by brain processing, is a promising alternative to control artificial agents. However, current methods do not yet scale to high-dimensional inputs in continuous control. Here we present a novel active inference torque controller for industrial arms that maintains the adaptive characteristics of previous proprioceptive approaches but also enables large-scale multimodal integration (e.g., raw images). We extended our previous mathematical formulation by including multimodal state representation learning using a linearly coupled multimodal variational autoencoder. We evaluated our model on a simulated 7DOF Franka Emika Panda robot arm and compared its behavior with a previous active inference baseline and the Panda built-in optimized controller. Results showed improved tracking and control in goal-directed reaching due to the increased…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
