Multimodal VAE Active Inference Controller

Cristian Meo; Pablo Lanillos

arXiv:2103.04412·cs.RO·March 9, 2021

Multimodal VAE Active Inference Controller

Cristian Meo, Pablo Lanillos

PDF

1 Repo

TL;DR

This paper introduces a multimodal active inference controller for industrial robots that integrates raw images and proprioception, improving control robustness and adaptability without retraining.

Contribution

It extends active inference control to high-dimensional multimodal inputs using a linearly coupled variational autoencoder, enabling scalable and robust robot control.

Findings

01

Enhanced tracking accuracy in goal-directed reaching

02

High robustness to noise and environmental changes

03

No need for relearning models or retuning parameters

Abstract

Active inference, a theoretical construct inspired by brain processing, is a promising alternative to control artificial agents. However, current methods do not yet scale to high-dimensional inputs in continuous control. Here we present a novel active inference torque controller for industrial arms that maintains the adaptive characteristics of previous proprioceptive approaches but also enables large-scale multimodal integration (e.g., raw images). We extended our previous mathematical formulation by including multimodal state representation learning using a linearly coupled multimodal variational autoencoder. We evaluated our model on a simulated 7DOF Franka Emika Panda robot arm and compared its behavior with a previous active inference baseline and the Panda built-in optimized controller. Results showed improved tracking and control in goal-directed reaching due to the increased…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Cmeo97/MAIF
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.