Analyzing Multimodal Integration in the Variational Autoencoder from an Information-Theoretic Perspective
Carlotta Langer, Yasmin Kim Georgie, Ilja Porohovoj, Verena Vanessa, Hafner, Nihat Ay

TL;DR
This paper uses information-theoretic measures to analyze how multimodal variational autoencoders integrate different sensory inputs, focusing on the importance of each modality for accurate data reconstruction in robotic systems.
Contribution
It introduces a novel information-theoretic analysis framework for multimodal VAEs, evaluating the significance of each modality and the effects of missing information on reconstruction quality.
Findings
Different weighting schedules affect multimodal integration capabilities.
Loss of precision measures reveal the impact of missing modalities.
Analysis informs better design of multimodal VAEs for robotics.
Abstract
Human perception is inherently multimodal. We integrate, for instance, visual, proprioceptive and tactile information into one experience. Hence, multimodal learning is of importance for building robotic systems that aim at robustly interacting with the real world. One potential model that has been proposed for multimodal integration is the multimodal variational autoencoder. A variational autoencoder (VAE) consists of two networks, an encoder that maps the data to a stochastic latent space and a decoder that reconstruct this data from an element of this latent space. The multimodal VAE integrates inputs from different modalities at two points in time in the latent space and can thereby be used as a controller for a robotic agent. Here we use this architecture and introduce information-theoretic measures in order to analyze how important the integration of the different modalities are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Laser and Thermal Forming Techniques · Infrared Target Detection Methodologies
