A Markov Random Field Multi-Modal Variational AutoEncoder

Fouad Oubari; Mohamed El Baha; Raphael Meunier; Rodrigue D\'ecatoire,; Mathilde Mougeot

arXiv:2408.09576·cs.LG·February 10, 2025

A Markov Random Field Multi-Modal Variational AutoEncoder

Fouad Oubari, Mohamed El Baha, Raphael Meunier, Rodrigue D\'ecatoire,, Mathilde Mougeot

PDF

Open Access

TL;DR

This paper presents a novel multimodal VAE that integrates a Markov Random Field into the prior and posterior to better model complex intermodal interactions, improving representation of multimodal data.

Contribution

It introduces a new multimodal VAE architecture with MRF integration, enhancing the modeling of complex intermodal relationships beyond existing methods.

Findings

01

Competitive performance on PolyMNIST dataset

02

Superior handling of complex intermodal dependencies in synthetic data

03

Effective modeling of intricate intermodal relationships

Abstract

Recent advancements in multimodal Variational AutoEncoders (VAEs) have highlighted their potential for modeling complex data from multiple modalities. However, many existing approaches use relatively straightforward aggregating schemes that may not fully capture the complex dynamics present between different modalities. This work introduces a novel multimodal VAE that incorporates a Markov Random Field (MRF) into both the prior and posterior distributions. This integration aims to capture complex intermodal interactions more effectively. Unlike previous models, our approach is specifically designed to model and leverage the intricacies of these relationships, enabling a more faithful representation of multimodal data. Our experiments demonstrate that our model performs competitively on the standard PolyMNIST dataset and shows superior performance in managing complex intermodal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFire Detection and Safety Systems