Unified Cross-Modal Medical Image Synthesis with Hierarchical Mixture of Product-of-Experts

Reuben Dorent; Nazim Haouchine; Alexandra Golby; Sarah Frisken; Tina Kapur; William Wells

arXiv:2410.19378·cs.CV·October 8, 2025

Unified Cross-Modal Medical Image Synthesis with Hierarchical Mixture of Product-of-Experts

Reuben Dorent, Nazim Haouchine, Alexandra Golby, Sarah Frisken, Tina Kapur, William Wells

PDF

Open Access 1 Repo

TL;DR

This paper introduces MMHVAE, a hierarchical variational auto-encoder model that effectively synthesizes missing medical images across different modalities, addressing challenges in high-resolution generation, missing data estimation, multimodal fusion, and dataset-level learning.

Contribution

The paper presents a novel deep mixture model, MMHVAE, for cross-modal medical image synthesis that handles incomplete data and learns complex multimodal representations.

Findings

01

Successfully synthesizes missing brain images in MRI and ultrasound modalities.

02

Outperforms existing methods in image quality and data completion tasks.

03

Demonstrates robustness to incomplete datasets during training.

Abstract

We propose a deep mixture of multimodal hierarchical variational auto-encoders called MMHVAE that synthesizes missing images from observed images in different modalities. MMHVAE's design focuses on tackling four challenges: (i) creating a complex latent representation of multimodal data to generate high-resolution images; (ii) encouraging the variational distributions to estimate the missing information needed for cross-modal image synthesis; (iii) learning to fuse multimodal information in the context of missing data; (iv) leveraging dataset-level information to handle incomplete data sets at training time. Extensive experiments are performed on the challenging problem of pre-operative brain multi-parametric magnetic resonance and intra-operative ultrasound imaging.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

reubendo/mhvae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis