Unified Cross-Modal Medical Image Synthesis with Hierarchical Mixture of Product-of-Experts
Reuben Dorent, Nazim Haouchine, Alexandra Golby, Sarah Frisken, Tina Kapur, William Wells

TL;DR
This paper introduces MMHVAE, a hierarchical variational auto-encoder model that effectively synthesizes missing medical images across different modalities, addressing challenges in high-resolution generation, missing data estimation, multimodal fusion, and dataset-level learning.
Contribution
The paper presents a novel deep mixture model, MMHVAE, for cross-modal medical image synthesis that handles incomplete data and learns complex multimodal representations.
Findings
Successfully synthesizes missing brain images in MRI and ultrasound modalities.
Outperforms existing methods in image quality and data completion tasks.
Demonstrates robustness to incomplete datasets during training.
Abstract
We propose a deep mixture of multimodal hierarchical variational auto-encoders called MMHVAE that synthesizes missing images from observed images in different modalities. MMHVAE's design focuses on tackling four challenges: (i) creating a complex latent representation of multimodal data to generate high-resolution images; (ii) encouraging the variational distributions to estimate the missing information needed for cross-modal image synthesis; (iii) learning to fuse multimodal information in the context of missing data; (iv) leveraging dataset-level information to handle incomplete data sets at training time. Extensive experiments are performed on the challenging problem of pre-operative brain multi-parametric magnetic resonance and intra-operative ultrasound imaging.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
