Reliable Multi-modal Medical Image-to-image Translation Independent of Pixel-wise Aligned Data
Langrui Zhou, Guang Li

TL;DR
This paper introduces MITIA, a novel multi-modal medical image translation model that reliably generates images without requiring pixel-wise aligned training data, effectively handling misaligned datasets.
Contribution
MITIA is the first model to enable reliable multi-modal medical image translation without pixel-wise alignment, using prior extraction and regularization to improve performance on misaligned data.
Findings
Outperforms six state-of-the-art methods on misaligned datasets
Achieves high-quality translation on well-aligned datasets
Demonstrates robustness to various misalignment errors
Abstract
The current mainstream multi-modal medical image-to-image translation methods face a contradiction. Supervised methods with outstanding performance rely on pixel-wise aligned training data to constrain the model optimization. However, obtaining pixel-wise aligned multi-modal medical image datasets is challenging. Unsupervised methods can be trained without paired data, but their reliability cannot be guaranteed. At present, there is no ideal multi-modal medical image-to-image translation method that can generate reliable translation results without the need for pixel-wise aligned data. This work aims to develop a novel medical image-to-image translation model that is independent of pixel-wise aligned data (MITIA), enabling reliable multi-modal medical image-to-image translation under the condition of misaligned training data. The proposed MITIA model utilizes a prior extraction network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
