Missing Data Imputation by Reducing Mutual Information with Rectified Flows

Jiahao Yu; Qizhen Ying; Leyang Wang; Ziyue Jiang; Song Liu

arXiv:2505.11749·stat.ML·November 26, 2025

Missing Data Imputation by Reducing Mutual Information with Rectified Flows

Jiahao Yu, Qizhen Ying, Leyang Wang, Ziyue Jiang, Song Liu

PDF

Open Access

TL;DR

This paper presents a new iterative missing data imputation method that reduces mutual information between data and missingness, leveraging rectified flows and outperforming existing techniques on various datasets.

Contribution

Introduces a mutual information reduction framework for data imputation using rectified flows, unifying and improving upon existing methods.

Findings

01

Superior imputation accuracy on synthetic datasets.

02

Effective handling of real-world missing data scenarios.

03

Theoretical connection to existing imputation techniques.

Abstract

This paper introduces a novel iterative method for missing data imputation that sequentially reduces the mutual information between data and the corresponding missingness mask. Inspired by GAN-based approaches that train generators to decrease the predictability of missingness patterns, our method explicitly targets this reduction in mutual information. Specifically, our algorithm iteratively minimizes the KL divergence between the joint distribution of the imputed data and missingness mask, and the product of their marginals from the previous iteration. We show that the optimal imputation under this framework can be achieved by solving an ODE whose velocity field minimizes a rectified flow training objective. We further illustrate that some existing imputation techniques can be interpreted as approximate special cases of our mutual-information-reducing framework. Comprehensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition