Optimal Transport for Structure Learning Under Missing Data

Vy Vo; He Zhao; Trung Le; Edwin V. Bonilla; Dinh Phung

arXiv:2402.15255 · cs.LG · June 4, 2024 · 1 citation

TL;DR

This paper introduces a score-based causal structure learning method that uses optimal transport to learn from data with missing values directly, rather than imputing the missing values first and then applying structure learning to the completed data.

Contribution

The paper proposes an optimal transport-based framework for causal discovery with missing data as an alternative to score-based approaches built on expectation maximization, and reports improved accuracy and scalability over existing methods.

Findings

01. Outperforms competing methods in simulations and real data

02. Recovers true causal graphs more effectively

03. Demonstrates superior scalability and flexibility

Abstract

Causal discovery in the presence of missing data introduces a chicken-and-egg dilemma. While the goal is to recover the true causal structure, robust imputation requires considering the dependencies or, preferably, causal relations among variables. Merely filling in missing values with existing imputation methods and subsequently applying structure learning on the complete data is empirically shown to be sub-optimal. To address this problem, we propose a score-based algorithm for learning causal structures from missing data based on optimal transport. This optimal transport viewpoint diverges from existing score-based approaches that are dominantly based on expectation maximization. We formulate structure learning as a density fitting problem, where the goal is to find the causal model that induces a distribution of minimum Wasserstein distance with the observed data distribution. Our…
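The density-fitting view described in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: it ignores missing values entirely, it compares only two hand-picked candidate models on two variables, and it swaps the Wasserstein distance for the sliced Wasserstein distance (an average of one-dimensional distances over random projections) because that is easy to compute with the standard library. All names here (`sliced_w2`, `sample_chain`, `sample_empty`) are hypothetical helpers introduced for illustration.

```python
import math
import random


def sliced_w2(a, b, n_proj=64, seed=0):
    """Approximate squared sliced 2-Wasserstein distance between two
    equal-size samples of 2-D points (a stand-in for the full distance)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_proj):
        # Random direction on the half circle.
        theta = rng.uniform(0.0, math.pi)
        ux, uy = math.cos(theta), math.sin(theta)
        # In 1-D, optimal transport between equal-size samples matches sorted values.
        pa = sorted(x * ux + y * uy for x, y in a)
        pb = sorted(x * ux + y * uy for x, y in b)
        total += sum((p - q) ** 2 for p, q in zip(pa, pb)) / len(pa)
    return total / n_proj


def sample_chain(n, weight, rng):
    """Samples from the candidate graph X -> Y with Y = weight * X + noise."""
    out = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        out.append((x, weight * x + rng.gauss(0.0, 1.0)))
    return out


def sample_empty(n, sd_y, rng):
    """Samples from the candidate empty graph: X and Y independent."""
    return [(rng.gauss(0.0, 1.0), rng.gauss(0.0, sd_y)) for _ in range(n)]


# "Observed" data generated from X -> Y with weight 2 (assumed ground truth).
observed = sample_chain(500, 2.0, random.Random(0))

# Two candidate models; the empty graph is given the matching marginal
# variance of Y (1 + 2^2 = 5) so only the dependence structure differs.
with_edge = sample_chain(500, 2.0, random.Random(1))
no_edge = sample_empty(500, math.sqrt(5.0), random.Random(2))

d_edge = sliced_w2(observed, with_edge)
d_empty = sliced_w2(observed, no_edge)

# Density fitting: prefer the model whose samples lie closest to the data.
best = "X -> Y" if d_edge < d_empty else "empty graph"
print(f"with edge: {d_edge:.3f}, empty: {d_empty:.3f}, selected: {best}")
```

In this toy setting the candidate with the edge should sit much closer to the observed sample than the independent model, which is the selection principle the abstract describes. A real method would also have to search over graphs, fit the model parameters, and account for the missing entries.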

Code & Models

Repositories

isvy08/otm (PyTorch, official)

Taxonomy

Topics: Machine Learning and Algorithms · Machine Learning and ELM · Neural Networks and Applications