LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport
Yanbin Liu, Makoto Yamada, Yao-Hung Hubert Tsai, Tam Le, Ruslan, Salakhutdinov, Yi Yang

TL;DR
This paper introduces LSMI-Sinkhorn, a semi-supervised method for mutual information estimation that effectively utilizes limited paired data and unpaired samples through optimal transport, with applications in image matching and summarization.
Contribution
The paper proposes a novel semi-supervised mutual information estimation method using optimal transport and quadratic programming, along with an efficient LSMI-Sinkhorn algorithm.
Findings
Accurately estimates mutual information with few paired samples.
Demonstrates effectiveness in image matching tasks.
Shows versatility across various machine learning applications.
Abstract
Estimating mutual information is an important statistics and machine learning problem. To estimate the mutual information from data, a common practice is preparing a set of paired samples . However, in many situations, it is difficult to obtain a large number of data pairs. To address this problem, we propose the semi-supervised Squared-loss Mutual Information (SMI) estimation method using a small number of paired samples and the available unpaired ones. We first represent SMI through the density ratio function, where the expectation is approximated by the samples from marginals and its assignment parameters. The objective is formulated using the optimal transport problem and quadratic programming. Then, we introduce the Least-Squares Mutual Information with Sinkhorn (LSMI-Sinkhorn)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Algorithms · Machine Learning and ELM
