One-Way Matching of Datasets with Low Rank Signals

Shuxiao Chen; Sizun Jiang; Zongming Ma; Garry P. Nolan; Bokai Zhu

arXiv:2204.13858 · math.ST · October 4, 2022 · 6 cites

Open Access

TL;DR

This paper derives information-theoretic limits for one-way matching of two datasets with low rank signals, shows that linear assignment on projected data attains fast and sometimes minimax-optimal rates, and illustrates the procedure on simulations and two single-cell data examples.

Contribution

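It introduces a theoretical framework for one-way matching of datasets with low rank signals, derives information-theoretic limits under a mismatch proportion loss, and shows that linear assignment with projected data achieves fast rates of convergence and sometimes minimax rate optimality.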

Findings

01

Linear assignment with projected data achieves fast rates of convergence and is sometimes minimax rate optimal.

02

Theoretical bounds are supported by simulations.

03

Practical use of the matching procedure is illustrated on two single-cell data examples.

Abstract

We study one-way matching of a pair of datasets with low rank signals. Under a stylized model, we first derive information-theoretic limits of matching under a mismatch proportion loss. We then show that linear assignment with projected data achieves fast rates of convergence and sometimes even minimax rate optimality for this task. The theoretical error bounds are corroborated by simulated examples. Furthermore, we illustrate practical use of the matching procedure on two single-cell data examples.
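
To make the phrase "linear assignment with projected data" concrete, here is a minimal Python sketch. It is not the authors' procedure: the data model (two n-by-p matrices sharing a low rank signal, with the rows of the second one permuted), the choice to estimate the projection from the stacked data, and the function names `match_projected` and `mismatch_proportion` are all assumptions made for illustration. The loss is read here as the fraction of rows assigned to the wrong partner.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def match_projected(X, Y, rank):
    """Match each row of X to a row of Y after a low rank projection.

    Assumption (not from the paper): the projection directions are the
    top `rank` right singular vectors of the stacked matrix [X; Y].
    """
    _, _, Vt = np.linalg.svd(np.vstack([X, Y]), full_matrices=False)
    V = Vt[:rank].T                      # p x rank projection basis
    Xp, Yp = X @ V, Y @ V                # projected rows, n x rank each
    # Pairwise squared Euclidean distances between projected rows.
    cost = ((Xp[:, None, :] - Yp[None, :, :]) ** 2).sum(axis=2)
    # Linear assignment: minimize the total cost over one-to-one matchings.
    _, col_ind = linear_sum_assignment(cost)
    return col_ind                       # col_ind[i] = row of Y matched to X[i]


def mismatch_proportion(pi_hat, pi_true):
    """Fraction of rows whose estimated partner differs from the true one."""
    return float(np.mean(np.asarray(pi_hat) != np.asarray(pi_true)))


if __name__ == "__main__":
    # Hypothetical simulation: shared rank-2 signal plus Gaussian noise.
    rng = np.random.default_rng(0)
    n, p, r = 100, 30, 2
    signal = 3.0 * rng.normal(size=(n, r)) @ rng.normal(size=(p, r)).T
    perm = rng.permutation(n)
    X = signal + rng.normal(size=(n, p))
    Y = signal[perm] + rng.normal(size=(n, p))
    pi_true = np.argsort(perm)           # Y[pi_true[i]] shares X[i]'s signal
    pi_hat = match_projected(X, Y, rank=r)
    print("mismatch proportion:", mismatch_proportion(pi_hat, pi_true))
```

The dense n-by-n cost matrix keeps the sketch short but uses memory quadratic in n, so it is only suitable for small examples.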

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Error Correcting Code Techniques · Machine Learning and Algorithms