Optimal Estimator for Linear Regression with Shuffled Labels

Hang Zhang; Ping Li

arXiv:2310.01326·stat.ML·October 3, 2023·1 cites

Open Access

TL;DR

This paper introduces an efficient one-step estimator for linear regression with shuffled labels, providing conditions for permutation recovery across different signal-to-noise regimes and validating results through numerical experiments.

Contribution

It proposes a novel estimator with optimal computational complexity and characterizes the SNR thresholds for accurate permutation recovery in shuffled linear regression.

Findings

01

The estimator runs in $O(n^3 + np^2m)$ time, which is no greater than the larger of the costs of a linear assignment algorithm ($O(n^3)$) and a least squares algorithm ($O(np^2m)$).

02

Sufficient SNR conditions for permutation recovery are established for various regimes.

03

Numerical experiments confirm theoretical SNR thresholds and estimator effectiveness.

Abstract

This paper considers the task of linear regression with shuffled labels, i.e., $Y = \Pi X B + W$, where $Y \in \mathbb{R}^{n \times m}$, $\Pi \in \mathbb{R}^{n \times n}$, $X \in \mathbb{R}^{n \times p}$, $B \in \mathbb{R}^{p \times m}$, and $W \in \mathbb{R}^{n \times m}$, respectively, represent the sensing results, (unknown or missing) corresponding information, sensing matrix, signal of interest, and additive sensing noise. Given the observation $Y$ and sensing matrix $X$, we propose a one-step estimator to reconstruct $(\Pi, B)$. From the computational perspective, our estimator's complexity is $O(n^{3} + n p^{2} m)$, which is no greater than the maximum complexity of a linear assignment algorithm (e.g., $O(n^{3})$) and a least squares algorithm (e.g., $O(n p^{2} m)$). From the statistical perspective,…
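The pipeline the abstract describes, one linear assignment solve followed by one least squares solve, can be sketched as follows. This is an illustration only: the truncated abstract does not spell out the estimator, so the score matrix $Y Y^\top X X^\top$ used here is an assumption, and the names `simulate` and `one_step_estimate` are hypothetical. The sketch also assumes a Gaussian sensing matrix and a permutation that moves only a fraction of the rows.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def simulate(n, p, m, n_shuffled, noise_std, seed=0):
    """Draw (X, Y, B, perm) from Y = Pi X B + W with a partial shuffle.

    perm[i] = j means row i of Y was generated from row j of X,
    i.e. Pi[i, j] = 1. Only n_shuffled rows are moved (an assumption
    of this sketch, not a statement from the abstract).
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))
    B = rng.standard_normal((p, m))
    perm = np.arange(n)
    idx = rng.choice(n, size=n_shuffled, replace=False)
    perm[idx] = rng.permutation(idx)  # shuffle only the chosen rows
    Y = X[perm] @ B + noise_std * rng.standard_normal((n, m))
    return X, Y, B, perm


def one_step_estimate(X, Y):
    """Estimate the permutation by linear assignment, then B by least squares."""
    # Assumed score: M[i, j] = y_i^T (Y^T X) x_j. Multiplying left to
    # right keeps the intermediate product at size n x p.
    M = Y @ (Y.T @ X) @ X.T
    # Linear assignment step, O(n^3) for standard solvers.
    _, perm_hat = linear_sum_assignment(M, maximize=True)
    # Least squares step on the re-aligned rows of X.
    B_hat = np.linalg.lstsq(X[perm_hat], Y, rcond=None)[0]
    return perm_hat, B_hat


if __name__ == "__main__":
    X, Y, B, perm = simulate(n=200, p=5, m=20, n_shuffled=20, noise_std=0.1)
    perm_hat, B_hat = one_step_estimate(X, Y)
    print("fraction of rows matched correctly:", np.mean(perm_hat == perm))
    print("relative error in B:", np.linalg.norm(B_hat - B) / np.linalg.norm(B))
```

How well this sketch recovers the permutation depends on the noise level and on how many rows are shuffled. The precise SNR conditions are the subject of the paper's statistical analysis and are not reproduced here.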

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Electrical and Bioimpedance Tomography

Methods: Linear Regression