Optimal Estimator for Linear Regression with Shuffled Labels
Hang Zhang, Ping Li

TL;DR
This paper introduces an efficient one-step estimator for linear regression with shuffled labels, providing conditions for permutation recovery across different signal-to-noise regimes and validating results through numerical experiments.
Contribution
It proposes a novel estimator with optimal computational complexity and characterizes the SNR thresholds for accurate permutation recovery in shuffled linear regression.
Findings
Estimator complexity is $O(n^3 + np^2m)$, matching linear assignment and least squares algorithms.
Sufficient SNR conditions for permutation recovery are established for various regimes.
Numerical experiments confirm theoretical SNR thresholds and estimator effectiveness.
Abstract
This paper considers the task of linear regression with shuffled labels, i.e., , where , and , respectively, represent the sensing results, (unknown or missing) corresponding information, sensing matrix, signal of interest, and additive sensing noise. Given the observation and sensing matrix , we propose a one-step estimator to reconstruct . From the computational perspective, our estimator's complexity is , which is no greater than the maximum complexity of a linear assignment algorithm (e.g., ) and a least square algorithm (e.g., ). From the statistical perspective,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Electrical and Bioimpedance Tomography
MethodsLinear Regression
