Learning with Shared Representations: Statistical Rates and Efficient Algorithms
Xiaochun Niu, Lili Su, Jiaming Xu, Pengkun Yang

TL;DR
This paper provides a comprehensive theoretical analysis of collaborative learning with shared low-dimensional representations, establishing bounds on statistical error, and proposing an optimal spectral estimator for both linear and nonlinear models.
Contribution
It introduces new bounds on statistical error for shared representation learning, extending analysis to nonlinear models, and proposes an estimator that achieves optimal rates under certain conditions.
Findings
Spectral estimator achieves near-minimax optimal rates.
Two phases of the statistical rate are identified: parameter-counting and penalized regimes.
Results inform when collaboration improves learning performance.
Abstract
Collaborative learning through latent shared feature representations enables heterogeneous clients to train personalized models with improved performance and reduced sample complexity. Despite empirical success and extensive study, the theoretical understanding of such methods remains incomplete, even for representations restricted to low-dimensional linear subspaces. In this work, we establish new upper and lower bounds on the statistical error in learning low-dimensional shared representations across clients. Our analysis captures both statistical heterogeneity (including covariate and concept shifts) and variation in local dataset sizes, aspects often overlooked in prior work. We further extend these results to nonlinear models including logistic regression and one-hidden-layer ReLU networks. Specifically, we design a spectral estimator that leverages independent replicas of local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Metaheuristic Optimization Algorithms Research · Data Mining Algorithms and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Logistic Regression
