Multi-dimensional domain generalization with low-rank structures
Sai Li, Linjun Zhang

TL;DR
This paper introduces a tensor-based approach for multi-dimensional domain generalization in linear regression, effectively handling underrepresented sub-populations with limited data, and provides theoretical guarantees and empirical validation.
Contribution
It proposes a novel tensor completion method leveraging group label structures for robust domain generalization in linear models, with proven optimality and practical validation.
Findings
The method achieves minimax optimality.
It improves prediction accuracy for minority groups.
The approach is validated on real-world data.
Abstract
In conventional statistical and machine learning methods, it is typically assumed that the test data are identically distributed with the training data. However, this assumption does not always hold, especially in applications where the target population are not well-represented in the training data. This is a notable issue in health-related studies, where specific ethnic populations may be underrepresented, posing a significant challenge for researchers aiming to make statistical inferences about these minority groups. In this work, we present a novel approach to addressing this challenge in linear regression models. We organize the model parameters for all the sub-populations into a tensor. By studying a structured tensor completion problem, we can achieve robust domain generalization, i.e., learning about sub-populations with limited or no available data. Our method novelly leverages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Statistical Methods in Epidemiology
MethodsLinear Regression
