Minimax Optimal Two-Stage Algorithm For Moment Estimation Under Covariate Shift

Zhen Zhang; Xin Liu; Shaoli Wang; Jiaye Teng

arXiv:2506.23453·stat.ML·July 1, 2025

Minimax Optimal Two-Stage Algorithm For Moment Estimation Under Covariate Shift

Zhen Zhang, Xin Liu, Shaoli Wang, Jiaye Teng

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a minimax optimal two-stage algorithm for estimating moments under covariate shift, addressing distributional differences between training and testing data with theoretical guarantees and practical robustness.

Contribution

It proposes a novel two-stage method achieving minimax optimal bounds for moment estimation under covariate shift, including a truncated estimator for unknown distributions.

Findings

01

The proposed method attains minimax optimal bounds up to a logarithmic factor.

02

The truncated estimator ensures double robustness in practical scenarios.

03

Numerical experiments confirm the theoretical advantages of the approach.

Abstract

Covariate shift occurs when the distribution of input features differs between the training and testing phases. In covariate shift, estimating an unknown function's moment is a classical problem that remains under-explored, despite its common occurrence in real-world scenarios. In this paper, we investigate the minimax lower bound of the problem when the source and target distributions are known. To achieve the minimax optimal bound (up to a logarithmic factor), we propose a two-stage algorithm. Specifically, it first trains an optimal estimator for the function under the source distribution, and then uses a likelihood ratio reweighting procedure to calibrate the moment estimator. In practice, the source and target distributions are typically unknown, and estimating the likelihood ratio may be unstable. To solve this problem, we propose a truncated version of the estimator that ensures…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 3

Strengths

- The investigated problem of estimating moments under covariate shift appears fundamental, and the paper offers an approach that the authors show is minimax optimal, which seems a valuable contribution. - The technical results are presented and interpreted clearly.

Weaknesses

**Technical Contributions**: The novelty in attaining and proving the main theorems (Theorems 1 and 2), compared to the existing literature, is unclear. In particular, the proofs appear similar to those in Blanchet et al. (2024), with the main addition being separating and upper bounding the term $w(x)$. **Limitations of the minimax lower bound results**: If I understand correctly, the minimax lower bound result, as well as the proposed algorithm from the authors that achieve the lower bound, a

Reviewer 02Rating 5Confidence 3

Strengths

The paper is clearly laid out. Specifically, it provides the minimax lower bounds and develops an estimator that matches the lower bounds up to the log factor when both target and source distributions are known. Furthermore, a doubly robust estimator is developed when the distributions are unknown.

Weaknesses

Although the paper considers an interesting problem in theme of an important topic, namely covariate shift, there are two major weaknesses. (1) The problem of estimating the $q$-th moment of an unknown function $f$ is not well motivated. On page 1, it is stated that "This is a common scenario in many fields, such as counterfactual inference in causal inference (Ding, 2024)." However, this is not informative enough; Ding (2024) is a textbook and there is no concrete example by simply citing the

Reviewer 03Rating 8Confidence 3

Strengths

The paper is beautifully written with a solid theoretical results. Starting from the optimality, it presents an idealized estimator which attains the optimality, and most importantly, it provides a practically usable estimator and establishes theoretical results for the stabilized estimator. The structure and presentation of the theoretical statements hits perfect balance between technical details and insights for readers to follow. The results are stated in a way how each step of the proposed e

Weaknesses

No major weakness.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Control Systems and Identification · Target Tracking and Data Fusion in Sensor Networks