Active Subsampling for Measurement-Constrained M-Estimation of Individualized Thresholds with High-Dimensional Data
Jingyi Duan, Yang Ning

TL;DR
This paper introduces an active subsampling algorithm for high-dimensional measurement-constrained M-estimation of individualized thresholds, achieving faster convergence rates than traditional methods and demonstrating minimax optimality.
Contribution
The paper proposes a novel K-step active subsampling algorithm for high-dimensional threshold estimation under measurement constraints, with theoretical guarantees and practical validation.
Findings
Two-step algorithm achieves parametric convergence rate for certain smoothness levels.
Estimator outperforms minimax rates in some regimes with fewer samples.
Method demonstrates effectiveness on large real-world dataset.
Abstract
In the measurement-constrained problems, despite the availability of large datasets, we may be only affordable to observe the labels on a small portion of the large dataset. This poses a critical question that which data points are most beneficial to label given a budget constraint. In this paper, we focus on the estimation of the optimal individualized threshold in a measurement-constrained M-estimation framework. Our goal is to estimate a high-dimensional parameter in a linear threshold for a continuous variable such that the discrepancy between whether exceeds the threshold and a binary outcome is minimized. We propose a novel -step active subsampling algorithm to estimate , which iteratively samples the most informative observations and solves a regularized M-estimator. The theoretical properties of our estimator demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Process Monitoring
MethodsFocus
