Valid post-selection inference in Robust Q-learning

Jeremiah Jones; Ashkan Ertefaie; James R. McKay; David W. Oslin; Robert L. Strawderman

arXiv:2208.03233·stat.ME·October 14, 2025·1 cites

Valid post-selection inference in Robust Q-learning

Jeremiah Jones, Ashkan Ertefaie, James R. McKay, David W. Oslin, Robert L. Strawderman

PDF

Open Access

TL;DR

This paper develops a statistically valid post-selection inference method for robust Q-learning, enabling reliable variable selection in complex adaptive treatment strategies with theoretical guarantees and practical application.

Contribution

It adapts the UPoSI procedure to semiparametric robust Q-learning, allowing valid inference after data-driven variable selection in multistage decision models.

Findings

01

The proposed method provides valid confidence regions post-variable selection.

02

Simulation studies confirm the theoretical validity of the approach.

03

Application to substance abuse treatment demonstrates practical utility.

Abstract

Q-learning facilitates the development of an optimal adaptive treatment strategy through stagewise regression on a pre-specified set of tailoring variables and confounders. Semiparametric robust Q-learning eliminates the residual confounding that can occur when parametric working models for confounding influences are misspecified. However, in the presence of many potential tailoring variables, constructing an optimal adaptive treatment strategy using either approach may lead to including extraneous variables that contribute little or no benefit while increasing implementation costs, thereby placing an undue burden on patients. Using data-driven selection processes to identify a smaller set of informative prognostic factors is straightforward; however, proper statistical inference must account for this selection process. In this paper, we adapt the Universal Post-Selection Inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods in Clinical Trials · Gene expression and cancer classification