A nonparametric framework for treatment effect modifier discovery in high dimensions
Philippe Boileau, Ning Leng, Nima S. Hejazi, Mark van der Laan,, Sandrine Dudoit

TL;DR
This paper introduces a model-agnostic, high-dimensional framework for identifying treatment effect modifiers, with robust estimators and applications to gene expression data in breast cancer trials.
Contribution
It develops a novel, model-agnostic framework for treatment effect modifier discovery in high-dimensional data, including new estimators with proven asymptotic properties.
Findings
Estimators achieve asymptotic guarantees in realistic sample sizes.
Framework successfully identifies genes linked to treatment effects in breast cancer.
Open-source R package 'unihtee' implements the methodology.
Abstract
Heterogeneous treatment effects are driven by treatment effect modifiers, pre-treatment covariates that modify the effect of a treatment on an outcome. Current approaches for uncovering these variables are limited to low-dimensional data, data with weakly correlated covariates, or data generated according to parametric processes. We resolve these issues by developing a framework for defining model-agnostic treatment effect modifier variable importance parameters applicable to high-dimensional data with arbitrary correlation structure, deriving one-step, estimating equation and targeted maximum likelihood estimators of these parameters, and establishing these estimators' asymptotic properties. This framework is showcased by defining variable importance parameters for data-generating processes with continuous, binary, and time-to-event outcomes with binary treatments, and deriving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Gene expression and cancer classification
