Efficient Estimation of Quantiles in Missing Data Models
Iv\'an D\'iaz

TL;DR
This paper introduces a new targeted maximum likelihood estimator for quantiles in missing data models, demonstrating superior efficiency and robustness over existing methods through simulations and providing practical R code for implementation.
Contribution
The paper presents a novel TMLE for quantiles in semiparametric missing data models, achieving local efficiency, $\sqrt{n}$-consistency, and doubly robustness.
Findings
TMLE outperforms existing estimators in simulations
Efficiency gains up to three times smaller variance
Enables more powerful hypothesis testing for treatment effects
Abstract
We propose a novel targeted maximum likelihood estimator (TMLE) for quantiles in semiparametric missing data models. Our proposed estimator is locally efficient, -consistent, asymptotically normal, and doubly robust, under regularity conditions. We use Monte Carlo simulation to compare our proposed method to existing estimators. The TMLE is superior to all competitors, with relative efficiency up to three times smaller than the inverse probability weighted estimator (IPW), and up to two times smaller than the augmented IPW. This research is motivated by a causal inference research question with highly variable treatment assignment probabilities, and a heavy tailed, highly variable outcome. Estimation of causal effects on the mean is a hard problem in such scenarios because the information bound is generally small. In our application, the efficiency bound for estimating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
