Demystifying statistical learning based on efficient influence functions
Oliver Hines, Oliver Dukes, Karla Diaz-Ordaz, Stijn Vansteelandt

TL;DR
This paper explains how to derive and use efficient influence functions to improve the construction of estimators in statistical learning, addressing bias and variability issues in treatment effect estimation.
Contribution
It provides a detailed methodology for deriving efficient influence functions and demonstrates their application in constructing better estimators in machine learning contexts.
Findings
Efficient influence functions can be systematically derived for various estimands.
Using influence functions improves estimator bias and variability.
The methodology is broadly applicable across different statistical models.
Abstract
Evaluation of treatment effects and more general estimands is typically achieved via parametric modelling, which is unsatisfactory since model misspecification is likely. Data-adaptive model building (e.g. statistical/machine learning) is commonly employed to reduce the risk of misspecification. Naive use of such methods, however, delivers estimators whose bias may shrink too slowly with sample size for inferential methods to perform well, including those based on the bootstrap. Bias arises because standard data-adaptive methods are tuned towards minimal prediction error as opposed to e.g. minimal MSE in the estimator. This may cause excess variability that is difficult to acknowledge, due to the complexity of such strategies. Building on results from non-parametric statistics, targeted learning and debiased machine learning overcome these problems by constructing estimators using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
