Demystifying statistical learning based on efficient influence functions

Oliver Hines; Oliver Dukes; Karla Diaz-Ordaz; Stijn Vansteelandt

arXiv:2107.00681·math.ST·January 14, 2022

Demystifying statistical learning based on efficient influence functions

Oliver Hines, Oliver Dukes, Karla Diaz-Ordaz, Stijn Vansteelandt

PDF

TL;DR

This paper explains how to derive and use efficient influence functions to improve the construction of estimators in statistical learning, addressing bias and variability issues in treatment effect estimation.

Contribution

It provides a detailed methodology for deriving efficient influence functions and demonstrates their application in constructing better estimators in machine learning contexts.

Findings

01

Efficient influence functions can be systematically derived for various estimands.

02

Using influence functions improves estimator bias and variability.

03

The methodology is broadly applicable across different statistical models.

Abstract

Evaluation of treatment effects and more general estimands is typically achieved via parametric modelling, which is unsatisfactory since model misspecification is likely. Data-adaptive model building (e.g. statistical/machine learning) is commonly employed to reduce the risk of misspecification. Naive use of such methods, however, delivers estimators whose bias may shrink too slowly with sample size for inferential methods to perform well, including those based on the bootstrap. Bias arises because standard data-adaptive methods are tuned towards minimal prediction error as opposed to e.g. minimal MSE in the estimator. This may cause excess variability that is difficult to acknowledge, due to the complexity of such strategies. Building on results from non-parametric statistics, targeted learning and debiased machine learning overcome these problems by constructing estimators using the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.