# Distributionally Robust Counterfactual Risk Minimization

**Authors:** Louis Faury, Ugo Tanielian, Flavian Vasile, Elena Smirnova, Elvis, Dohmatob

arXiv: 1906.06211 · 2019-12-17

## TL;DR

This paper proposes a distributionally robust optimization framework for counterfactual risk minimization, unifying existing methods and introducing new divergence-based estimators that improve performance on benchmark datasets.

## Contribution

It introduces a DRO-based framework for CRM, showing existing methods as special cases and proposing a new KL divergence-based estimator that outperforms state-of-the-art methods.

## Key findings

- The KL divergence-based estimator outperforms existing methods on benchmarks.
- Existing CRM solutions are special cases of the DRO framework.
- Using different probability divergences can improve counterfactual risk estimation.

## Abstract

This manuscript introduces the idea of using Distributionally Robust Optimization (DRO) for the Counterfactual Risk Minimization (CRM) problem. Tapping into a rich existing literature, we show that DRO is a principled tool for counterfactual decision making. We also show that well-established solutions to the CRM problem like sample variance penalization schemes are special instances of a more general DRO problem. In this unifying framework, a variety of distributionally robust counterfactual risk estimators can be constructed using various probability distances and divergences as uncertainty measures. We propose the use of Kullback-Leibler divergence as an alternative way to model uncertainty in CRM and derive a new robust counterfactual objective. In our experiments, we show that this approach outperforms the state-of-the-art on four benchmark datasets, validating the relevance of using other uncertainty measures in practical applications.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.06211/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1906.06211/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1906.06211/full.md

---
Source: https://tomesphere.com/paper/1906.06211