# The Label Complexity of Active Learning from Observational Data

**Authors:** Songbai Yan, Kamalika Chaudhuri, Tara Javidi

arXiv: 1905.12791 · 2019-10-29

## TL;DR

This paper introduces a more label-efficient active learning algorithm for counterfactual learning from observational data, improving over previous methods by integrating a better risk minimizer and ensuring statistical consistency.

## Contribution

It develops a novel active learning approach that incorporates an efficient counterfactual risk minimizer, reducing label complexity and improving statistical guarantees.

## Key findings

- More label-efficient than prior methods
- Statistically consistent active learning algorithm
- Improved counterfactual risk estimation

## Abstract

Counterfactual learning from observational data involves learning a classifier on an entire population based on data that is observed conditioned on a selection policy. This work considers this problem in an active setting, where the learner additionally has access to unlabeled examples and can choose to get a subset of these labeled by an oracle.   Prior work on this problem uses disagreement-based active learning, along with an importance weighted loss estimator to account for counterfactuals, which leads to a high label complexity. We show how to instead incorporate a more efficient counterfactual risk minimizer into the active learning algorithm. This requires us to modify both the counterfactual risk to make it amenable to active learning, as well as the active learning process to make it amenable to the risk. We provably demonstrate that the result of this is an algorithm which is statistically consistent as well as more label-efficient than prior work.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.12791/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1905.12791/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1905.12791/full.md

---
Source: https://tomesphere.com/paper/1905.12791