Information Theoretic Counterfactual Learning from Missing-Not-At-Random   Feedback

Zifeng Wang; Xi Chen; Rui Wen; Shao-Lun Huang; Ercan E.; Kuruoglu; Yefeng Zheng

arXiv:2009.02623·cs.LG·October 20, 2020·28 cites

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback

Zifeng Wang, Xi Chen, Rui Wen, Shao-Lun Huang, Ercan E., Kuruoglu, Yefeng Zheng

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces an information-theoretic counterfactual variational information bottleneck (CVIB) to improve debiasing in recommendation systems dealing with missing-not-at-random data, without relying on costly randomized controlled trials.

Contribution

It proposes a novel CVIB method that separates factual and counterfactual information, enabling effective debiasing without RCTs in recommendation systems.

Findings

01

CVIB significantly improves model performance on real-world datasets.

02

The method facilitates balanced learning between factual and counterfactual domains.

03

Empirical results demonstrate the effectiveness of CVIB in counterfactual learning.

Abstract

Counterfactual learning for dealing with missing-not-at-random data (MNAR) is an intriguing topic in the recommendation literature since MNAR data are ubiquitous in modern recommender systems. Missing-at-random (MAR) data, namely randomized controlled trials (RCTs), are usually required by most previous counterfactual learning methods for debiasing learning. However, the execution of RCTs is extraordinarily expensive in practice. To circumvent the use of RCTs, we build an information-theoretic counterfactual variational information bottleneck (CVIB), as an alternative for debiasing learning without RCTs. By separating the task-aware mutual information term in the original information bottleneck Lagrangian into factual and counterfactual parts, we derive a contrastive information loss and an additional output confidence penalty, which facilitates balanced learning between the factual and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RyanWangZf/CVIB-Rec
pytorchOfficial

Videos

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Recommender Systems and Techniques · Advanced Bandit Algorithms Research