Transfer Learning in Latent Contextual Bandits with Covariate Shift   Through Causal Transportability

Mingwei Deng; Ville Kyrki; Dominik Baumann

arXiv:2502.20153·cs.LG·February 28, 2025

Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability

Mingwei Deng, Ville Kyrki, Dominik Baumann

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal inference-based transfer learning framework for latent contextual bandits under covariate shift, improving learning efficiency by avoiding negative transfer through effective knowledge transfer.

Contribution

It develops algorithms that leverage causal transportability and variational autoencoders to transfer relevant knowledge in high-dimensional proxy settings, addressing negative transfer.

Findings

01

Empirically improved learning efficiency on synthetic datasets.

02

Effective knowledge transfer reduces negative transfer.

03

Demonstrates the utility of causal transportability in bandit settings.

Abstract

Transferring knowledge from one environment to another is an essential ability of intelligent systems. Nevertheless, when two environments are different, naively transferring all knowledge may deteriorate the performance, a phenomenon known as negative transfer. In this paper, we address this issue within the framework of multi-armed bandits from the perspective of causal inference. Specifically, we consider transfer learning in latent contextual bandits, where the actual context is hidden, but a potentially high-dimensional proxy is observable. We further consider a covariate shift in the context across environments. We show that naively transferring all knowledge for classical bandit algorithms in this setting led to negative transfer. We then leverage transportability theory from causal inference to develop algorithms that explicitly transfer effective knowledge for estimating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Marksmug/CEVAE-Transportability
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference

MethodsCausal inference