Combining Offline Causal Inference and Online Bandit Learning for Data Driven Decision
Li Ye, Yishi Lin, Hong Xie, John C.S. Lui

TL;DR
This paper introduces a unified framework combining offline causal inference with online bandit learning to improve decision-making using both logged and streaming data, with theoretical guarantees and empirical validation.
Contribution
It presents novel algorithms that integrate offline and online methods, along with the first regret bounds for forest-based bandit algorithms, enhancing decision accuracy.
Findings
Algorithms outperform methods using only logged or online data.
First upper regret bound established for forest-based bandit algorithms.
Empirical results on real datasets demonstrate superior performance.
Abstract
A fundamental question for companies with large amount of logged data is: How to use such logged data together with incoming streaming data to make good decisions? Many companies currently make decisions via online A/B tests, but wrong decisions during testing hurt users' experiences and cause irreversible damage. A typical alternative is offline causal inference, which analyzes logged data alone to make decisions. However, these decisions are not adaptive to the new incoming data, and so a wrong decision will continuously hurt users' experiences. To overcome the aforementioned limitations, we propose a framework to unify offline causal inference algorithms (e.g., weighting, matching) and online learning algorithms (e.g., UCB, LinUCB). We propose novel algorithms and derive bounds on the decision accuracy via the notion of "regret". We derive the first upper regret bound for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Machine Learning and Data Classification
MethodsCausal inference
