Statistical Inference for Online Decision Making via Stochastic Gradient   Descent

Haoyu Chen; Wenbin Lu; Rui Song

arXiv:2010.07341·stat.ML·October 16, 2020

Statistical Inference for Online Decision Making via Stochastic Gradient Descent

Haoyu Chen, Wenbin Lu, Rui Song

PDF

1 Repo

TL;DR

This paper introduces an efficient online stochastic gradient descent algorithm for decision making that supports parametric reward models, providing statistical inference tools like confidence intervals and hypothesis tests, validated through simulations and real data.

Contribution

It presents a fully online decision-making algorithm with theoretical guarantees for statistical inference, including asymptotic normality and consistent variance estimators.

Findings

01

Algorithm is computationally efficient and supports all parametric reward models.

02

Asymptotic normality of estimators established, enabling inference.

03

Validated through simulations and real-world news recommendation data.

Abstract

Online decision making aims to learn the optimal decision rule by making personalized decisions and updating the decision rule recursively. It has become easier than before with the help of big data, but new challenges also come along. Since the decision rule should be updated once per step, an offline update which uses all the historical data is inefficient in computation and storage. To this end, we propose a completely online algorithm that can make decisions and update the decision rule online via stochastic gradient descent. It is not only efficient but also supports all kinds of parametric reward models. Focusing on the statistical inference of online decision making, we establish the asymptotic normality of the parameter estimator produced by our algorithm and the online inverse probability weighted value estimator we used to estimate the optimal value. Online plugin estimators…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ideechy/Online-Decision-Making
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.