A Fast Convergence Theory for Offline Decision Making

Chenjie Mao; Qiaosheng Zhang

arXiv:2406.01378·cs.LG·December 4, 2024

A Fast Convergence Theory for Offline Decision Making

Chenjie Mao, Qiaosheng Zhang

PDF

Open Access

TL;DR

This paper introduces a unified framework and a new algorithm for offline decision making, providing the first generic fast convergence guarantees in function approximation settings for problems like offline RL and OPE.

Contribution

It proposes the DMOF framework and the EDD algorithm, establishing instance-dependent bounds and demonstrating fast convergence with dataset size, supported by a lower bound analysis.

Findings

01

EOEC measures problem correlation and decreases at rate 1/N with dataset size.

02

EDD achieves fast convergence guarantees under partial coverage assumptions.

03

Lower bounds validate the theoretical soundness of the proposed approach.

Abstract

This paper proposes the first generic fast convergence result in general function approximation for offline decision making problems, which include offline reinforcement learning (RL) and off-policy evaluation (OPE) as special cases. To unify different settings, we introduce a framework called Decision Making with Offline Feedback (DMOF), which captures a wide range of offline decision making problems. Within this framework, we propose a simple yet powerful algorithm called Empirical Decision with Divergence (EDD), whose upper bound can be termed as a coefficient named Empirical Offline Estimation Coefficient (EOEC). We show that EOEC is instance-dependent and actually measures the correlation of the problem. When assuming partial coverage in the dataset, EOEC will reduce in a rate of $1/ N$ where $N$ is the size of the dataset, endowing EDD with a fast convergence guarantee. Finally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making