Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

Elynn Chen; Xi Chen; Wenbo Jing

arXiv:2404.15209·cs.LG·January 13, 2026

Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

Elynn Chen, Xi Chen, Wenbo Jing

PDF

Open Access

TL;DR

This paper introduces a framework for transferring knowledge in batch $Q^*$ learning within MDPs, improving decision-making efficiency by leveraging source data to address data scarcity in new tasks.

Contribution

It proposes a Transferred Fitted $Q$-Iteration algorithm with function approximation and analyzes how task discrepancy affects transfer performance.

Findings

01

The method improves learning error rates over single-task approaches.

02

Theoretical analysis links performance to task discrepancy and sample sizes.

03

Empirical results confirm the effectiveness of the transfer approach.

Abstract

In data-driven decision-making in marketing, healthcare, and education, it is desirable to utilize a large amount of data from existing ventures to navigate high-dimensional feature spaces and address data scarcity in new ventures. We explore knowledge transfer in dynamic decision-making by concentrating on batch stationary environments and formally defining task discrepancies through the lens of Markov decision processes (MDPs). We propose a framework of Transferred Fitted $Q$ -Iteration algorithm with general function approximation, enabling the direct estimation of the optimal action-state function $Q^{*}$ using both target and source data. We establish the relationship between statistical performance and MDP task discrepancy under sieve approximation, shedding light on the impact of source and target sample sizes and task discrepancy on the effectiveness of knowledge transfer. We show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification