Deep Spectral Q-learning with Application to Mobile Health

Yuhe Gao; Chengchun Shi; Rui Song

arXiv:2301.00927·stat.ML·January 4, 2023

Deep Spectral Q-learning with Application to Mobile Health

Yuhe Gao, Chengchun Shi, Rui Song

PDF

Open Access

TL;DR

This paper introduces a deep spectral Q-learning method that combines PCA with deep Q-learning to optimize personalized treatment strategies in mobile health, effectively handling mixed frequency data and demonstrating theoretical convergence and practical utility.

Contribution

It presents a novel deep spectral Q-learning algorithm integrating PCA for mobile health applications, with proven convergence properties and demonstrated effectiveness through simulations and real data.

Findings

01

The mean return under the estimated policy converges to the optimal.

02

The method performs well in simulations and real diabetes data.

03

Theoretical convergence rate is established.

Abstract

Dynamic treatment regimes assign personalized treatments to patients sequentially over time based on their baseline information and time-varying covariates. In mobile health applications, these covariates are typically collected at different frequencies over a long time horizon. In this paper, we propose a deep spectral Q-learning algorithm, which integrates principal component analysis (PCA) with deep Q-learning to handle the mixed frequency data. In theory, we prove that the mean return under the estimated optimal policy converges to that under the optimal one and establish its rate of convergence. The usefulness of our proposal is further illustrated via simulations and an application to a diabetes dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference

MethodsQ-Learning