Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Yalei Du, Lin F. Yang, Ruosong Wang

TL;DR
This paper establishes tight bounds on the sample complexity for misspecified sparse linear Q-learning, demonstrating that polynomial sample complexity is achievable under certain sparsity and misspecification conditions, unlike the non-sparse case.
Contribution
The paper introduces a novel elimination-based algorithm for sparse linear Q-learning and proves tight upper and lower bounds on its sample complexity.
Findings
Achieves $O(H\,\epsilon)$-optimal policy with polynomial sample complexity in feature dimension and horizon.
Standard Bellman backup approaches are suboptimal for this problem.
Provides a matching lower bound, completing the theoretical understanding of the problem.
Abstract
The recent work by Dong & Yang (2023) showed for misspecified sparse linear bandits, one can obtain an -optimal policy using a polynomial number of samples when the sparsity is a constant, where is the misspecification error. This result is in sharp contrast to misspecified linear bandits without sparsity, which require an exponential number of samples to get the same guarantee. In order to study whether the analog result is possible in the reinforcement learning setting, we consider the following problem: assuming the optimal -function is a -dimensional linear function with sparsity and misspecification error , whether we can obtain an -optimal policy using number of samples polynomially in the feature dimension . We first demonstrate why the standard approach based on Bellman backup or the existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques
