Switching-Geometry Analysis of Deflated Q-Value Iteration
Donghwan Lee

TL;DR
This paper introduces a spectral radius framework for analyzing the convergence of deflated Q-value iteration in Markov decision processes, revealing sharper bounds and geometric insights.
Contribution
It provides the first JSR-based convergence analysis of deflated Q-VI, showing how deflation leads to more precise convergence rate characterization.
Findings
JSR of standard Q-VI equals the discount factor γ
Projected switching system can have JSR less than γ
Deflation yields sharper convergence bounds without changing optimal policies
Abstract
This paper develops a joint spectral radius (JSR) framework for analyzing rank-one deflated Q-value iteration (Q-VI) in discounted Markov decision process control. Focusing on an all-ones residual correction, we interpret the resulting algorithm through the geometry of switching systems and, to the best of our knowledge, give the first JSR-based convergence analysis of deflated Q-VI for policy optimization problems. Our analysis reveals that the standard Q-VI switching system model has JSR exactly the discount factor , since all admissible subsystems share the all-ones vector as an invariant direction. By passing to the quotient space that removes this direction, we obtain a projected switching system model whose JSR governs the relevant error dynamics and may be strictly smaller than . Therefore, the deflated Q-VI admits a potentially sharper convergence-rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
