Switching-Geometry Analysis of Deflated Q-Value Iteration

Donghwan Lee

arXiv:2605.10811·math.OC·May 19, 2026

Switching-Geometry Analysis of Deflated Q-Value Iteration

Donghwan Lee

PDF

TL;DR

This paper introduces a spectral radius framework for analyzing the convergence of deflated Q-value iteration in Markov decision processes, revealing sharper bounds and geometric insights.

Contribution

It provides the first JSR-based convergence analysis of deflated Q-VI, showing how deflation leads to more precise convergence rate characterization.

Findings

01

JSR of standard Q-VI equals the discount factor γ

02

Projected switching system can have JSR less than γ

03

Deflation yields sharper convergence bounds without changing optimal policies

Abstract

This paper develops a joint spectral radius (JSR) framework for analyzing rank-one deflated Q-value iteration (Q-VI) in discounted Markov decision process control. Focusing on an all-ones residual correction, we interpret the resulting algorithm through the geometry of switching systems and, to the best of our knowledge, give the first JSR-based convergence analysis of deflated Q-VI for policy optimization problems. Our analysis reveals that the standard Q-VI switching system model has JSR exactly the discount factor $γ \in (0, 1)$ , since all admissible subsystems share the all-ones vector as an invariant direction. By passing to the quotient space that removes this direction, we obtain a projected switching system model whose JSR governs the relevant error dynamics and may be strictly smaller than $γ$ . Therefore, the deflated Q-VI admits a potentially sharper convergence-rate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.