Tensor-Efficient High-Dimensional Q-learning

Junyi Wu; Dan Li

arXiv:2511.03595·cs.LG·April 9, 2026

Tensor-Efficient High-Dimensional Q-learning

Junyi Wu, Dan Li

PDF

TL;DR

Tensor-Efficient Q-Learning (TEQL) leverages low-rank tensor structures in high-dimensional RL to improve sample efficiency and exploration, outperforming traditional methods in resource-constrained settings.

Contribution

The paper introduces TEQL, a novel tensor-based Q-learning algorithm that explicitly exploits low-rank structure for efficient exploration and learning in high-dimensional spaces.

Findings

01

TEQL outperforms matrix-based low-rank and deep RL methods in sample efficiency.

02

TEQL effectively exploits low-rank tensor structure for better exploration.

03

Experiments on control tasks validate TEQL's efficiency under limited sampling budgets.

Abstract

High-dimensional reinforcement learning(RL) faces challenges with complex calculations and low sample efficiency in large state-action spaces. Q-learning algorithms struggle particularly with the curse of dimensionality, where the number of state-action pairs grows exponentially with problem size. While neural network-based approaches like Deep Q-Networks have shown success, they do not explicitly exploit problem structure. Many high-dimensional control tasks exhibit low-rank structure in their value functions, and tensor-based methods using low-rank decomposition offer parameter-efficient representations. However, existing tensor-based Q-learning methods focus on representation fidelity without leveraging this structure for exploration. We propose Tensor-Efficient Q-Learning (TEQL), which represents the Q-function as a low-rank CP tensor over discretized state-action spaces and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.