Matrix-Space Reinforcement Learning for Reusing Local Transition Geometry

Zuyuan Zhang; Carlee Joe-Wong; Tian Lan

arXiv:2605.14304·cs.LG·May 15, 2026

Matrix-Space Reinforcement Learning for Reusing Local Transition Geometry

Zuyuan Zhang, Carlee Joe-Wong, Tian Lan

PDF

TL;DR

This paper introduces Matrix-Space Reinforcement Learning (MSRL), a geometric abstraction that captures local transition dynamics for improved transfer and generalization in sequential decision-making tasks.

Contribution

MSRL provides a novel matrix-based descriptor of trajectory segments that supports algebraic composition, transfer, and smooth value function approximation, enhancing reinforcement learning methods.

Findings

01

MSRL achieves a target AUC of 0.73, outperforming baseline methods.

02

The descriptor is well-defined up to coordinate gauge and complete for certain additive signals.

03

Conditioning value functions on MSRL descriptors enables effective transfer in new tasks.

Abstract

Compositional generalization in sequential decision-making requires identifying which parts of prior rollouts remain useful for new tasks. Existing methods reuse skills or predictive models, but often overlook rich local transition geometry and dynamics. We propose Matrix-Space Reinforcement Learning (MSRL), a geometric abstraction that represents trajectory segments through positive semidefinite matrix descriptors aggregating first- and second-order statistics of lifted one-step transitions. These descriptors expose shared hidden structure, support algebraic composition in an abstract matrix space, and reveal opportunities for transfer. We prove that the descriptor is well defined up to coordinate gauge, complete for the induced low-order additive signal class, additive under valid segment composition, and minimally sufficient among admissible additive descriptors. We further show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.