Structure Detection for Contextual Reinforcement Learning

Tianyue Zhou; Jung-Hoon Cho; Cathy Wu

arXiv:2601.08120·cs.LG·March 31, 2026

Structure Detection for Contextual Reinforcement Learning

Tianyue Zhou, Jung-Hoon Cho, Cathy Wu

PDF

TL;DR

This paper introduces a dynamic framework for identifying problem structures in Contextual Reinforcement Learning, enabling adaptive task transfer strategies that improve performance across diverse environments.

Contribution

The work presents SD-MBTL, a novel framework that detects underlying structures in CMDPs and adaptively switches between transfer learning algorithms for better generalization.

Findings

01

M/GP-MBTL outperforms prior methods by 12.49% on aggregated metrics.

02

The framework effectively identifies structures like Mountain in CRL environments.

03

Experiments on synthetic and real benchmarks validate the adaptive approach's superiority.

Abstract

Contextual Reinforcement Learning (CRL) tackles the problem of solving a set of related Contextual Markov Decision Processes (CMDPs) that vary across different context variables. Traditional approaches--independent training and multi-task learning--struggle with either excessive computational costs or negative transfer. A recently proposed multi-policy approach, Model-Based Transfer Learning (MBTL), has demonstrated effectiveness by strategically selecting a few tasks to train and zero-shot transfer. However, CMDPs encompass a wide range of problems, exhibiting structural properties that vary from problem to problem. As such, different task selection strategies are suitable for different CMDPs. In this work, we introduce Structure Detection MBTL (SD-MBTL), a generic framework that dynamically identifies the underlying generalization structure of CMDP and selects an appropriate MBTL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.