Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift

Haiyang Zhao

arXiv:2604.27411 · cs.LG · May 1, 2026

TL;DR

This paper studies how visual model-based reinforcement learning agents can adapt to distribution shift, and proposes a local expert growth method that improves out-of-distribution (OOD) performance without degrading in-distribution (ID) performance.

Contribution

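The paper introduces JEPA-Indexed Local Expert Growth, a method for visual MBRL that uses a frozen JEPA representation only to index the problem and incrementally adds cluster-specific residual experts, which apply local action corrections on top of an unmodified baseline controller.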

Findings

01

The proposed method achieves significant OOD improvements across multiple shift conditions.

02

Local experts remain useful when the same shift is encountered again, indicating incremental knowledge growth.

03

Automatic in-distribution rejection is feasible with simple density models.
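As a rough illustration of the third finding, the sketch below fits a diagonal Gaussian to in-distribution embeddings and treats any input whose log-density falls below a low quantile of the training scores as shifted. The class name GaussianIDGate, the 1% quantile, and the synthetic 8-dimensional embeddings are all assumptions made for this example; the summary does not say which density model or threshold the paper uses.

```python
import numpy as np


class GaussianIDGate:
    """Hypothetical helper: diagonal Gaussian density over ID embeddings."""

    def fit(self, z_id, quantile=0.01):
        # Per-dimension mean and variance of the in-distribution embeddings.
        self.mean = z_id.mean(axis=0)
        self.var = z_id.var(axis=0) + 1e-6  # small floor avoids division by zero
        # Threshold at a low quantile of the ID scores (assumed value: 1%).
        self.threshold = np.quantile(self.log_density(z_id), quantile)
        return self

    def log_density(self, z):
        z = np.atleast_2d(z)
        diff = z - self.mean
        return -0.5 * np.sum(diff**2 / self.var + np.log(2 * np.pi * self.var), axis=1)

    def is_in_distribution(self, z):
        # True means "looks like training data, leave the controller alone".
        return self.log_density(z) >= self.threshold


# Synthetic stand-ins for embeddings; not data from the paper.
rng = np.random.default_rng(0)
z_id = rng.normal(0.0, 1.0, size=(2000, 8))
z_shifted = rng.normal(4.0, 1.0, size=(200, 8))

gate = GaussianIDGate().fit(z_id)
id_accept_rate = gate.is_in_distribution(z_id).mean()
ood_accept_rate = gate.is_in_distribution(z_shifted).mean()
print(f"ID accepted: {id_accept_rate:.3f}, shifted accepted: {ood_accept_rate:.3f}")
```

In a setup like this, inputs accepted by the gate would bypass any correction, which is one way the ID behavior of the baseline could be left untouched.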

Abstract

Visual model-based reinforcement learning (MBRL) agents can perform well on the training distribution, but often break down once the test environment shifts. In visual MBRL, recognizing that a shift has occurred is often the easier part; the harder part is turning that recognition into useful action-level correction. We study several ways of responding to shift, including planning penalties, direct fine-tuning, global residual correction, and coarse gating. In our experiments, these approaches either do not improve closed-loop control or hurt in-distribution (ID) performance. Based on these negative results, we propose JEPA-Indexed Local Expert Growth. The method uses a frozen JEPA representation only for problem indexing, while cluster-specific residual experts add local action corrections on top of the original controller. The baseline controller itself is not modified. Using…
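The routing idea described in the abstract can be sketched as follows, under strong simplifying assumptions: the frozen encoder is replaced by an identity function, clusters are indexed by nearest centroid within a fixed radius, and each expert is a linear residual whose parameters are set by hand. None of these choices, nor the names LocalExpertController, grow, and new_cluster_radius, come from the paper, and how the experts are actually trained is not covered here.

```python
import numpy as np


class LocalExpertController:
    """Hypothetical sketch: frozen base policy plus cluster-indexed residual experts."""

    def __init__(self, base_policy, encoder, new_cluster_radius=2.0):
        self.base_policy = base_policy  # never modified
        self.encoder = encoder          # frozen; used only to pick a cluster
        self.radius = new_cluster_radius
        self.centroids = []             # one embedding per cluster
        self.experts = []               # one (W, b) linear residual per cluster

    def nearest_cluster(self, z):
        if not self.centroids:
            return None
        dists = np.linalg.norm(np.stack(self.centroids) - z, axis=1)
        k = int(np.argmin(dists))
        # Outside every cluster radius: no expert applies.
        return k if dists[k] <= self.radius else None

    def grow(self, z, action_dim):
        # Add a new cluster with a zero-initialized residual expert.
        self.centroids.append(z.copy())
        self.experts.append((np.zeros((action_dim, z.shape[0])), np.zeros(action_dim)))
        return len(self.experts) - 1

    def act(self, obs):
        a_base = self.base_policy(obs)
        k = self.nearest_cluster(self.encoder(obs))
        if k is None:
            return a_base  # unmatched inputs get the original action
        W, b = self.experts[k]
        return a_base + W @ self.encoder(obs) + b  # local action correction


# Toy stand-ins (assumptions): a fixed policy and an identity "encoder".
base_policy = lambda obs: np.tanh(obs[:2])
encoder = lambda obs: obs

ctrl = LocalExpertController(base_policy, encoder)
obs_id = np.zeros(4)
obs_shift = np.full(4, 5.0)

k = ctrl.grow(encoder(obs_shift), action_dim=2)
ctrl.experts[k] = (np.zeros((2, 4)), np.array([0.1, -0.2]))  # hand-set correction

print(ctrl.act(obs_id), ctrl.act(obs_shift))
```

Because the correction is added only when an input falls inside a known cluster, the baseline action is returned unchanged everywhere else, and a stored expert is reused whenever the same region of embedding space is visited again.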
