Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift
Haiyang Zhao

TL;DR
This paper studies how visual model-based reinforcement learning agents can adapt to distribution shift, and proposes a local expert growth method that improves out-of-distribution performance without degrading in-distribution performance.
Contribution
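The paper introduces JEPA-Indexed Local Expert Growth, a method that incrementally adds cluster-specific residual experts, indexed by a frozen JEPA representation, to correct actions under distribution shift in visual MBRL while leaving the baseline controller unmodified.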
Findings
The proposed method achieves significant OOD improvements across multiple shift conditions.
Local experts remain useful when the same shift is encountered again, indicating incremental knowledge growth.
Automatic in-distribution rejection is feasible with simple density models.
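As an illustration of the last finding, here is a minimal sketch of in-distribution rejection with a simple density model. The summary does not say which density model the paper uses; the diagonal Gaussian, the quantile threshold, and the names `DiagonalGaussianGate`, `score`, and `is_in_distribution` are all assumptions made for this example.

```python
import math
import statistics


class DiagonalGaussianGate:
    """Hypothetical ID gate: fits a diagonal Gaussian to ID embeddings.

    Assumption: a per-dimension Gaussian stands in for the "simple density
    model"; the paper's actual choice is not stated in this summary.
    """

    def __init__(self, id_embeddings, quantile=0.95, eps=1e-6):
        dims = list(zip(*id_embeddings))  # transpose to per-dimension columns
        self.mean = [statistics.fmean(d) for d in dims]
        # eps keeps the variance strictly positive for constant dimensions
        self.var = [statistics.pvariance(d) + eps for d in dims]
        # Threshold is the chosen quantile of the scores on the ID data itself
        scores = sorted(self.score(z) for z in id_embeddings)
        k = min(len(scores) - 1, int(quantile * len(scores)))
        self.threshold = scores[k]

    def score(self, z):
        """Negative log-density up to a constant; higher means less ID-like."""
        return 0.5 * sum(
            (x - m) ** 2 / v + math.log(v)
            for x, m, v in zip(z, self.mean, self.var)
        )

    def is_in_distribution(self, z):
        return self.score(z) <= self.threshold


# Toy 2-D embeddings clustered near the origin (made-up data)
id_data = [[0.0, 0.0], [0.1, -0.1], [-0.1, 0.1], [0.05, 0.05], [-0.05, -0.05]]
gate = DiagonalGaussianGate(id_data)
print(gate.is_in_distribution([0.0, 0.0]))    # near the ID data
print(gate.is_in_distribution([10.0, 10.0]))  # far from the ID data
```

A gate of this kind would let the agent leave the original controller untouched whenever an observation looks in-distribution, so that corrections are applied only under shift.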
Abstract
Visual model-based reinforcement learning (MBRL) agents can perform well on the training distribution, but often break down once the test environment shifts. In visual MBRL, recognizing that a shift has occurred is often the easier part; the harder part is turning that recognition into useful action-level correction. We study several ways of responding to shift, including planning penalties, direct fine-tuning, global residual correction, and coarse gating. In our experiments, these approaches either do not improve closed-loop control or hurt in-distribution (ID) performance. Based on these negative results, we propose JEPA-Indexed Local Expert Growth. The method uses a frozen JEPA representation only for problem indexing, while cluster-specific residual experts add local action corrections on top of the original controller. The baseline controller itself is not modified. Using…
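The indexing-plus-residual structure described in the abstract can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the paper's implementation: the nearest-centroid lookup, the `radius` cutoff, and the names `LocalExpertController`, `grow`, and `act` are hypothetical, and the frozen JEPA encoder is represented only by a precomputed embedding `z`.

```python
import math


def nearest_cluster(z, centroids):
    """Return (index, distance) of the centroid closest to embedding z."""
    dists = [math.dist(z, c) for c in centroids]
    idx = min(range(len(dists)), key=dists.__getitem__)
    return idx, dists[idx]


class LocalExpertController:
    """Hypothetical wrapper: base controller plus cluster-specific residuals.

    The base policy is never modified; experts only add a local correction
    when the embedding falls close to a cluster they were grown for.
    """

    def __init__(self, base_policy, radius):
        self.base_policy = base_policy
        self.radius = radius  # assumed cutoff for "belongs to this cluster"
        self.centroids = []
        self.experts = []

    def grow(self, centroid, expert):
        """Add one local expert for a newly identified cluster."""
        self.centroids.append(list(centroid))
        self.experts.append(expert)

    def act(self, obs, z):
        """z is the embedding of obs from a frozen encoder (assumed given)."""
        action = self.base_policy(obs)
        if not self.centroids:
            return action
        idx, dist = nearest_cluster(z, self.centroids)
        if dist > self.radius:
            return action  # no matching cluster: fall back to the base action
        residual = self.experts[idx](obs)
        return [a + r for a, r in zip(action, residual)]


# Toy usage with made-up policies and embeddings
base = lambda obs: [0.0, 1.0]
ctrl = LocalExpertController(base, radius=1.0)
ctrl.grow([5.0, 5.0], lambda obs: [0.5, -0.5])
print(ctrl.act(None, [0.0, 0.0]))  # far from the cluster: base action only
print(ctrl.act(None, [5.2, 4.9]))  # inside the cluster: base plus residual
```

Because experts are only appended and the base policy is called unchanged, observations that match no cluster get exactly the original action, which is one way to read the claim that in-distribution behavior is preserved.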
