Dynamics-Aligned Shared Hypernetworks for Contextual RL under Discontinuous Shifts

Jan Benad; Pradeep Kr. Banerjee; Frank R\"oder; Nihat Ay; Martin V. Butz; Manfred Eppe

arXiv:2602.06550·cs.LG·May 12, 2026

Dynamics-Aligned Shared Hypernetworks for Contextual RL under Discontinuous Shifts

Jan Benad, Pradeep Kr. Banerjee, Frank R\"oder, Nihat Ay, Martin V. Butz, Manfred Eppe

PDF

TL;DR

This paper introduces DMA*-SH, a dynamics-aligned shared hypernetwork framework for zero-shot generalization in contextual reinforcement learning, especially under discontinuous shifts in latent contexts.

Contribution

The paper proposes a novel hypernetwork-based method trained via dynamics prediction that effectively handles discontinuous context shifts in RL environments.

Findings

01

DMA*-SH achieves zero-shot generalization on the Actuator Inversion Benchmark.

02

Outperforms domain randomization by 58.1% on AIB tasks.

03

Surpasses standard context-aware baselines by 11.5% on average.

Abstract

Zero-shot generalization in contextual reinforcement learning remains a core challenge, particularly when the context is latent and must be inferred from data. A canonical failure mode arises when latent context discontinuously changes how actions affect the environment, requiring incompatible control responses across contexts. We propose DMA*-SH, a framework where a single hypernetwork, trained solely via dynamics prediction, generates a small set of adapter weights shared across the dynamics model, policy, and action-value function. This shared modulation imparts an inductive bias matched to discontinuous context-to-dynamics shifts, while input/output normalization and random input masking stabilize context inference, promoting directionally concentrated representations. We provide theoretical support via expressivity separation results for hypernetwork modulation, and a variance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.