Planning Under Observation Mismatch for Traffic Signal Control via Adaptive Modular World Models
Zherui Huang, Yicheng Liu, Chumeng Liang, Guanjie Zheng

TL;DR
This paper introduces Adaptive Modularized Model (AMM), a modular planning system that enables transferability in decision-making under observation mismatch, demonstrated on traffic signal control.
Contribution
The paper proposes a novel modular architecture with a domain-specific observation adapter and a shared dynamics model, facilitating transfer learning and fast adaptation.
Findings
AMM outperforms existing controllers in traffic signal tasks.
AMM achieves higher data efficiency in cross-domain transfer.
Experiments validate improved performance and adaptability.
Abstract
Deploying learned decision-making systems often requires transferring to new sites where the sensing pipeline differs. In such cases, observations can change in semantics and dimensionality even when action primitives and objectives remain comparable. In this work, we study transferable model-based planning under this observation mismatch, which remains challenging for existing learning-based approaches. We propose Adaptive Modularized Model (AMM), a modular planning architecture that separates a domain-specific observation adapter from a shared internal dynamics model defined in a common planning state space. The dynamics model is meta-learned from multiple source domains to enable fast adaptation with limited target interaction. At run time, AMM performs receding-horizon planning by rolling out candidate action sequences under the learned dynamics and selecting actions that optimize a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
