Mechanisms of Misgeneralization in Physical Sequence Modeling

Kento Nishi; Raphael Tang; Karun Kumar; Core Francisco Park; Hidenori Tanaka

arXiv:2605.20299·cs.LG·May 21, 2026

Mechanisms of Misgeneralization in Physical Sequence Modeling

Kento Nishi, Raphael Tang, Karun Kumar, Core Francisco Park, Hidenori Tanaka

PDF

TL;DR

This paper investigates why deep sequence models trained on physical data often produce plausible trajectories that do not match the intended distribution over physical quantities, revealing a mechanism called physical misgeneralization.

Contribution

It introduces the concept of physical misgeneralization, analyzes its mechanism using synthetic tasks, and proposes kernel-informed interventions to mitigate it.

Findings

01

Physical misgeneralization occurs when local errors propagate to shift distribution.

02

A data deviation kernel predicts which physical quantities are affected.

03

Mechanistic insights guide promising mitigation strategies.

Abstract

Generative sequence models are often trained to plan motion in physical domains, from robotics to mechanical simulations. When constructing a dataset to train such a model, engineers may curate demonstrations to specify how trajectories should be distributed over a physical quantity like travel distance or mechanical energy. For example, a roboticist building a maze navigation agent might choose demonstrations whose travel distances cover a fixed range uniformly, hoping to constrain the agent's expected power usage. We find that standard deep learning can violate this intent: each generated trajectory can seem plausible on its own, but the aggregate distribution over the physical quantity is wrong. We call this failure physical misgeneralization, and develop an account of its mechanism. Using controlled synthetic tasks, we show that physical misgeneralization arises when local errors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.