Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Nate Gillman; Yinghua Zhou; Zitian Tang; Evan Luo; Arjan Chakravarthy; Daksh Aggarwal; Michael Freeman; Charles Herrmann; Chen Sun

arXiv:2601.05848·cs.CV·March 24, 2026

Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Nate Gillman, Yinghua Zhou, Zitian Tang, Evan Luo, Arjan Chakravarthy, Daksh Aggarwal, Michael Freeman, Charles Herrmann, Chen Sun

PDF

Open Access 1 Datasets

TL;DR

This paper introduces Goal Force, a framework for teaching video models to understand and generate physics-conditioned goals using explicit force vectors, enabling better physical reasoning and planning in complex scenarios.

Contribution

The paper proposes a novel force-based goal specification method and trains a video model on synthetic physics data, demonstrating zero-shot generalization to real-world physics tasks.

Findings

01

Model generalizes to complex real-world scenarios

02

Grounding in physical interactions enables physics-aware planning

03

Zero-shot transfer from synthetic to real-world physics tasks

Abstract

Recent advancements in video generation have enabled the development of ``world models'' capable of simulating potential futures for robotics and planning. However, specifying precise goals for these models remains a challenge; text instructions are often too abstract to capture physical nuances, while target images are frequently infeasible to specify for dynamic tasks. To address this, we introduce Goal Force, a novel framework that allows users to define goals via explicit force vectors and intermediate dynamics, mirroring how humans conceptualize physical tasks. We train a video generation model on a curated dataset of synthetic causal primitives-such as elastic collisions and falling dominos-teaching it to propagate forces through time and space. Despite being trained on simple physics data, our model exhibits remarkable zero-shot generalization to complex, real-world scenarios,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

brown-palm/goal-force-training-datasets
dataset· 33 dl
33 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis