CoFL: Continuous Flow Fields for Language-Conditioned Navigation

Haokun Liu; Zhaoqi Ma; Yicheng Chen; Masaki Kitagawa; Wentao Zhang; Zicen Xiong; Jinjie Li; Moju Zhao

arXiv:2603.02854·cs.RO·April 30, 2026

CoFL: Continuous Flow Fields for Language-Conditioned Navigation

Haokun Liu, Zhaoqi Ma, Yicheng Chen, Masaki Kitagawa, Wentao Zhang, Zicen Xiong, Jinjie Li, Moju Zhao

PDF

TL;DR

CoFL introduces an end-to-end navigation policy that learns continuous flow fields from BEV observations and language instructions, enabling real-time, zero-shot deployment in unseen scenes with improved safety and precision.

Contribution

It reformulates language-conditioned navigation as workspace-conditioned flow field learning, enabling dense spatial control supervision and robust real-time navigation.

Findings

01

Outperforms modular VLM-based planners in unseen scenes

02

Enables real-time, zero-shot deployment in real-world environments

03

Builds a large dataset of 500k BEV instruction-flow pairs for training

Abstract

Existing language-conditioned navigation systems typically rely on modular pipelines or trajectory generators, but the latter use each scene--instruction annotation mainly to supervise one start-conditioned rollout. To address these limitations, we present CoFL, an end-to-end policy that maps a bird's-eye view (BEV) observation and a language instruction to a continuous flow field for navigation. CoFL reformulates navigation as workspace-conditioned field learning rather than start-conditioned trajectory prediction: it learns local motion vectors at arbitrary BEV locations, turning each scene--instruction annotation into dense spatial control supervision. Trajectories are generated from any start by numerical integration of the predicted field, enabling simple real-time rollout and closed-loop recovery. To enable large-scale training and evaluation, we build a dataset of over 500k BEV…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.