VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction

Yadi Cao; Yuxuan Liu; Liu Yang; Rose Yu; Hayden Schaeffer; Stanley Osher

arXiv:2411.16063·cs.LG·January 16, 2026

VICON: Vision In-Context Operator Networks for Multi-Physics Fluid Dynamics Prediction

Yadi Cao, Yuxuan Liu, Liu Yang, Rose Yu, Hayden Schaeffer, Stanley Osher

PDF

Open Access 1 Repo

TL;DR

VICON introduces a vision transformer-based approach for efficient multi-physics fluid dynamics prediction, significantly improving accuracy and robustness over state-of-the-art methods in realistic, variable-timestep scenarios.

Contribution

The paper presents VICON, a novel vision transformer-based architecture that enhances ICONs for dense 2D data, enabling faster, more accurate, and more robust fluid dynamics predictions across varying conditions.

Findings

01

VICON reduces rollout error by up to 44.7% compared to baselines.

02

VICON requires significantly less inference time, up to 72.5% of baseline times.

03

VICON maintains high performance even with irregular sampling and dropped frames.

Abstract

In-Context Operator Networks (ICONs) have demonstrated the ability to learn operators across diverse partial differential equations using few-shot, in-context learning. However, existing ICONs process each spatial point as an individual token, severely limiting computational efficiency when handling dense data in higher spatial dimensions. We propose Vision In-Context Operator Networks (VICON), which integrates vision transformer architectures to efficiently process 2D data through patch-wise operations while preserving ICON's adaptability to multiphysics systems and varying timesteps. Evaluated across three fluid dynamics benchmarks, VICON significantly outperforms state-of-the-art baselines: DPOT and MPP, reducing the averaged last-step rollout error by 37.9% compared to DPOT and 44.7% compared to MPP, while requiring only 72.5% and 34.8% of their respective inference times. VICON…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eydcao/vicon
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Neural Network Applications · Autonomous Vehicle Technology and Safety

MethodsLinear Layer · Residual Connection · Softmax · Attention Is All You Need · Multi-Head Attention · Dense Connections · Layer Normalization · Vision Transformer