Policy-DRIFT: Dynamic Reward-Informed Flow Trajectory Steering

Atharva Mahajan; Abhijeet Vishwasrao; Yuning Wang; Ricardo Vinuesa

arXiv:2605.14022·physics.flu-dyn·May 15, 2026

Policy-DRIFT: Dynamic Reward-Informed Flow Trajectory Steering

Atharva Mahajan, Abhijeet Vishwasrao, Yuning Wang, Ricardo Vinuesa

PDF

TL;DR

Policy-DRIFT introduces a physics-grounded, reward-informed flow control method using generative models and terminal reward guidance, achieving significant drag reduction in turbulent channel flow with less energy.

Contribution

It presents a novel approach combining generative modeling with reinforcement learning to surpass existing performance limits in flow control.

Findings

01

Achieves 49% drag reduction in turbulent flow

02

Outperforms previous DRL benchmarks by 16%

03

Consumes 37 times less actuation energy

Abstract

Skin-friction drag induced by wall-bounded turbulent flows accounts for a substantial fraction of energy consumption across commercial aerospace, wind energy, and marine transport. Its active reduction is one of the highest-value targets in engineering fluid dynamics. Deep reinforcement learning (DRL) has emerged as the leading approach for real-time flow control, yet its performance ceiling is set not by algorithmic capability but by reward structure, the naive scalar objective does not optimally reflect the underlying physics. Policy-DRIFT bypasses this ceiling by relocating reward information from policy gradients to generative model inference: a conditional flow matching model (CFM) constructs a physically-grounded manifold of realisable flow states spanning multiple control regimes, Terminal Reward Guidance (TRG) steers samples toward reward-maximising targets at inference, and a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.