Policy-DRIFT: Dynamic Reward-Informed Flow Trajectory Steering
Atharva Mahajan, Abhijeet Vishwasrao, Yuning Wang, Ricardo Vinuesa

TL;DR
Policy-DRIFT introduces a physics-grounded, reward-informed flow control method using generative models and terminal reward guidance, achieving significant drag reduction in turbulent channel flow with less energy.
Contribution
It presents a novel approach combining generative modeling with reinforcement learning to surpass existing performance limits in flow control.
Findings
Achieves 49% drag reduction in turbulent flow
Outperforms previous DRL benchmarks by 16%
Consumes 37 times less actuation energy
Abstract
Skin-friction drag induced by wall-bounded turbulent flows accounts for a substantial fraction of energy consumption across commercial aerospace, wind energy, and marine transport. Its active reduction is one of the highest-value targets in engineering fluid dynamics. Deep reinforcement learning (DRL) has emerged as the leading approach for real-time flow control, yet its performance ceiling is set not by algorithmic capability but by reward structure, the naive scalar objective does not optimally reflect the underlying physics. Policy-DRIFT bypasses this ceiling by relocating reward information from policy gradients to generative model inference: a conditional flow matching model (CFM) constructs a physically-grounded manifold of realisable flow states spanning multiple control regimes, Terminal Reward Guidance (TRG) steers samples toward reward-maximising targets at inference, and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
