Normalizing Flows are Capable Models for RL
Raj Ghugare, Benjamin Eysenbach

TL;DR
This paper demonstrates that normalizing flows are powerful and expressive models suitable for reinforcement learning, offering simpler algorithms and improved performance across various RL tasks.
Contribution
The authors introduce a novel NF architecture that integrates into RL as a policy, Q-function, and occupancy measure, challenging the belief that NFs lack expressivity.
Findings
Achieves higher performance in imitation learning
Performs well in offline and goal-conditioned RL
Enables simpler RL algorithms
Abstract
Modern reinforcement learning (RL) algorithms have found success by using powerful probabilistic models, such as transformers, energy-based models, and diffusion/flow-based models. To this end, RL researchers often choose to pay the price of accommodating these models into their algorithms -- diffusion models are expressive, but are computationally intensive due to their reliance on solving differential equations, while autoregressive transformer models are scalable but typically require learning discrete representations. Normalizing flows (NFs), by contrast, seem to provide an appealing alternative, as they enable likelihoods and sampling without solving differential equations or autoregressive architectures. However, their potential in RL has received limited attention, partly due to the prevailing belief that normalizing flows lack sufficient expressivity. We show that this is not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Games
MethodsNormalizing Flows · Diffusion
