FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

Chenxiao Gao; Edward Chen; Tianyi Chen; Bo Dai

arXiv:2603.27450·cs.LG·March 31, 2026

FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

Chenxiao Gao, Edward Chen, Tianyi Chen, Bo Dai

PDF

1 Repo

TL;DR

This paper introduces a comprehensive taxonomy and a modular JAX-based framework for reinforcement learning with diffusion policies, along with benchmarks to guide future research and practical applications.

Contribution

It provides a unified taxonomy, an open-source codebase for efficient training, and systematic benchmarks for diffusion-based RL methods.

Findings

01

Unified taxonomy for RL with diffusion policies

02

Open-source, high-throughput JAX-based codebase

03

Benchmark results across multiple simulation environments

Abstract

Thanks to their remarkable flexibility, diffusion models and flow models have emerged as promising candidates for policy representation. However, efficient reinforcement learning (RL) upon these policies remains a challenge due to the lack of explicit log-probabilities for vanilla policy gradient estimators. While numerous attempts have been proposed to address this, the field lacks a unified perspective to reconcile these seemingly disparate methods, thus hampering ongoing development. In this paper, we bridge this gap by introducing a comprehensive taxonomy for RL algorithms with diffusion/flow policies. To support reproducibility and agile prototyping, we introduce a modular, JAX-based open-source codebase that leverages JIT-compilation for high-throughput training. Finally, we provide systematic and standardized benchmarks across Gym-Locomotion, DeepMind Control Suite, and IsaacLab,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

typoverflow/flow-rl
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.