FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching
Lei Lv, Yunfei Li, Yu Luo, Fuchun Sun, Xiao Ma

TL;DR
FLAC introduces a likelihood-free reinforcement learning framework that uses kinetic energy regularization within a Schr"odinger Bridge formulation, enabling high-dimensional control without explicit action density estimation.
Contribution
It formulates maximum entropy RL as a Generalized Schr"odinger Bridge problem with kinetic energy regularization, providing a novel likelihood-free approach for iterative policies.
Findings
Achieves superior or comparable performance on high-dimensional benchmarks
Avoids explicit density estimation in policy optimization
Automatically tunes kinetic energy via dual mechanism
Abstract
Iterative generative policies, such as diffusion models and flow matching, offer superior expressivity for continuous control but complicate Maximum Entropy Reinforcement Learning because their action log-densities are not directly accessible. To address this, we propose Field Least-Energy Actor-Critic (FLAC), a likelihood-free framework that regulates policy stochasticity by penalizing the kinetic energy of the velocity field. Our key insight is to formulate policy optimization as a Generalized Schr\"odinger Bridge (GSB) problem relative to a high-entropy reference process (e.g., uniform). Under this view, the maximum-entropy principle emerges naturally as staying close to a high-entropy reference while optimizing return, without requiring explicit action densities. In this framework, kinetic energy serves as a physically grounded proxy for divergence from the reference: minimizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Neural Networks and Reservoir Computing
