PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching
Ruishuo Chen, Yu Chen, Zhuoran Li, Longbo Huang

TL;DR
PowerFlow introduces a principled distribution matching framework for fine-tuning LLMs, enabling controlled sharpening or flattening of their output distributions to enhance reasoning or creativity, outperforming existing methods.
Contribution
It reformulates unsupervised LLM fine-tuning as a distribution matching problem using GFlowNet and a novel length-aware Trajectory-Balance objective, addressing biases and enabling dual control of LLM capabilities.
Findings
PowerFlow outperforms existing RLIF methods in various tasks.
It matches or exceeds supervised fine-tuning results.
The approach improves diversity and quality in creative tasks.
Abstract
Unsupervised Reinforcement Learning from Internal Feedback (RLIF) has emerged as a promising paradigm for eliciting the latent capabilities of Large Language Models (LLMs) without external supervision. However, current methods rely on heuristic intrinsic rewards, which often lack a well-defined theoretical optimization target and are prone to degenerative biases. In this work, we introduce PowerFlow, a principled framework that reformulates unsupervised fine-tuning as a distribution matching problem. By casting GFlowNet as an amortized variational sampler for unnormalized densities, we propose a length-aware Trajectory-Balance objective that explicitly neutralizes the structural length biases inherent in autoregressive generation. By targeting -power distributions, PowerFlow enables the directional elicitation of the dual nature of LLMs: sharpening the distribution ($\alpha >…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
