PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

Ruishuo Chen; Yu Chen; Zhuoran Li; Longbo Huang

arXiv:2603.18363·cs.CL·March 20, 2026

PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

Ruishuo Chen, Yu Chen, Zhuoran Li, Longbo Huang

PDF

Open Access

TL;DR

PowerFlow introduces a principled distribution matching framework for fine-tuning LLMs, enabling controlled sharpening or flattening of their output distributions to enhance reasoning or creativity, outperforming existing methods.

Contribution

It reformulates unsupervised LLM fine-tuning as a distribution matching problem using GFlowNet and a novel length-aware Trajectory-Balance objective, addressing biases and enabling dual control of LLM capabilities.

Findings

01

PowerFlow outperforms existing RLIF methods in various tasks.

02

It matches or exceeds supervised fine-tuning results.

03

The approach improves diversity and quality in creative tasks.

Abstract

Unsupervised Reinforcement Learning from Internal Feedback (RLIF) has emerged as a promising paradigm for eliciting the latent capabilities of Large Language Models (LLMs) without external supervision. However, current methods rely on heuristic intrinsic rewards, which often lack a well-defined theoretical optimization target and are prone to degenerative biases. In this work, we introduce PowerFlow, a principled framework that reformulates unsupervised fine-tuning as a distribution matching problem. By casting GFlowNet as an amortized variational sampler for unnormalized densities, we propose a length-aware Trajectory-Balance objective that explicitly neutralizes the structural length biases inherent in autoregressive generation. By targeting $α$ -power distributions, PowerFlow enables the directional elicitation of the dual nature of LLMs: sharpening the distribution ($\alpha >…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications