RewardFlow: Generate Images by Optimizing What You Reward

Onkar Susladkar; Dong-Hwan Jang; Tushar Prakash; Adheesh Juvekar; Vedant Shah; Ayush Barik; Nabeel Bashir; Muntasir Wahed; Ritish Shrirao; Ismini Lourentzou

arXiv:2604.08536·cs.CV·April 10, 2026

RewardFlow: Generate Images by Optimizing What You Reward

Onkar Susladkar, Dong-Hwan Jang, Tushar Prakash, Adheesh Juvekar, Vedant Shah, Ayush Barik, Nabeel Bashir, Muntasir Wahed, Ritish Shrirao, Ismini Lourentzou

PDF

TL;DR

RewardFlow is a novel inference-time framework that optimizes multiple rewards to generate semantically aligned and high-fidelity images using pretrained diffusion models.

Contribution

It introduces a unified, inversion-free approach with a prompt-aware adaptive policy for multi-reward optimization in image generation.

Findings

01

Achieves state-of-the-art edit fidelity on image editing benchmarks.

02

Demonstrates improved compositional alignment in generated images.

03

Effectively integrates diverse rewards including language-vision reasoning.

Abstract

We introduce RewardFlow, an inversion-free framework that steers pretrained diffusion and flow-matching models at inference time through multi-reward Langevin dynamics. RewardFlow unifies complementary differentiable rewards for semantic alignment, perceptual fidelity, localized grounding, object consistency, and human preference, and further introduces a differentiable VQA-based reward that provides fine-grained semantic supervision through language-vision reasoning. To coordinate these heterogeneous objectives, we design a prompt-aware adaptive policy that extracts semantic primitives from the instruction, infers edit intent, and dynamically modulates reward weights and step sizes throughout sampling. Across several image editing and compositional generation benchmarks, RewardFlow delivers state-of-the-art edit fidelity and compositional alignment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.