DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Hanqing Yang; Qiang Zhou; Yongchao Du; Sashuai Zhou; Zhibin Wang; Jun Song; Tiezheng Ge; Cheng Yu; Bo Zheng

arXiv:2604.25477·cs.CV·April 29, 2026

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Hanqing Yang, Qiang Zhou, Yongchao Du, Sashuai Zhou, Zhibin Wang, Jun Song, Tiezheng Ge, Cheng Yu, Bo Zheng

PDF

TL;DR

DDA-Thinker introduces a decoupled reinforcement learning framework for reasoning-driven image editing, enabling independent planning optimization and achieving competitive results on benchmark datasets.

Contribution

The paper proposes a novel Thinker-centric, decoupled reinforcement learning approach with dual-atomic rewards for improved reasoning in image editing tasks.

Findings

01

Significant performance improvements on RISE-Bench and KRIS-Bench.

02

Competitive results with proprietary models under a fixed-editor setting.

03

Effective training curriculum through a two-stage data curation pipeline.

Abstract

Recent image editing models have achieved strong visual fidelity but often struggle with tasks requiring complex reasoning. To investigate and enhance the reasoning-grounded planning for image editing, we propose DDA-Thinker, a Thinker-centric framework designed for the independent optimization of a planning module (Thinker) over a fixed generative model (Editor). This decoupled Thinker-centric paradigm facilitates a controlled analysis of the planning module and makes its contribution under a fixed Editor easier to assess. To effectively guide this Thinker, we introduce a dual-atomic reinforcement learning framework. This framework decomposes feedback into two distinct atomic rewards implemented through verifiable checklists: a cognitive-atomic reward to directly assess the quality of the Thinker's executable plan, which serves as the actionable outcome of the Thinker's reasoning, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.