LORE: Latent Optimization for Precise Semantic Control in Rectified Flow-based Image Editing

Liangyang Ouyang; Jiafeng Mao

arXiv:2508.03144·cs.CV·August 22, 2025

LORE: Latent Optimization for Precise Semantic Control in Rectified Flow-based Image Editing

Liangyang Ouyang, Jiafeng Mao

PDF

Open Access

TL;DR

LORE introduces a training-free latent optimization method for precise, controllable, and general-purpose image editing using rectified flow models, addressing semantic bias issues in existing inversion-based techniques.

Contribution

LORE proposes a novel, training-free approach that directly optimizes inverted noise for improved semantic control in image editing without architectural changes or fine-tuning.

Findings

01

LORE outperforms baselines in semantic alignment and image quality.

02

LORE demonstrates stability and scalability across benchmarks.

03

LORE effectively addresses semantic bias in rectified flow-based editing.

Abstract

Text-driven image editing enables users to flexibly modify visual content through natural language instructions, and is widely applied to tasks such as semantic object replacement, insertion, and removal. While recent inversion-based editing methods using rectified flow models have achieved promising results in image quality, we identify a structural limitation in their editing behavior: the semantic bias toward the source concept encoded in the inverted noise tends to suppress attention to the target concept. This issue becomes particularly critical when the source and target semantics are dissimilar, where the attention mechanism inherently leads to editing failure or unintended modifications in non-target regions. In this paper, we systematically analyze and validate this structural flaw, and introduce LORE, a training-free and efficient image editing method. LORE directly optimizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning