FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing

Kaixiang Yang; Boyang Shen; Xin Li; Yuchen Dai; Yuxuan Luo; Yueran Ma; Wei Fang; Qiang Li; Zhiwei Wang

arXiv:2511.12151·cs.CV·November 18, 2025

FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing

Kaixiang Yang, Boyang Shen, Xin Li, Yuchen Dai, Yuxuan Luo, Yueran Ma, Wei Fang, Qiang Li, Zhiwei Wang

PDF

Open Access 1 Video

TL;DR

FIA-Edit introduces a frequency-interactive attention framework for efficient, high-fidelity, inversion-free text-guided image editing, effectively preserving source information and enabling medical image synthesis.

Contribution

The paper proposes FIA-Edit, a novel inversion-free editing method with frequency-interactive attention, improving source information integration and extending applications to medical image synthesis.

Findings

01

Supports high-fidelity editing with low computational cost (~6s per image)

02

Outperforms existing methods in visual quality and background fidelity

03

Enables medical image synthesis for data augmentation and classification

Abstract

Text-guided image editing has advanced rapidly with the rise of diffusion models. While flow-based inversion-free methods offer high efficiency by avoiding latent inversion, they often fail to effectively integrate source information, leading to poor background preservation, spatial inconsistencies, and over-editing due to the lack of effective integration of source information. In this paper, we present FIA-Edit, a novel inversion-free framework that achieves high-fidelity and semantically precise edits through a Frequency-Interactive Attention. Specifically, we design two key components: (1) a Frequency Representation Interaction (FRI) module that enhances cross-domain alignment by exchanging frequency components between source and target features within self-attention, and (2) a Feature Injection (FIJ) module that explicitly incorporates source-side queries, keys, values, and text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Cell Image Analysis Techniques