TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text Editing
Yuchen Bao, Yiting Wang, Wenjian Huang, Haowei Wang, Shen Chen, Taiping Yao, Shouhong Ding, Jianguo Zhang

TL;DR
TripleFDS introduces a novel scene text editing framework that disentangles text style, content, and background, enabling more flexible and accurate image modifications with state-of-the-art results.
Contribution
It proposes a new triple feature disentanglement framework and dataset, enhancing controllability and visual consistency in scene text editing tasks.
Findings
Achieves state-of-the-art SSIM of 44.54 and accuracy of 93.58% on benchmarks.
Supports new editing operations like style replacement and background transfer.
Trained on 125,000 SCB Groups for robust disentanglement.
Abstract
Scene Text Editing (STE) aims to naturally modify text in images while preserving visual consistency, the decisive factors of which can be divided into three parts, i.e., text style, text content, and background. Previous methods have struggled with incomplete disentanglement of editable attributes, typically addressing only one aspect - such as editing text content - thus limiting controllability and visual consistency. To overcome these limitations, we propose TripleFDS, a novel framework for STE with disentangled modular attributes, and an accompanying dataset called SCB Synthesis. SCB Synthesis provides robust training data for triple feature disentanglement by utilizing the "SCB Group", a novel construct that combines three attributes per image to generate diverse, disentangled training groups. Leveraging this construct as a basic training unit, TripleFDS first disentangles triple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Computer Graphics and Visualization Techniques
