Realistic text replacement with non-uniform style conditioning

Arseny Nerinovsky; Igor Buzhinsky; Andey Filchencov

arXiv:2006.04170·cs.CV·June 9, 2020

Realistic text replacement with non-uniform style conditioning

Arseny Nerinovsky, Igor Buzhinsky, Andey Filchencov

PDF

TL;DR

This paper introduces a novel non-uniform style conditioning layer for realistic text replacement in images, enabling high-quality, indistinguishable edits without post-processing, and demonstrates superior performance on the ICDAR MLT dataset.

Contribution

The paper presents a new non-uniform style conditioning method integrated into an encoder-decoder ResNet architecture for realistic text replacement.

Findings

01

Outperforms existing methods on ICDAR MLT dataset

02

Achieves realistic, indistinguishable text replacement

03

Single-stage model with no post-processing

Abstract

In this work, we study the possibility of realistic text replacement, the goal of which is to replace text present in the image with user-supplied text. The replacement should be performed in a way that will not allow distinguishing the resulting image from the original one. We achieve this goal by developing a novel non-uniform style conditioning layer and apply it to an encoder-decoder ResNet based architecture. The resulting model is a single-stage model, with no post-processing. The proposed model achieves realistic text replacement and outperforms existing approaches on ICDAR MLT.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block · Batch Normalization · Average Pooling · Max Pooling · Global Average Pooling · Residual Connection · Kaiming Initialization · Convolution