WAS: Dataset and Methods for Artistic Text Segmentation

Xudong Xie; Yuzhe Li; Yang Liu; Zhifei Zhang; Zhaowen Wang; Wei Xiong,; Xiang Bai

arXiv:2408.00106·cs.CV·August 2, 2024

WAS: Dataset and Methods for Artistic Text Segmentation

Xudong Xie, Yuzhe Li, Yang Liu, Zhifei Zhang, Zhaowen Wang, Wei Xiong,, Xiang Bai

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset and methods for artistic text segmentation, addressing challenges of complex stroke shapes and global structures, and achieves state-of-the-art results through innovative model components and data synthesis strategies.

Contribution

The paper presents a novel dataset for artistic text segmentation and proposes a decoder with layer-wise momentum query, a skeleton-assisted head, and a data synthesis strategy to improve segmentation performance.

Findings

01

Significant performance improvement on artistic text segmentation tasks.

02

State-of-the-art results on multiple public datasets.

03

Effective data augmentation strategy enhances model generalization.

Abstract

Accurate text segmentation results are crucial for text-related generative tasks, such as text image generation, text editing, text removal, and text style transfer. Recently, some scene text segmentation methods have made significant progress in segmenting regular text. However, these methods perform poorly in scenarios containing artistic text. Therefore, this paper focuses on the more challenging task of artistic text segmentation and constructs a real artistic text segmentation dataset. One challenge of the task is that the local stroke shapes of artistic text are changeable with diversity and complexity. We propose a decoder with the layer-wise momentum query to prevent the model from ignoring stroke regions of special shapes. Another challenge is the complexity of the global topological structure. We further design a skeleton-assisted head to guide the model to focus on the global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xdxie/WAS_WordArt-Segmentation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Music and Audio Processing

MethodsDiffusion · Focus