EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu; Teng Fu; Bin Li; Xiangyang Xue

arXiv:2407.17020·cs.CV·July 25, 2024

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu, Teng Fu, Bin Li, Xiangyang Xue

PDF

1 Repo

TL;DR

EAFormer introduces an edge-aware transformer approach for scene text segmentation, emphasizing text edges to improve accuracy, especially at boundaries, and demonstrates superior performance on relabeled benchmarks.

Contribution

The paper proposes a novel edge-guided transformer model that explicitly incorporates text edge information for improved scene text segmentation accuracy.

Findings

01

Outperforms previous methods on standard benchmarks.

02

Achieves higher accuracy with more precise annotations.

03

Effectively focuses on text edges for better segmentation results.

Abstract

Scene text segmentation aims at cropping texts from scene images, which is usually used to help generative models edit or remove texts. The existing text segmentation methods tend to involve various text-related supervisions for better performance. However, most of them ignore the importance of text edges, which are significant for downstream applications. In this paper, we propose Edge-Aware Transformers, termed EAFormer, to segment texts more accurately, especially at the edge of texts. Specifically, we first design a text edge extractor to detect edges and filter out edges of non-text areas. Then, we propose an edge-guided encoder to make the model focus more on text edges. Finally, an MLP-based decoder is employed to predict text masks. We have conducted extensive experiments on commonly-used benchmarks to verify the effectiveness of EAFormer. The experimental results demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fudanvi/fudanocr
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus