TL;DR
CharFormer is a novel glyph fusion and attention-based framework designed to improve character image denoising by preserving glyph integrity and enhancing recognition accuracy, outperforming existing methods.
Contribution
It introduces a glyph fusion and attention mechanism framework with a parallel target task to maintain glyph consistency during denoising, a novel approach in character image restoration.
Findings
Outperforms state-of-the-art denoising methods quantitatively
Preserves character glyphs effectively during denoising
Enhances character recognition performance after denoising
Abstract
Degraded images commonly exist in the general sources of character images, leading to unsatisfactory character recognition results. Existing methods have dedicated efforts to restoring degraded character images. However, the denoising results obtained by these methods do not appear to improve character recognition performance. This is mainly because current methods only focus on pixel-level information and ignore critical features of a character, such as its glyph, resulting in character-glyph damage during the denoising process. In this paper, we introduce a novel generic framework based on glyph fusion and attention mechanisms, i.e., CharFormer, for precisely recovering character images without changing their inherent glyphs. Unlike existing frameworks, CharFormer introduces a parallel target task for capturing additional information and injecting it into the image denoising backbone,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Dropout · Byte Pair Encoding · Adam · Residual Connection · Label Smoothing · GBST
