TrueType Transformer: Character and Font Style Recognition in Outline   Format

Yusuke Nagata; Jinki Otao; Daichi Haraguchi; and Seiichi Uchida

arXiv:2203.05338·cs.CV·March 14, 2022

TrueType Transformer: Character and Font Style Recognition in Outline Format

Yusuke Nagata, Jinki Otao, Daichi Haraguchi, and Seiichi Uchida

PDF

Open Access 1 Repo

TL;DR

The paper introduces TrueType Transformer (T3), a neural network model that directly processes outline font data for character and style recognition, achieving resolution independence and leveraging local control point structures.

Contribution

The novel T3 model directly accepts outline data using Transformer architecture, enabling resolution-independent font and character style recognition without image conversion.

Findings

01

T3 effectively recognizes characters and font styles from outline data.

02

Control points significantly influence classification accuracy.

03

T3 demonstrates resolution-independent recognition capabilities.

Abstract

We propose TrueType Transformer (T3), which can perform character and font style recognition in an outline format. The outline format, such as TrueType, represents each character as a sequence of control points of stroke contours and is frequently used in born-digital documents. T3 is organized by a deep neural network, so-called Transformer. Transformer is originally proposed for sequential data, such as text, and therefore appropriate for handling the outline data. In other words, T3 directly accepts the outline data without converting it into a bitmap image. Consequently, T3 realizes a resolution-independent classification. Moreover, since the locations of the control points represent the fine and local structures of the font style, T3 is suitable for font style classification, where such structures are very important. In this paper, we experimentally show the applicability of T3 in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uchidalab/TrueTypeTransformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Digital Media Forensic Detection · Image Retrieval and Classification Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Layer Normalization · Adam · Absolute Position Encodings · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing