Real-World Font Recognition Using Deep Network and Domain Adaptation

Zhangyang Wang; Jianchao Yang; Hailin Jin; Eli Shechtman; Aseem; Agarwala; Jonathan Brandt; Thomas S. Huang

arXiv:1504.00028·cs.CV·April 2, 2015·ICLR·5 cites

Real-World Font Recognition Using Deep Network and Domain Adaptation

Zhangyang Wang, Jianchao Yang, Hailin Jin, Eli Shechtman, Aseem, Agarwala, Jonathan Brandt, Thomas S. Huang

PDF

Open Access

TL;DR

This paper presents a deep learning approach with domain adaptation to improve font recognition accuracy on real-world images, bridging the gap between synthetic training data and real-world data.

Contribution

It introduces a domain adaptation technique using a Stacked Convolutional Auto-Encoder to enhance font recognition performance on real-world images.

Findings

01

Achieves over 80% top-5 accuracy on real-world font dataset.

02

Effectively reduces the domain gap between synthetic and real images.

03

Demonstrates the effectiveness of auto-encoder based domain adaptation in fine-grain classification.

Abstract

We address a challenging fine-grain classification problem: recognizing a font style from an image of text. In this task, it is very easy to generate lots of rendered font examples but very hard to obtain real-world labeled images. This real-to-synthetic domain gap caused poor generalization to new real data in previous methods (Chen et al. (2014)). In this paper, we refer to Convolutional Neural Networks, and use an adaptation technique based on a Stacked Convolutional Auto-Encoder that exploits unlabeled real-world images combined with synthetic data. The proposed method achieves an accuracy of higher than 80% (top-5) on a real-world dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques