On the Ability of a CNN to Realize Image-to-Image Language Conversion

Kohei Baba; Seiichi Uchida; Brian Kenji Iwana

arXiv:2006.12316·cs.CV·June 23, 2020

On the Ability of a CNN to Realize Image-to-Image Language Conversion

Kohei Baba, Seiichi Uchida, Brian Kenji Iwana

PDF

TL;DR

This paper demonstrates that CNNs can perform image-to-image language conversion, specifically converting Korean Hangul characters into Latin phonetic symbols, even with limited data, by capturing structural features.

Contribution

It introduces a novel CNN architecture tailored for image-to-image language conversion tasks with significantly different input and output features.

Findings

01

CNN can successfully convert Hangul to Latin characters

02

The network captures structural features with limited data

03

The proposed network handles different feature spaces effectively

Abstract

The purpose of this paper is to reveal the ability that Convolutional Neural Networks (CNN) have on the novel task of image-to-image language conversion. We propose a new network to tackle this task by converting images of Korean Hangul characters directly into images of the phonetic Latin character equivalent. The conversion rules between Hangul and the phonetic symbols are not explicitly provided. The results of the proposed network show that it is possible to perform image-to-image language conversion. Moreover, it shows that it can grasp the structural features of Hangul even from limited learning data. In addition, it introduces a new network to use when the input and output have significantly different features.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.