Learning Character-level Compositionality with Visual Features

Frederick Liu; Han Lu; Chieh Lo; Graham Neubig

arXiv:1704.04859·cs.CL·May 9, 2017·5 cites

Learning Character-level Compositionality with Visual Features

Frederick Liu, Han Lu, Chieh Lo, Graham Neubig

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel approach to character embeddings by leveraging visual features extracted through CNNs, improving the handling of rare characters in East Asian languages for text classification.

Contribution

It proposes a visual character embedding method that captures compositionality at the character level using CNNs on character images, enhancing rare character processing.

Findings

01

Improved classification accuracy on Chinese, Japanese, and Korean datasets.

02

Embeddings focus on semantically meaningful parts of characters.

03

Model learns coherent visual representations of characters.

Abstract

Previous work has modeled the compositionality of words by creating character-level models of meaning, reducing problems of sparsity for rare words. However, in many writing systems compositionality has an effect even on the character-level: the meaning of a character is derived by the sum of its parts. In this paper, we model this effect by creating embeddings for characters based on their visual characteristics, creating an image for the character and running it through a convolutional neural network to produce a visual character embedding. Experiments on a text classification task demonstrate that such model allows for better processing of instances with rare characters in languages such as Chinese, Japanese, and Korean. Additionally, qualitative analyses demonstrate that our proposed model learns to focus on the parts of characters that carry semantic content, resulting in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications