Syllable-aware Neural Language Models: A Failure to Beat Character-aware   Ones

Zhenisbek Assylbekov; Rustem Takhanov; Bagdat Myrzakhmetov and; Jonathan N. Washington

arXiv:1707.06480·cs.CL·July 21, 2017

Syllable-aware Neural Language Models: A Failure to Beat Character-aware Ones

Zhenisbek Assylbekov, Rustem Takhanov, Bagdat Myrzakhmetov and, Jonathan N. Washington

PDF

1 Repo

TL;DR

This paper compares syllable-aware and character-aware neural language models, finding that syllable-aware models do not outperform character-aware ones in quality but are more parameter-efficient and faster to train.

Contribution

It demonstrates that syllable-aware models can match character-aware performance with fewer parameters and faster training, challenging assumptions about syllable segmentation benefits.

Findings

01

Syllable-aware models do not improve language modeling quality over character-aware models.

02

Syllable-aware models are 18%-33% smaller in parameters.

03

Syllable-aware models train 1.2-2.2 times faster.

Abstract

Syllabification does not seem to improve word-level RNN language modeling quality when compared to character-based segmentation. However, our best syllable-aware language model, achieving performance comparable to the competitive character-aware model, has 18%-33% fewer parameters and is trained 1.2-2.2 times faster.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zh3nis/lstm-syl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.