Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme   Conversion

Kaisheng Yao; Geoffrey Zweig

arXiv:1506.00196·cs.CL·August 21, 2015·68 cites

Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion

Kaisheng Yao, Geoffrey Zweig

PDF

Open Access

TL;DR

This paper investigates sequence-to-sequence neural models for grapheme-to-phoneme conversion, demonstrating that bi-directional LSTM models can outperform traditional methods in this small-vocabulary task.

Contribution

It introduces the application of bi-directional LSTM neural networks with alignment information to improve grapheme-to-phoneme conversion accuracy.

Findings

01

Bi-directional LSTM models outperform traditional methods.

02

Neural models rival state-of-the-art performance.

03

Alignment information enhances model accuracy.

Abstract

Sequence-to-sequence translation methods based on generation with a side-conditioned language model have recently shown promising results in several tasks. In machine translation, models conditioned on source side words have been used to produce target-language text, and in image captioning, models conditioned images have been used to generate caption text. Past work with this approach has focused on large vocabulary tasks, and measured quality in terms of BLEU. In this paper, we explore the applicability of such models to the qualitatively different grapheme-to-phoneme task. Here, the input and output side vocabularies are small, plain n-gram models do well, and credit is only given when the output is exactly correct. We find that the simple side-conditioned generation approach is able to rival the state-of-the-art, and we are able to significantly advance the stat-of-the-art with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling