Long Short-Term Memory for Japanese Word Segmentation

Yoshiaki Kitagawa; Mamoru Komachi

arXiv:1709.08011·cs.CL·September 28, 2018·20 cites

Long Short-Term Memory for Japanese Word Segmentation

Yoshiaki Kitagawa, Mamoru Komachi

PDF

Open Access

TL;DR

This paper introduces an LSTM-based neural network model for Japanese word segmentation, effectively handling orthographic variations and global context, achieving state-of-the-art accuracy across multiple corpora.

Contribution

It is the first to apply LSTM neural networks specifically to Japanese word segmentation, addressing unique orthographic challenges and incorporating global context.

Findings

01

Achieved state-of-the-art accuracy on Japanese corpora

02

Effectively handled orthographic variations in Japanese

03

Demonstrated the importance of global context in JWS

Abstract

This study presents a Long Short-Term Memory (LSTM) neural network approach to Japanese word segmentation (JWS). Previous studies on Chinese word segmentation (CWS) succeeded in using recurrent neural networks such as LSTM and gated recurrent units (GRU). However, in contrast to Chinese, Japanese includes several character types, such as hiragana, katakana, and kanji, that produce orthographic variations and increase the difficulty of word segmentation. Additionally, it is important for JWS tasks to consider a global context, and yet traditional JWS approaches rely on local features. In order to address this problem, this study proposes employing an LSTM-based approach to JWS. The experimental results indicate that the proposed model achieves state-of-the-art accuracy with respect to various Japanese corpora.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis