Non-linear Learning for Statistical Machine Translation

Shujian Huang; Huadong Chen; Xinyu Dai; Jiajun Chen

arXiv:1503.00107·cs.CL·March 3, 2015

Non-linear Learning for Statistical Machine Translation

Shujian Huang, Huadong Chen, Xinyu Dai, Jiajun Chen

PDF

Open Access

TL;DR

This paper introduces a neural network-based non-linear approach to model translation hypothesis quality in SMT, surpassing traditional linear models by capturing complex feature interactions.

Contribution

It presents a novel non-linear modeling framework for SMT hypothesis scoring using neural networks, enhancing expressive power over linear models.

Findings

01

Non-linear models outperform linear models in translation quality.

02

Neural network-based approach captures complex feature interactions.

03

Experimental results demonstrate improved translation performance.

Abstract

Modern statistical machine translation (SMT) systems usually use a linear combination of features to model the quality of each translation hypothesis. The linear combination assumes that all the features are in a linear relationship and constrains that each feature interacts with the rest features in an linear manner, which might limit the expressive power of the model and lead to a under-fit model on the current data. In this paper, we propose a non-linear modeling for the quality of translation hypotheses based on neural networks, which allows more complex interaction between features. A learning framework is presented for training the non-linear models. We also discuss possible heuristics in designing the network structure which may improve the non-linear learning performance. Experimental results show that with the basic features of a hierarchical phrase-based machine translation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Fuzzy Logic and Control Systems