A Hierarchical Neural Network for Sequence-to-Sequences Learning

Si Zuo; Zhimin Xu

arXiv:1811.09575·cs.CL·November 26, 2018·1 cites

A Hierarchical Neural Network for Sequence-to-Sequences Learning

Si Zuo, Zhimin Xu

PDF

Open Access

TL;DR

This paper introduces a hierarchical neural network architecture for sequence-to-sequence learning that improves translation quality for long sentences by splitting and reassembling sequences, outperforming traditional models.

Contribution

The paper proposes a novel hierarchical deep neural network that processes long sentences by splitting and hierarchical correction, enhancing translation accuracy in neural machine translation.

Findings

01

Achieves higher BLEU scores than traditional models

02

Lower perplexity indicates better language modeling

03

Improves translation of long sentences in NMT

Abstract

In recent years, the sequence-to-sequence learning neural networks with attention mechanism have achieved great progress. However, there are still challenges, especially for Neural Machine Translation (NMT), such as lower translation quality on long sentences. In this paper, we present a hierarchical deep neural network architecture to improve the quality of long sentences translation. The proposed network embeds sequence-to-sequence neural networks into a two-level category hierarchy by following the coarse-to-fine paradigm. Long sentences are input by splitting them into shorter sequences, which can be well processed by the coarse category network as the long distance dependencies for short sentences is able to be handled by network based on sequence-to-sequence neural network. Then they are concatenated and corrected by the fine category network. The experiments shows that our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications