A Hierarchical Neural Network for Sequence-to-Sequences Learning
Si Zuo, Zhimin Xu

TL;DR
This paper introduces a hierarchical neural network architecture for sequence-to-sequence learning that improves translation quality for long sentences by splitting and reassembling sequences, outperforming traditional models.
Contribution
The paper proposes a novel hierarchical deep neural network that processes long sentences by splitting and hierarchical correction, enhancing translation accuracy in neural machine translation.
Findings
Achieves higher BLEU scores than traditional models
Lower perplexity indicates better language modeling
Improves translation of long sentences in NMT
Abstract
In recent years, the sequence-to-sequence learning neural networks with attention mechanism have achieved great progress. However, there are still challenges, especially for Neural Machine Translation (NMT), such as lower translation quality on long sentences. In this paper, we present a hierarchical deep neural network architecture to improve the quality of long sentences translation. The proposed network embeds sequence-to-sequence neural networks into a two-level category hierarchy by following the coarse-to-fine paradigm. Long sentences are input by splitting them into shorter sequences, which can be well processed by the coarse category network as the long distance dependencies for short sentences is able to be handled by network based on sequence-to-sequence neural network. Then they are concatenated and corrected by the fine category network. The experiments shows that our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
