Sequence-to-Sequence Learning with Latent Neural Grammars

Yoon Kim

arXiv:2109.01135 · cs.CL · November 17, 2021

TL;DR

This paper introduces a hierarchical sequence-to-sequence model based on a latent neural quasi-synchronous grammar. The model induces both source and target trees during training, with the aim of improving compositional generalization and performance on tasks such as SCAN, style transfer, and small-scale machine translation.

Contribution

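The paper proposes a neural parameterization of a quasi-synchronous grammar for sequence-to-sequence learning with latent source and target trees. The parameterization shares parameters across the combinatorial space of derivation rules, so no manual feature engineering is needed.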

Findings

01 Performs well on compositional generalization tasks such as SCAN

02 Effective in style transfer and small-scale machine translation

03 Performs respectably compared to standard baselines in the tested domains

Abstract

Sequence-to-sequence learning with neural networks has become the de facto standard for sequence prediction tasks. This approach typically models the local distribution over the next word with a powerful neural network that can condition on arbitrary context. While flexible and performant, these models often require large datasets for training and can fail spectacularly on benchmarks designed to test for compositional generalization. This work explores an alternative, hierarchical approach to sequence-to-sequence learning with quasi-synchronous grammars, where each node in the target tree is transduced by a node in the source tree. Both the source and target trees are treated as latent and induced during training. We develop a neural parameterization of the grammar which enables parameter sharing over the combinatorial space of derivation rules without the need for manual feature…
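
To make "treated as latent" concrete: when trees are unobserved, training typically maximizes the probability of the observed strings summed over all trees that could have produced them, and for grammars this sum can be computed with dynamic programming. The sketch below is a simplified illustration and not the paper's model. It assumes a plain monolingual PCFG in Chomsky normal form with hand-set rule probabilities, whereas the paper uses a quasi-synchronous grammar with a neural parameterization. The function name `inside_probability` and the toy grammar are hypothetical.

```python
from collections import defaultdict


def inside_probability(words, lexical, binary, root="S"):
    """Sum the probabilities of all binary trees rooted at `root` that yield `words`.

    Illustrative assumption: a PCFG in Chomsky normal form.
    lexical: dict mapping (nonterminal, word) -> probability
    binary:  dict mapping (parent, left_child, right_child) -> probability
    """
    n = len(words)
    # chart[(i, j)][A] = total probability that nonterminal A derives words[i:j]
    chart = defaultdict(lambda: defaultdict(float))

    # Width-1 spans: apply lexical rules A -> word.
    for i, w in enumerate(words):
        for (a, word), p in lexical.items():
            if word == w:
                chart[(i, i + 1)][a] += p

    # Wider spans: sum over every split point and every binary rule A -> B C.
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for (a, b, c), p in binary.items():
                    left = chart[(i, k)].get(b, 0.0)
                    right = chart[(k, j)].get(c, 0.0)
                    if left and right:
                        chart[(i, j)][a] += p * left * right

    return chart[(0, n)].get(root, 0.0)


# Toy grammar (hypothetical): S -> S S with probability 0.4, S -> "a" with 0.6.
toy_lexical = {("S", "a"): 0.6}
toy_binary = {("S", "S", "S"): 0.4}

# "a a a" has two binary trees, each with probability 0.4**2 * 0.6**3,
# so the marginal is approximately 0.06912.
print(inside_probability(["a", "a", "a"], toy_lexical, toy_binary))
```

Because the chart sums over split points and rules rather than picking the best one, the result is a marginal over trees, not the score of a single parse. In a neural setting the rule probabilities would be produced by a network and the same recursion would be differentiated through, usually in log space for numerical stability.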

Code & Models

Repositories

yoonkim/neural-qcfg (PyTorch, official)

Videos

Sequence-to-Sequence Learning with Latent Neural Grammars · SlidesLive

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications