Non-Autoregressive Semantic Parsing for Compositional Task-Oriented   Dialog

Arun Babu; Akshat Shrivastava; Armen Aghajanyan; Ahmed Aly; Angela Fan; and Marjan Ghazvininejad

arXiv:2104.04923·cs.CL·April 13, 2021

Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog

Arun Babu, Akshat Shrivastava, Armen Aghajanyan, Ahmed Aly, Angela Fan, and Marjan Ghazvininejad

PDF

1 Repo

TL;DR

This paper introduces a non-autoregressive, convolutional neural network-based semantic parsing model that significantly reduces latency and model size while maintaining competitive accuracy across multiple datasets.

Contribution

It presents a novel non-autoregressive architecture combining CNNs for semantic parsing, enabling faster inference suitable for real-time conversational systems.

Findings

01

Achieves up to 81% latency reduction on TOP dataset

02

Reduces parameter size compared to RNN models

03

Maintains competitive performance on multiple datasets

Abstract

Semantic parsing using sequence-to-sequence models allows parsing of deeper representations compared to traditional word tagging based models. In spite of these advantages, widespread adoption of these models for real-time conversational use cases has been stymied by higher compute requirements and thus higher latency. In this work, we propose a non-autoregressive approach to predict semantic parse trees with an efficient seq2seq model architecture. By combining non-autoregressive prediction with convolutional neural networks, we achieve significant latency gains and parameter size reduction compared to traditional RNN models. Our novel architecture achieves up to an 81% reduction in latency on TOP dataset and retains competitive performance to non-pretrained models on three different semantic parsing datasets. Our code is available at https://github.com/facebookresearch/pytext

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/pytext
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence