Diagnosing Transformers in Task-Oriented Semantic Parsing

Shrey Desai; Ahmed Aly

arXiv:2105.13496·cs.CL·May 31, 2021

Diagnosing Transformers in Task-Oriented Semantic Parsing

Shrey Desai, Ahmed Aly

PDF

TL;DR

This paper investigates the strengths and weaknesses of transformer-based semantic parsers like BART and XLM-R in task-oriented settings, revealing challenges in disambiguation, syntax validity, and span extraction, but also their potential for error detection.

Contribution

It provides a comprehensive analysis of transformer-based parsers in semantic parsing, highlighting specific issues and their implications for deployment.

Findings

01

Transformers struggle with disambiguating intents and slots.

02

Transformers often produce syntactically invalid frames.

03

Span extraction is a major bottleneck for current models.

Abstract

Modern task-oriented semantic parsing approaches typically use seq2seq transformers to map textual utterances to semantic frames comprised of intents and slots. While these models are empirically strong, their specific strengths and weaknesses have largely remained unexplored. In this work, we study BART and XLM-R, two state-of-the-art parsers, across both monolingual and multilingual settings. Our experiments yield several key results: transformer-based parsers struggle not only with disambiguating intents/slots, but surprisingly also with producing syntactically-valid frames. Though pre-training imbues transformers with syntactic inductive biases, we find the ambiguity of copying utterance spans into frames often leads to tree invalidity, indicating span extraction is a major bottleneck for current parsers. However, as a silver lining, we show transformer-based parsers give sufficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · XLM-R · Linear Layer · Softmax · Adam · Tanh Activation · Dense Connections · Residual Connection