Analysing the potential of seq-to-seq models for incremental   interpretation in task-oriented dialogue

Dieuwke Hupkes; Sanne Bouwmeester; Raquel Fern\'andez

arXiv:1808.09178·cs.CL·August 29, 2018·1 cites

Analysing the potential of seq-to-seq models for incremental interpretation in task-oriented dialogue

Dieuwke Hupkes, Sanne Bouwmeester, Raquel Fern\'andez

PDF

Open Access

TL;DR

This study examines how seq-to-seq models handle disfluencies in task-oriented dialogues, revealing they are resilient to disfluencies and that such noise can even improve overall representation clarity.

Contribution

It demonstrates that seq-to-seq models are robust to disfluencies and provides insights into their internal representations and effects of disfluency data augmentation.

Findings

01

Disfluencies have minimal impact on task success.

02

Models develop limited awareness of disfluency structure.

03

Adding disfluencies can improve representation clarity.

Abstract

We investigate how encoder-decoder models trained on a synthetic dataset of task-oriented dialogues process disfluencies, such as hesitations and self-corrections. We find that, contrary to earlier results, disfluencies have very little impact on the task success of seq-to-seq models with attention. Using visualisation and diagnostic classifiers, we analyse the representations that are incrementally built by the model, and discover that models develop little to no awareness of the structure of disfluencies. However, adding disfluencies to the data appears to help the model create clearer representations overall, as evidenced by the attention patterns the different models exhibit.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques