Do Response Selection Models Really Know What's Next? Utterance   Manipulation Strategies for Multi-turn Response Selection

Taesun Whang; Dongyub Lee; Dongsuk Oh; Chanhee Lee; Kijong Han,; Dong-hun Lee; Saebyeok Lee

arXiv:2009.04703·cs.CL·December 17, 2020

Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh, Chanhee Lee, Kijong Han,, Dong-hun Lee, Saebyeok Lee

PDF

1 Repo 1 Video

TL;DR

This paper introduces utterance manipulation strategies (UMS) to improve multi-turn response selection models by emphasizing dialog coherence, demonstrating significant performance gains across various datasets and languages.

Contribution

The paper proposes self-supervised utterance manipulation strategies that enhance dialog coherence in response selection models, addressing limitations of existing approaches that ignore temporal dependencies.

Findings

01

UMS significantly improve response selection accuracy.

02

Models trained with UMS outperform state-of-the-art on multiple benchmarks.

03

UMS are effective across different languages and models.

Abstract

In this paper, we study the task of selecting the optimal response given a user and system utterance history in retrieval-based multi-turn dialog systems. Recently, pre-trained language models (e.g., BERT, RoBERTa, and ELECTRA) showed significant improvements in various natural language processing tasks. This and similar response selection tasks can also be solved using such language models by formulating the tasks as dialog--response binary classification tasks. Although existing works using this approach successfully obtained state-of-the-art results, we observe that language models trained in this manner tend to make predictions based on the relatedness of history and candidates, ignoring the sequential nature of multi-turn dialog systems. This suggests that the response selection task alone is insufficient for learning temporal dependencies between utterances. To this end, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

taesunwhang/UMS-ResSel
pytorchOfficial

Videos

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-Turn Response Selection· underline

Taxonomy

MethodsLinear Layer · Layer Normalization · Weight Decay · Dropout · Linear Warmup With Linear Decay · RoBERTa · Dense Connections · WordPiece · Attention Dropout · Attention Is All You Need