An Evaluation Dataset and Strategy for Building Robust Multi-turn   Response Selection Model

Kijong Han; Seojin Lee; Wooin Lee; Joosung Lee; Dong-hun Lee

arXiv:2109.04834·cs.CL·September 13, 2021

An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model

Kijong Han, Seojin Lee, Wooin Lee, Joosung Lee, Dong-hun Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces an adversarial dataset and a strategy to improve the robustness of multi-turn response selection models, addressing their tendency to rely on superficial patterns rather than contextual understanding.

Contribution

It provides an adversarial dataset for evaluating weaknesses and proposes a new strategy to enhance model robustness in open-domain Korean multi-turn response selection.

Findings

01

Models often rely on superficial keyword patterns.

02

The adversarial dataset reveals specific weaknesses.

03

The proposed strategy improves model robustness.

Abstract

Multi-turn response selection models have recently shown comparable performance to humans in several benchmark datasets. However, in the real environment, these models often have weaknesses, such as making incorrect predictions based heavily on superficial patterns without a comprehensive understanding of the context. For example, these models often give a high score to the wrong response candidate containing several keywords related to the context but using the inconsistent tense. In this study, we analyze the weaknesses of the open-domain Korean Multi-turn response selection models and publish an adversarial dataset to evaluate these weaknesses. We also suggest a strategy to build a robust model in this adversarial environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kakaoenterprise/koradvmrstestdata
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Terrorism, Counterterrorism, and Political Violence · Adversarial Robustness in Machine Learning