An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model
Kijong Han, Seojin Lee, Wooin Lee, Joosung Lee, Dong-hun Lee

TL;DR
This paper introduces an adversarial dataset and a strategy to improve the robustness of multi-turn response selection models, addressing their tendency to rely on superficial patterns rather than contextual understanding.
Contribution
It provides an adversarial dataset for evaluating weaknesses and proposes a new strategy to enhance model robustness in open-domain Korean multi-turn response selection.
Findings
Models often rely on superficial keyword patterns.
The adversarial dataset reveals specific weaknesses.
The proposed strategy improves model robustness.
Abstract
Multi-turn response selection models have recently shown comparable performance to humans in several benchmark datasets. However, in the real environment, these models often have weaknesses, such as making incorrect predictions based heavily on superficial patterns without a comprehensive understanding of the context. For example, these models often give a high score to the wrong response candidate containing several keywords related to the context but using the inconsistent tense. In this study, we analyze the weaknesses of the open-domain Korean Multi-turn response selection models and publish an adversarial dataset to evaluate these weaknesses. We also suggest a strategy to build a robust model in this adversarial environment.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Terrorism, Counterterrorism, and Political Violence · Adversarial Robustness in Machine Learning
