Data Augmentation for Neural Online Chat Response Selection

Wenchao Du; Alan W Black

arXiv:1809.00428·cs.CL·September 5, 2018·6 cites

Data Augmentation for Neural Online Chat Response Selection

Wenchao Du, Alan W Black

PDF

Open Access

TL;DR

This paper explores novel data augmentation techniques, permutation and flipping, for neural dialog response selection, demonstrating significant improvements in recall metrics across multiple datasets and languages.

Contribution

It introduces a combined data augmentation approach that merges original and synthesized data, enhancing model generalization in dialog response tasks.

Findings

01

Achieved 1-3 recall-at-1 points improvement over baselines.

02

Effective across Chinese and English datasets.

03

Works for both full-scale and small-scale models.

Abstract

Data augmentation seeks to manipulate the available data for training to improve the generalization ability of models. We investigate two data augmentation proxies, permutation and flipping, for neural dialog response selection task on various models over multiple datasets, including both Chinese and English languages. Different from standard data augmentation techniques, our method combines the original and synthesized data for prediction. Empirical results show that our approach can gain 1 to 3 recall-at-1 points over baseline models in both full-scale and small-scale settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Sentiment Analysis and Opinion Mining