CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue   System

Etsuko Ishii; Yan Xu; Genta Indra Winata; Zhaojiang Lin; Andrea; Madotto; Zihan Liu; Peng Xu; Pascale Fung

arXiv:2106.03530·cs.CL·June 9, 2021

CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue System

Etsuko Ishii, Yan Xu, Genta Indra Winata, Zhaojiang Lin, Andrea, Madotto, Zihan Liu, Peng Xu, Pascale Fung

PDF

Open Access 1 Repo

TL;DR

This paper presents a data augmentation approach combined with training techniques using pre-trained language models to improve information-seeking dialogue systems, achieving promising results in the DialDoc21 competition.

Contribution

It introduces data augmentation methods and training strategies with pre-trained models specifically for dialogue systems, demonstrating their effectiveness in a competitive setting.

Findings

01

Achieved 74.95 F1 score in subtask 1

02

Achieved 60.74 Exact Match score in subtask 1

03

Achieved 37.72 SacreBLEU score in subtask 2

Abstract

Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users' needs, which. To tackle this challenge, we utilize data augmentation methods and several training techniques with the pre-trained language models to learn a general pattern of the task and thus achieve promising performance. In DialDoc21 competition, our system achieved 74.95 F1 score and 60.74 Exact Match score in subtask 1, and 37.72 SacreBLEU score in subtask 2. Empirical analysis is provided to explain the effectiveness of our approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HLTCHKUST/CAiRE_in_DialDoc21
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques