DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service   Chatlog

Xin Zheng; Tianyu Liu; Haoran Meng; Xu Wang; Yufan Jiang; Mengliang; Rao; Binghuai Lin; Zhifang Sui; Yunbo Cao

arXiv:2212.07112·cs.CL·December 15, 2022

DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog

Xin Zheng, Tianyu Liu, Haoran Meng, Xu Wang, Yufan Jiang, Mengliang, Rao, Binghuai Lin, Zhifang Sui, Yunbo Cao

PDF

Open Access

TL;DR

This paper introduces N-to-N question-answer pair extraction from customer service chatlogs, enabling more comprehensive knowledge base enrichment by capturing cross-utterance QA pairs, with new models and a benchmark for evaluation.

Contribution

It proposes a novel N-to-N QA extraction task, introduces generative/discriminative methods, and establishes a benchmark with evaluation metrics for dialogue-level QA extraction.

Findings

01

Models perform well on multiple datasets.

02

Extracted QA pairs reveal dialogue structure insights.

03

Proposed methods adapt across domains and languages.

Abstract

Harvesting question-answer (QA) pairs from customer service chatlog in the wild is an efficient way to enrich the knowledge base for customer service chatbots in the cold start or continuous integration scenarios. Prior work attempts to obtain 1-to-1 QA pairs from growing customer service chatlog, which fails to integrate the incomplete utterances from the dialog context for composite QA retrieval. In this paper, we propose N-to-N QA extraction task in which the derived questions and corresponding answers might be separated across different utterances. We introduce a suite of generative/discriminative tagging based methods with end-to-end and two-stage variants that perform well on 5 customer service datasets and for the first time setup a benchmark for N-to-N DialogQAE with utterance and session level evaluation metrics. With a deep dive into extracted QA pairs, we find that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

Methodstravel james · Balanced Selection