Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue

Han Liu; Caixia Yuan; Xiaojie Wang; Yushu Yang; Huixing Jiang,; Zhongyuan Wang

arXiv:2102.02435·cs.CL·February 5, 2021

Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue

Han Liu, Caixia Yuan, Xiaojie Wang, Yushu Yang, Huixing Jiang,, Zhongyuan Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces the MD3 task and dataset for multi-document driven dialogue, proposing a model that effectively guesses target documents through dialogue, outperforming baselines and nearing human performance.

Contribution

It presents a new task, dataset, and model for multi-document dialogue that can guess user target documents efficiently by integrating external knowledge and dialogue state tracking.

Findings

01

The MD3 model outperforms baseline methods.

02

The model approaches human-level performance.

03

Effective dialogue policy optimization reduces guessing turns.

Abstract

We propose a novel task, Multi-Document Driven Dialogue (MD3), in which an agent can guess the target document that the user is interested in by leading a dialogue. To benchmark progress, we introduce a new dataset of GuessMovie, which contains 16,881 documents, each describing a movie, and associated 13,434 dialogues. Further, we propose the MD3 model. Keeping guessing the target document in mind, it converses with the user conditioned on both document engagement and user feedback. In order to incorporate large-scale external documents into the dialogue, it pretrains a document representation which is sensitive to attributes it talks about an object. Then it tracks dialogue state by detecting evolvement of document belief and attribute belief, and finally optimizes dialogue policy in principle of entropy decreasing and reward increasing, which is expected to successfully guess the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

laddie132/MD3
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications