A Compare Aggregate Transformer for Understanding Document-grounded   Dialogue

Longxuan Ma; Weinan Zhang; Runxin Sun; Ting Liu

arXiv:2010.00190·cs.CL·October 2, 2020·1 cites

A Compare Aggregate Transformer for Understanding Document-grounded Dialogue

Longxuan Ma, Weinan Zhang, Runxin Sun, Ting Liu

PDF

Open Access

TL;DR

This paper introduces the Compare Aggregate Transformer (CAT), a novel model that denoises dialogue context and effectively utilizes external documents for improved response generation in document-grounded dialogue systems.

Contribution

The paper proposes a new Compare Aggregate Transformer with comparison mechanisms to reduce noise and enhance document information aggregation for dialogue response generation.

Findings

01

CAT outperforms state-of-the-art models on CMUDoG dataset

02

Introduces two metrics for evaluating document utilization efficiency

03

Effectively reduces noise in dialogue context processing

Abstract

Unstructured documents serving as external knowledge of the dialogues help to generate more informative responses. Previous research focused on knowledge selection (KS) in the document with dialogue. However, dialogue history that is not related to the current dialogue may introduce noise in the KS processing. In this paper, we propose a Compare Aggregate Transformer (CAT) to jointly denoise the dialogue context and aggregate the document information for response generation. We designed two different comparison mechanisms to reduce noise (before and during decoding). In addition, we propose two metrics for evaluating document utilization efficiency based on word overlap. Experimental results on the CMUDoG dataset show that the proposed CAT model outperforms the state-of-the-art approach and strong baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Dropout · Layer Normalization · Byte Pair Encoding · Label Smoothing · Multi-Head Attention · Attention Is All You Need