Toward Interpretability of Dual-Encoder Models for Dialogue Response   Suggestions

Yitong Li; Dianqi Li; Sushant Prakash; Peng Wang

arXiv:2003.04998·cs.CL·March 12, 2020·1 cites

Toward Interpretability of Dual-Encoder Models for Dialogue Response Suggestions

Yitong Li, Dianqi Li, Sushant Prakash, Peng Wang

PDF

Open Access

TL;DR

This paper introduces an attentive dual encoder model with a novel regularization loss and residual layer to enhance interpretability and accuracy in dialogue response suggestion tasks.

Contribution

It proposes a new attentive dual encoder architecture with mutual information regularization and residual connections for improved interpretability and performance.

Findings

01

Improved Recall@1 accuracy on Persona and Ubuntu datasets.

02

Enhanced interpretability through visualization of important words.

03

Model outperforms existing methods in response suggestion tasks.

Abstract

This work shows how to improve and interpret the commonly used dual encoder model for response suggestion in dialogue. We present an attentive dual encoder model that includes an attention mechanism on top of the extracted word-level features from two encoders, one for context and one for label respectively. To improve the interpretability in the dual encoder models, we design a novel regularization loss to minimize the mutual information between unimportant words and desired labels, in addition to the original attention method, so that important words are emphasized while unimportant words are de-emphasized. This can help not only with model interpretability, but can also further improve model accuracy. We propose an approximation method that uses a neural network to calculate the mutual information. Furthermore, by adding a residual layer between raw word embeddings and the final…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications

MethodsInterpretability