TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots

Wentao Ma; Yiming Cui; Nan Shao; Su He; Wei-Nan Zhang; Ting Liu; Shijin Wang; Guoping Hu

arXiv:1909.10666·cs.CL·November 5, 2019

TL;DR

TripleNet introduces a triple attention mechanism that models the relationships among context, query, and response in multi-turn response selection, and it outperforms previous state-of-the-art methods on two large-scale datasets.

Contribution

The paper proposes TripleNet, a new model with triple attention that fully models the <context, query, response> triple, advancing multi-turn response selection in retrieval-based chatbots.

Findings

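01 Outperforms state-of-the-art methods on two large-scale multi-turn response selection datasets

02 Models the relationships among context, query, and response at four levels, from the char level to the context level

03 Reports significant improvements over prior methods on both datasets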

Abstract

We observe that the importance of different utterances in the context for selecting the response usually depends on the current query. In this paper, we propose TripleNet, which models the task with the triple <context, query, response> instead of the pair <context, response> used in previous works. The heart of TripleNet is a novel attention mechanism, named triple attention, that models the relationships within the triple at four levels. The new mechanism updates the representation of each element based on its attention with the other two, concurrently and symmetrically. We match the triple <C, Q, R>, centered on the response, from the char level to the context level for prediction. Experimental results on two large-scale multi-turn response selection datasets show that the proposed model significantly outperforms the state-of-the-art methods. TripleNet source code is available at…
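The concurrent, symmetric update described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the scoring function or how the two attended views are fused, so dot-product attention and a plain additive combination are assumed here, and the names `attend`, `add`, and `triple_attention` are hypothetical. Only one level is shown, whereas the paper applies the mechanism at four.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(x, y):
    """For each vector in x, return a softmax-weighted average of the vectors in y.

    Dot-product scoring is an assumption; the abstract does not name one.
    """
    out = []
    for xi in x:
        weights = softmax([sum(a * b for a, b in zip(xi, yj)) for yj in y])
        out.append([sum(w * yj[d] for w, yj in zip(weights, y))
                    for d in range(len(xi))])
    return out

def add(*seqs):
    """Element-wise sum of equally shaped sequences of vectors."""
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*seqs)]

def triple_attention(c, q, r):
    """Update context, query, and response from the ORIGINAL inputs.

    Concurrent: every update reads the un-updated c, q, r.
    Symmetric: the same rule is applied to each of the three elements.
    The additive fusion is an assumption made for this sketch.
    """
    new_c = add(c, attend(c, q), attend(c, r))
    new_q = add(q, attend(q, c), attend(q, r))
    new_r = add(r, attend(r, c), attend(r, q))
    return new_c, new_q, new_r

# Toy inputs: each element is a list of 2-dimensional token vectors.
c = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # context, 3 tokens
q = [[1.0, 0.0]]                          # query, 1 token
r = [[0.0, 1.0], [1.0, 0.0]]              # response, 2 tokens
new_c, new_q, new_r = triple_attention(c, q, r)
```

Each output keeps the shape of its input, so an update of this kind could in principle be repeated at several levels of representation.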
