Meta-Context Transformers for Domain-Specific Response Generation

Debanjana Kar; Suranjana Samanta; Amar Prakash Azad

arXiv:2010.05572·cs.CL·October 13, 2020

Meta-Context Transformers for Domain-Specific Response Generation

Debanjana Kar, Suranjana Samanta, Amar Prakash Azad

PDF

1 Repo

TL;DR

This paper introduces DSRNet, a transformer-based model that enhances domain-specific dialogue response generation by infusing meta-attributes, leading to more relevant, coherent, and contextually appropriate responses in technical and restaurant domains.

Contribution

The paper proposes DSRNet, a novel transformer model that incorporates domain-specific meta-attributes to improve relevance and coherence in multi-turn, domain-specific dialogue generation.

Findings

01

DSRNet outperforms state-of-the-art models on Ubuntu and CamRest676 datasets.

02

Responses generated by DSRNet show higher relevance and better overlap with domain-specific key terms.

03

Infusion of meta-attributes improves attention over relevant domain-specific terms.

Abstract

Despite the tremendous success of neural dialogue models in recent years, it suffers a lack of relevance, diversity, and some times coherence in generated responses. Lately, transformer-based models, such as GPT-2, have revolutionized the landscape of dialogue generation by capturing the long-range structures through language modeling. Though these models have exhibited excellent language coherence, they often lack relevance and terms when used for domain-specific response generation. In this paper, we present DSRNet (Domain Specific Response Network), a transformer-based model for dialogue response generation by reinforcing domain-specific attributes. In particular, we extract meta attributes from context and infuse them with the context utterances for better attention over domain-specific key terms and relevance. We study the use of DSRNet in a multi-turn multi-interlocutor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amarazad/DSRNet
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Cosine Annealing · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Linear Warmup With Cosine Annealing