TL;DR
This paper introduces DSRNet, a transformer-based model that enhances domain-specific dialogue response generation by infusing meta-attributes, leading to more relevant, coherent, and contextually appropriate responses in technical and restaurant domains.
Contribution
The paper proposes DSRNet, a novel transformer model that incorporates domain-specific meta-attributes to improve relevance and coherence in multi-turn, domain-specific dialogue generation.
Findings
DSRNet outperforms state-of-the-art models on Ubuntu and CamRest676 datasets.
Responses generated by DSRNet show higher relevance and better overlap with domain-specific key terms.
Infusion of meta-attributes improves attention over relevant domain-specific terms.
Abstract
Despite the tremendous success of neural dialogue models in recent years, it suffers a lack of relevance, diversity, and some times coherence in generated responses. Lately, transformer-based models, such as GPT-2, have revolutionized the landscape of dialogue generation by capturing the long-range structures through language modeling. Though these models have exhibited excellent language coherence, they often lack relevance and terms when used for domain-specific response generation. In this paper, we present DSRNet (Domain Specific Response Network), a transformer-based model for dialogue response generation by reinforcing domain-specific attributes. In particular, we extract meta attributes from context and infuse them with the context utterances for better attention over domain-specific key terms and relevance. We study the use of DSRNet in a multi-turn multi-interlocutor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Cosine Annealing · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Linear Warmup With Cosine Annealing
