BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response Generation

Suvodip Dey; Maunendra Sankar Desarkar

arXiv:2501.10328·cs.CL·January 20, 2025

TL;DR

This paper introduces the Bag-of-Keywords (BoK) loss, an auxiliary training loss that predicts the keywords of the next utterance. Adding it to existing dialogue models improves the relevance and interpretability of their responses, and BoK-LM can serve as a reference-free evaluation metric.

Contribution

The paper proposes BoK loss, a novel auxiliary loss that predicts the core keywords of the next utterance to improve response quality and interpretability in dialogue systems. It applies to both encoder-decoder and decoder-only architectures.

Findings

01 BoK loss enhances dialogue generation quality.

02 BoK loss enables post-hoc interpretability.

03 BoK-LM as a reference-free evaluation metric performs comparably to state-of-the-art metrics.

Abstract

The standard language modeling (LM) loss by itself has been shown to be inadequate for effective dialogue modeling. As a result, various training approaches, such as auxiliary loss functions and leveraging human feedback, are being adopted to enrich open-domain dialogue systems. One such auxiliary loss function is Bag-of-Words (BoW) loss, defined as the cross-entropy loss for predicting all the words/tokens of the next utterance. In this work, we propose a novel auxiliary loss named Bag-of-Keywords (BoK) loss to capture the central thought of the response through keyword prediction and leverage it to enhance the generation of meaningful and interpretable responses in open-domain dialogue systems. BoK loss upgrades the BoW loss by predicting only the keywords or critical words/tokens of the next utterance, intending to estimate the core idea rather than the entire response. We…
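The difference between the two auxiliary losses described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the toy vocabulary, the logits, the keyword list, and the helper names `log_softmax` and `bag_loss` are all hypothetical, and averaging over target tokens is an assumption (the abstract only says the loss is a cross-entropy over the predicted tokens). The sketch assumes a single order-free distribution over the vocabulary, predicted from the dialogue context, and scores it against either all response tokens (BoW) or only the keywords (BoK).

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def bag_loss(logits, target_ids):
    """Mean negative log-likelihood of target_ids under one order-free
    vocabulary distribution (averaging is an assumption of this sketch)."""
    if not target_ids:
        raise ValueError("target_ids must not be empty")
    log_probs = log_softmax(logits)
    return -sum(log_probs[t] for t in target_ids) / len(target_ids)


# Hypothetical toy vocabulary and context-conditioned logits.
vocab = ["i", "the", "love", "hiking", "in", "mountains"]
logits = [0.5, 0.4, 1.0, 2.0, 0.3, 1.8]
tok2id = {w: i for i, w in enumerate(vocab)}

response = ["i", "love", "hiking", "in", "the", "mountains"]
# Assumed output of some keyword extractor; the excerpt does not say which one is used.
keywords = ["hiking", "mountains"]

# BoW: every token of the next utterance is a prediction target.
bow_loss = bag_loss(logits, [tok2id[w] for w in response])
# BoK: only the keywords of the next utterance are prediction targets.
bok_loss = bag_loss(logits, [tok2id[w] for w in keywords])

print(f"BoW loss: {bow_loss:.4f}")
print(f"BoK loss: {bok_loss:.4f}")
```

In this toy setup the BoK value is lower than the BoW value because the logits favour the two keywords. BoW also penalises the model for low probability on function words such as "i" or "the", whereas BoK targets only the tokens that carry the core idea of the response.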

Code & Models

Repositories

suvodipdey/bok
PyTorch · Official
