TL;DR
This paper introduces the Bag-of-Keywords (BoK) loss, an auxiliary training method that improves dialogue response relevance and interpretability by focusing on key words, enhancing existing models and evaluation metrics.
Contribution
The paper proposes the novel BoK loss that predicts core keywords to improve response quality and interpretability in dialogue systems, applicable to encoder-decoder and decoder-only architectures.
Findings
BoK loss enhances dialogue generation quality.
BoK loss enables post-hoc interpretability.
BoK-LM as a reference-free evaluation metric performs comparably to state-of-the-art metrics.
Abstract
The standard language modeling (LM) loss by itself has been shown to be inadequate for effective dialogue modeling. As a result, various training approaches, such as auxiliary loss functions and leveraging human feedback, are being adopted to enrich open-domain dialogue systems. One such auxiliary loss function is Bag-of-Words (BoW) loss, defined as the cross-entropy loss for predicting all the words/tokens of the next utterance. In this work, we propose a novel auxiliary loss named Bag-of-Keywords (BoK) loss to capture the central thought of the response through keyword prediction and leverage it to enhance the generation of meaningful and interpretable responses in open-domain dialogue systems. BoK loss upgrades the BoW loss by predicting only the keywords or critical words/tokens of the next utterance, intending to estimate the core idea rather than the entire response. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
