A Study on Dialogue Reward Prediction for Open-Ended Conversational Agents
Heriberto Cuay\'ahuitl, Seonghan Ryu, Donghyeon Lee, Jihie Kim

TL;DR
This paper investigates how the length of dialogue history affects the accuracy of reward prediction in conversational agents, finding that longer histories improve performance and are essential for reliable reward estimation.
Contribution
It provides a systematic analysis of dialogue history length's impact on reward prediction accuracy, highlighting the importance of using extended context in conversational systems.
Findings
Longer dialogue histories (up to 25 sentences) improve reward prediction accuracy.
Extended context leads to stronger correlation between predicted and actual rewards.
Using lengthy histories enhances training of reward predictors for conversational agents.
Abstract
The amount of dialogue history to include in a conversational agent is often underestimated and/or set in an empirical and thus possibly naive way. This suggests that principled investigations into optimal context windows are urgently needed given that the amount of dialogue history and corresponding representations can play an important role in the overall performance of a conversational system. This paper studies the amount of history required by conversational agents for reliably predicting dialogue rewards. The task of dialogue reward prediction is chosen for investigating the effects of varying amounts of dialogue history and their impact on system performance. Experimental results using a dataset of 18K human-human dialogues report that lengthy dialogue histories of at least 10 sentences are preferred (25 sentences being the best in our experiments) over short ones, and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
