Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs

Jing Yang Lee; Kong-Aik Lee; Woon-Seng Gan

arXiv:2506.15131 · cs.CL · January 5, 2026


TL;DR

This paper models the one-to-many property of open-domain dialogue in LLMs with a two-stage approach: Multi-Response Generation followed by Preference-based Selection. Supported by a new corpus, o2mDial, the approach aims to improve both response diversity and response quality.

Contribution

The paper introduces o2mDial, a new dialogue corpus that provides multiple plausible responses for each context, and proposes a two-stage framework that uses in-context learning and instruction tuning to improve response diversity.

Findings

01 Enhanced response diversity in smaller LLMs
02 Response quality improved by up to 90%
03 Closer performance to larger models

Abstract

Open-domain Dialogue (OD) exhibits a one-to-many (o2m) property, whereby multiple appropriate responses exist for a single dialogue context. Despite prior research showing that modeling this property boosts response diversity, most modern LLM-based dialogue agents do not explicitly do so. In this work, we model the o2m property of OD in LLMs by decomposing OD generation into two key tasks: Multi-Response Generation (MRG) and Preference-based Selection (PS), which entail generating a set of n semantically and lexically diverse high-quality responses for a given dialogue context, followed by selecting a single response based on human preference, respectively. To facilitate MRG and PS, we introduce o2mDial, a dialogue corpus explicitly designed to capture the o2m property by featuring multiple plausible responses for each context. Leveraging o2mDial, we propose new in-context learning and…
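
The two-stage decomposition described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: `generate_many` and `preference_score` are hypothetical callables standing in for an LLM-based multi-response generator and a preference model, and `distinct_n` is a commonly used lexical-diversity proxy that is assumed here for illustration, not necessarily the metric used in the paper.

```python
from typing import Callable, List, Sequence


def distinct_n(responses: Sequence[str], n: int = 2) -> float:
    """Ratio of unique n-grams to total n-grams across a response set.

    A common lexical-diversity proxy; assumed here for illustration only.
    """
    ngrams = []
    for response in responses:
        tokens = response.lower().split()
        ngrams.extend(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    return len(set(ngrams)) / len(ngrams) if ngrams else 0.0


def two_stage_respond(
    context: str,
    generate_many: Callable[[str, int], List[str]],   # hypothetical MRG stand-in
    preference_score: Callable[[str, str], float],    # hypothetical PS stand-in
    n: int = 5,
) -> str:
    """Stage 1 (MRG): produce n candidates. Stage 2 (PS): keep the preferred one."""
    candidates = generate_many(context, n)
    if not candidates:
        raise ValueError("generator returned no candidate responses")
    return max(candidates, key=lambda r: preference_score(context, r))


# Toy stand-ins so the sketch runs without a model (purely illustrative).
def toy_generate(context: str, n: int) -> List[str]:
    pool = [
        "It was great, thanks!",
        "Pretty quiet, I mostly stayed in and read.",
        "Busy! I went hiking on Saturday.",
    ]
    return pool[:n]


def toy_score(context: str, response: str) -> float:
    # Placeholder preference: favours longer responses.
    return float(len(response))


if __name__ == "__main__":
    ctx = "How was your weekend?"
    print(distinct_n(toy_generate(ctx, 3)))
    print(two_stage_respond(ctx, toy_generate, toy_score, n=3))
```

Keeping generation and selection behind separate callables mirrors the MRG/PS split: either stage can be swapped (for example, a different prompt for generation or a different preference model for selection) without touching the other.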


Taxonomy

Topics: Multi-Agent Systems and Negotiation · Semantic Web and Ontologies · Natural Language Processing Techniques