Evaluating Task-Oriented Dialogue Consistency through Constraint   Satisfaction

Tiziano Labruna; Bernardo Magnini

arXiv:2407.11857·cs.CL·July 17, 2024

Evaluating Task-Oriented Dialogue Consistency through Constraint Satisfaction

Tiziano Labruna, Bernardo Magnini

PDF

Open Access

TL;DR

This paper models task-oriented dialogue consistency as a Constraint Satisfaction Problem (CSP) to detect inconsistencies and evaluate the challenges faced by large language models in maintaining dialogue coherence.

Contribution

It introduces a novel CSP-based framework for assessing dialogue consistency and demonstrates its effectiveness over traditional pipeline approaches.

Findings

01

CSP effectively detects dialogue inconsistencies.

02

State-of-the-art LLMs struggle with dialogue re-lexicalization, achieving only 0.15 accuracy.

03

Domain knowledge constraints are the most difficult to satisfy.

Abstract

Task-oriented dialogues must maintain consistency both within the dialogue itself, ensuring logical coherence across turns, and with the conversational domain, accurately reflecting external knowledge. We propose to conceptualize dialogue consistency as a Constraint Satisfaction Problem (CSP), wherein variables represent segments of the dialogue referencing the conversational domain, and constraints among variables reflect dialogue properties, including linguistic, conversational, and domain-based aspects. To demonstrate the feasibility of the approach, we utilize a CSP solver to detect inconsistencies in dialogues re-lexicalized by an LLM. Our findings indicate that: (i) CSP is effective to detect dialogue inconsistencies; and (ii) consistent dialogue re-lexicalization is challenging for state-of-the-art LLMs, achieving only a 0.15 accuracy rate when compared to a CSP solver.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Service-Oriented Architecture and Web Services