Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation
Geng Liu, Fei Zhu, Rong Feng, Changyi Ma, Shiqi Wang, Gaofeng Meng

TL;DR
This paper identifies that the performance drop of LLMs in multi-turn conversations is due to intent misalignment, and proposes a Mediator-Assistant architecture to explicitly clarify user intent, significantly improving multi-turn interaction quality.
Contribution
The paper reveals that intent mismatch causes LLMs to struggle in multi-turn conversations and introduces a Mediator-Assistant framework to explicitly align user intent with model understanding.
Findings
Significant mitigation of performance degradation in multi-turn conversations.
Intent alignment gap is the main cause of 'Lost in Conversation' phenomenon.
Mediator-Assistant architecture improves LLM interaction quality.
Abstract
Multi-turn conversation has emerged as a predominant interaction paradigm for Large Language Models (LLMs). Users often employ follow-up questions to refine their intent, expecting LLMs to adapt dynamically. However, recent research reveals that LLMs suffer a substantial performance drop in multi-turn settings compared to single-turn interactions with fully specified instructions, a phenomenon termed ``Lost in Conversation'' (LiC). While this prior work attributes LiC to model unreliability, we argue that the root cause lies in an intent alignment gap rather than intrinsic capability deficits. In this paper, we first demonstrate that LiC is not a failure of model capability but rather a breakdown in interaction between users and LLMs. We theoretically show that scaling model size or improving training alone cannot resolve this gap, as it arises from structural ambiguity in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Artificial Intelligence in Healthcare and Education
