A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
Hongru Wang, Lingzhi Wang, Yiming Du, Liang Chen, Jingyan Zhou, Yufei Wang, Kam-Fai Wong

TL;DR
This survey reviews the evolution of language model-based dialogue systems, highlighting historical developments, recent advancements with pre-trained and large language models, and discussing future challenges and directions.
Contribution
It provides a systematic, chronological review of dialogue system evolution in relation to language model breakthroughs, offering insights into future research directions.
Findings
Categorizes dialogue system development stages aligned with LM breakthroughs
Highlights the impact of pre-trained and large language models on dialogue systems
Discusses open challenges and future research directions
Abstract
Dialogue systems (DS), including the task-oriented dialogue system (TOD) and the open-domain dialogue system (ODD), have always been a fundamental task in natural language processing (NLP), allowing various applications in practice. Owing to sophisticated training and well-designed model architecture, language models (LM) are usually adopted as the necessary backbone to build the dialogue system. Consequently, every breakthrough in LM brings about a shift in learning paradigm and research attention within dialogue system, especially the appearance of pre-trained language models (PLMs) and large language models (LLMs). In this paper, we take a deep look at the history of the dialogue system, especially its special relationship with the advancements of language models. Thus, our survey offers a systematic perspective, categorizing different stages in a chronological order aligned with LM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · Focus · Long Short-Term Memory
