Dialogue Session Segmentation by Embedding-Enhanced TextTiling

Yiping Song; Lili Mou; Rui Yan; Li Yi; Zinan Zhu; Xiaohua Hu; Ming; Zhang

arXiv:1610.03955·cs.CL·October 14, 2016·1 cites

Dialogue Session Segmentation by Embedding-Enhanced TextTiling

Yiping Song, Lili Mou, Rui Yan, Li Yi, Zinan Zhu, Xiaohua Hu, Ming, Zhang

PDF

Open Access

TL;DR

This paper introduces an embedding-enhanced TextTiling method for session segmentation in human-computer conversations, leveraging word embeddings to improve robustness against noisy utterances.

Contribution

The paper proposes a novel embedding-enhanced TextTiling approach that outperforms existing methods in session segmentation tasks.

Findings

01

Improved segmentation accuracy over TextTiling and MMD methods

02

Embedding enhancement effectively captures semantic context in noisy conversations

03

Demonstrated robustness in real-world dialogue datasets

Abstract

In human-computer conversation systems, the context of a user-issued utterance is particularly important because it provides useful background information of the conversation. However, it is unwise to track all previous utterances in the current session as not all of them are equally important. In this paper, we address the problem of session segmentation. We propose an embedding-enhanced TextTiling approach, inspired by the observation that conversation utterances are highly noisy, and that word embeddings provide a robust way of capturing semantics. Experimental results show that our approach achieves better performance than the TextTiling, MMD approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques