SSP: Self-Supervised Post-training for Conversational Search

Quan Tu; Shen Gao; Xiaolong Wu; Zhao Cao; Ji-Rong Wen; Rui Yan

arXiv:2307.00569·cs.CL·July 4, 2023

SSP: Self-Supervised Post-training for Conversational Search

Quan Tu, Shen Gao, Xiaolong Wu, Zhao Cao, Ji-Rong Wen, Rui Yan

PDF

Open Access 1 Repo

TL;DR

This paper introduces SSP, a self-supervised post-training method that enhances conversational search models by improving dialogue structure understanding and semantic comprehension, leading to better performance on benchmark datasets.

Contribution

The paper proposes a novel self-supervised post-training paradigm that can be integrated into existing models to improve conversational search performance.

Findings

01

Boosts performance on CAsT-19 and CAsT-20 datasets.

02

Enhances understanding of dialogue structure and semantics.

03

Applicable to most existing conversational models.

Abstract

Conversational search has been regarded as the next-generation search paradigm. Constrained by data scarcity, most existing methods distill the well-trained ad-hoc retriever to the conversational retriever. However, these methods, which usually initialize parameters by query reformulation to discover contextualized dependency, have trouble in understanding the dialogue structure information and struggle with contextual semantic vanishing. In this paper, we propose \fullmodel (\model) which is a new post-training paradigm with three self-supervised tasks to efficiently initialize the conversational search model to enhance the dialogue structure and contextual semantic understanding. Furthermore, the \model can be plugged into most of the existing conversational models to boost their performance. To verify the effectiveness of our proposed method, we apply the conversational encoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

morecry/ssp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Speech and dialogue systems