MCP: Self-supervised Pre-training for Personalized Chatbots with   Multi-level Contrastive Sampling

Zhaoheng Huang; Zhicheng Dou; Yutao Zhu; Zhengyi Ma

arXiv:2210.08753·cs.CL·December 15, 2022·1 cites

MCP: Self-supervised Pre-training for Personalized Chatbots with Multi-level Contrastive Sampling

Zhaoheng Huang, Zhicheng Dou, Yutao Zhu, Zhengyi Ma

PDF

Open Access

TL;DR

This paper introduces MCP, a self-supervised pre-training framework using multi-level contrastive sampling to improve personalized chatbots by better capturing user dialogue history representations, addressing data sparsity and response quality issues.

Contribution

The paper proposes a novel self-supervised learning framework with three contrastive pre-training tasks to enhance user profile representations for personalized chatbots.

Findings

01

Significant performance improvements over existing methods.

02

Effective use of contrastive sampling for dialogue history.

03

Enhanced user profile representations lead to better responses.

Abstract

Personalized chatbots focus on endowing the chatbots with a consistent personality to behave like real users and further act as personal assistants. Previous studies have explored generating implicit user profiles from the user's dialogue history for building personalized chatbots. However, these studies only use the response generation loss to train the entire model, thus it is prone to suffer from the problem of data sparsity. Besides, they overemphasize the final generated response's quality while ignoring the correlations and fusions between the user's dialogue history, leading to rough data representations and performance degradation. To tackle these problems, we propose a self-supervised learning framework MCP for capturing better representations from users' dialogue history for personalized chatbots. Specifically, we apply contrastive sampling methods to leverage the supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · AI in Service Interactions