A comprehensive solution to retrieval-based chatbot construction

Kristen Moore; Shenjun Zhong; Zhen He; Torsten Rudolf; Nils Fisher; Brandon Victor; Neha Jindal

arXiv:2106.06139·cs.CL·August 18, 2025·6 cites

A comprehensive solution to retrieval-based chatbot construction

Kristen Moore, Shenjun Zhong, Zhen He, Torsten Rudolf, Nils Fisher, Brandon Victor, Neha Jindal

PDF

Open Access

TL;DR

This paper introduces an end-to-end framework for building retrieval-based chatbots using self-supervised contrastive learning, demonstrating its effectiveness in real-world customer support scenarios.

Contribution

It provides a comprehensive pipeline from unlabelled chatlogs to deployed chatbot, including dataset creation, response selection, and architecture design, with a focus on contrastive learning.

Findings

01

Self-supervised contrastive learning outperforms other models in response selection.

02

Hierarchical RNN architecture meets deployment speed requirements.

03

Effective use of chatlogs for dataset creation and training.

Abstract

In this paper we present the results of our experiments in training and deploying a self-supervised retrieval-based chatbot trained with contrastive learning for assisting customer support agents. In contrast to most existing research papers in this area where the focus is on solving just one component of a deployable chatbot, we present an end-to-end set of solutions to take the reader from an unlabelled chatlogs to a deployed chatbot. This set of solutions includes creating a self-supervised dataset and a weakly labelled dataset from chatlogs, as well as a systematic approach to selecting a fixed list of canned responses. We present a hierarchical-based RNN architecture for the response selection model, chosen for its ability to cache intermediate utterance embeddings, which helped to meet deployment inference speed requirements. We compare the performance of this architecture across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · AI in Service Interactions · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning