Ultra-Fast, Low-Storage, Highly Effective Coarse-grained Selection in Retrieval-based Chatbot by Using Deep Semantic Hashing
Tian Lan, Xian-Ling Mao, Xiaoyan Gao, Wei Wei, Heyan Huang

TL;DR
This paper compares sparse and dense representations for coarse-grained selection in retrieval-based chatbots, and introduces a deep semantic hashing method that achieves fast, low-storage, and effective selection with minimal performance loss.
Contribution
The paper systematically compares existing sparse and dense methods and proposes a novel deep semantic hashing approach that enhances speed and reduces storage in chatbot retrieval systems.
Findings
Dense representation outperforms sparse in effectiveness but is slower and requires more storage.
The proposed DSHC model significantly improves speed and reduces storage with limited performance loss.
Source code is publicly available for further research.
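To make the speed and storage trade-off concrete, here is a minimal sketch of coarse-grained selection over binary hash codes using Hamming distance. It is not the paper's DSHC model, which learns its hash codes: random-hyperplane sign hashing is swapped in as a stand-in, and every name and size (`sign_hash`, `top_k`, `dim`, `n_bits`, the 100 random "responses") is a hypothetical choice made for illustration.

```python
import random

def sign_hash(vec, planes):
    """Map a dense vector to an integer bit code, one bit per hyperplane."""
    code = 0
    for plane in planes:
        dot = sum(v * p for v, p in zip(vec, plane))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code

def hamming(a, b):
    """Number of differing bits between two integer codes."""
    return bin(a ^ b).count("1")

def top_k(query_code, index_codes, k):
    """Indices of the k stored codes closest to the query in Hamming distance."""
    order = sorted(range(len(index_codes)),
                   key=lambda i: hamming(query_code, index_codes[i]))
    return order[:k]

rng = random.Random(0)
dim, n_bits = 16, 32  # assumed sizes, not taken from the paper

# Random hyperplanes stand in for a learned hashing function.
planes = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_bits)]

# Toy dense vectors standing in for encoded candidate responses.
responses = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(100)]
codes = [sign_hash(v, planes) for v in responses]

# A query pointing in the same direction as response 7 gets the same code,
# because sign hashing ignores vector length.
query = [2.0 * x for x in responses[7]]
query_code = sign_hash(query, planes)
candidates = top_k(query_code, codes, k=5)

# Per-item index size: float32 dense vector versus packed bit code.
dense_bytes = dim * 4
hash_bytes = n_bits // 8
```

In this toy setup each stored item shrinks from 64 bytes to 4, and ranking needs only an XOR and a bit count per candidate rather than a floating-point dot product. The cost is that many distinct vectors share nearby codes, which is where the performance loss of a hashing approach comes from.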
Abstract
We study the coarse-grained selection module in retrieval-based chatbots. Coarse-grained selection is a basic module in a retrieval-based chatbot, which constructs a rough candidate set from the whole database to speed up the interaction with customers. So far, there are two kinds of approaches for the coarse-grained selection module: (1) sparse representation; (2) dense representation. To the best of our knowledge, there is no systematic comparison between these two approaches in retrieval-based chatbots, and which kind of method is better in real scenarios is still an open question. In this paper, we first systematically compare these two methods from four aspects: (1) effectiveness; (2) index storage; (3) search time cost; (4) human evaluation. Extensive experimental results demonstrate that the dense representation method significantly outperforms the sparse representation, but costs more time…
Taxonomy
Topics: Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Topic Modeling
