Chat-Ghosting: A Comparative Study of Methods for Auto-Completion in Dialog Systems
Sandeep Mishra, Anubhab Mandal, Bishal Santra, Tushar Abhishek, Pawan Goyal, Manish Gupta

TL;DR
This paper compares various auto-completion methods for dialog systems, revealing that traditional statistical models outperform deep learning models on seen queries, while neural models excel on unseen queries, especially with conversational context.
Contribution
The study provides a comprehensive benchmark of ghosting methods across multiple datasets and introduces a novel entropy-based early stopping strategy.
Findings
Statistical n-gram models outperform deep learning models on seen prefixes.
Neural models like T5 perform better on unseen queries.
Adding dialog context significantly improves ghosting quality.
Abstract
Ghosting, the ability to predict a user's intended text input for inline query auto-completion, is an invaluable feature for modern search engines and chat interfaces, greatly enhancing user experience. By suggesting completions to incomplete queries (or prefixes), ghosting aids users with slow typing speeds, disabilities, or limited language proficiency. Ghosting is a challenging problem and has become more important with the ubiquitousness of chat-based systems like ChatGPT, Copilot, etc. Despite the increasing prominence of chat-based systems utilizing ghosting, this challenging problem of Chat-Ghosting has received little attention from the NLP/ML research community. There is a lack of standardized benchmarks and relative performance analysis of deep learning and non-deep learning methods. We address this through an open and thorough study of this problem using four publicly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications
MethodsAttention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Dropout · Gated Linear Unit · Adafactor · Inverse Square Root Schedule · Dense Connections · Softmax · T5
