Low-Resource Adaptation of Open-Domain Generative Chatbots
Greyson Gerhard-Young, Raviteja Anantha, Srinivas Chappidi, Bj\"orn, Hoffmeister

TL;DR
This paper presents a framework for adapting large open-domain chatbots into low-resource models that retain conversational abilities, improve domain-specific performance, and handle multi-turn interactions efficiently on user devices.
Contribution
It introduces a novel framework for low-parameter chatbots that maintains knowledge, manages diverse question types, and transitions smoothly between chatting and transactional tasks.
Findings
Achieves comparable performance with 90% fewer parameters.
Retains general knowledge and conversational abilities in low-resource models.
Effectively handles multi-turn conversations and domain-specific tasks.
Abstract
Recent work building open-domain chatbots has demonstrated that increasing model size improves performance. On the other hand, latency and connectivity considerations dictate the move of digital assistants on the device. Giving a digital assistant like Siri, Alexa, or Google Assistant the ability to discuss just about anything leads to the need for reducing the chatbot model size such that it fits on the user's device. We demonstrate that low parameter models can simultaneously retain their general knowledge conversational abilities while improving in a specific domain. Additionally, we propose a generic framework that accounts for variety in question types, tracks reference throughout multi-turn conversations, and removes inconsistent and potentially toxic responses. Our framework seamlessly transitions between chatting and performing transactional tasks, which will ultimately make…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · AI in Service Interactions · Recommender Systems and Techniques
