An Approach to Build Zero-Shot Slot-Filling System for Industry-Grade   Conversational Assistants

G P Shrivatsa Bhargav; Sumit Neelam; Udit Sharma; Shajith Ikbal,; Dheeraj Sreedhar; Hima Karanam; Sachindra Joshi; Pankaj Dhoolia; Dinesh Garg,; Kyle Croutwater; Haode Qi; Eric Wayne; J William Murdock

arXiv:2406.08848·cs.CL·June 14, 2024

An Approach to Build Zero-Shot Slot-Filling System for Industry-Grade Conversational Assistants

G P Shrivatsa Bhargav, Sumit Neelam, Udit Sharma, Shajith Ikbal,, Dheeraj Sreedhar, Hima Karanam, Sachindra Joshi, Pankaj Dhoolia, Dinesh Garg,, Kyle Croutwater, Haode Qi, Eric Wayne, J William Murdock

PDF

Open Access

TL;DR

This paper introduces a low-latency, zero-shot slot-filling system for conversational assistants using fine-tuned small LLMs, achieving significant performance improvements across multiple domains.

Contribution

It presents a novel fine-tuning approach with carefully prepared data enabling zero-shot, industry-grade slot-filling with reduced latency and improved accuracy.

Findings

01

6.9% relative F1 improvement over baseline

02

57% reduction in latency

03

4.2% average F1 gain across slot types

Abstract

We present an approach to build Large Language Model (LLM) based slot-filling system to perform Dialogue State Tracking in conversational assistants serving across a wide variety of industry-grade applications. Key requirements of this system include: 1) usage of smaller-sized models to meet low latency requirements and to enable convenient and cost-effective cloud and customer premise deployments, and 2) zero-shot capabilities to serve across a wide variety of domains, slot types and conversational scenarios. We adopt a fine-tuning approach where a pre-trained LLM is fine-tuned into a slot-filling model using task specific data. The fine-tuning data is prepared carefully to cover a wide variety of slot-filling task scenarios that the model is expected to face across various domains. We give details of the data preparation and model building process. We also give a detailed analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions