LLM-based Weak Supervision Framework for Query Intent Classification in Video Search
Farnoosh Javadi, Phanideep Gampa, Alyssa Woo, Xingxing Geng, Hang, Zhang, Jose Sepulveda, Belhassen Bayar, Fei Wang

TL;DR
This paper presents a weak supervision framework using large language models to automatically generate training data for query intent classification in video search, significantly improving recall and data quality.
Contribution
It introduces a novel LLM-based weak supervision approach with prompt engineering and persona routing to enhance query intent classification without manual labeling.
Findings
113% relative gain in recall over baseline
47.60% improvement in data agreement rate
3.67% increase in weighted F1 score with persona routing
Abstract
Streaming services have reshaped how we discover and engage with digital entertainment. Despite these advancements, effectively understanding the wide spectrum of user search queries continues to pose a significant challenge. An accurate query understanding system that can handle a variety of entities that represent different user intents is essential for delivering an enhanced user experience. We can build such a system by training a natural language understanding (NLU) model; however, obtaining high-quality labeled training data in this specialized domain is a substantial obstacle. Manual annotation is costly and impractical for capturing users' vast vocabulary variations. To address this, we introduce a novel approach that leverages large language models (LLMs) through weak supervision to automatically annotate a vast collection of user search queries. Using prompt engineering and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsSparse Evolutionary Training
