Low Resource Pipeline for Spoken Language Understanding via Weak Supervision
Ayush Kumar, Rishabh Kumar Tripathi, Jithendra Vepa

TL;DR
This paper introduces a prompt-based weak supervision approach for spoken language understanding, enabling effective training with limited or no labeled data, and demonstrates superior performance over traditional methods.
Contribution
It proposes using task-agnostic prompts as a universal weak source for SLU tasks, reducing manual effort and improving low-resource learning performance.
Findings
Prompt-based weak labels are reliable for SLU tasks.
The method outperforms rule-based WSL by over 5% in Macro-F1.
Achieves more than 4% improvement in zero and few-shot learning scenarios.
Abstract
In Weak Supervised Learning (WSL), a model is trained over noisy labels obtained from semantic rules and task-specific pre-trained models. Rules offer limited generalization over tasks and require significant manual efforts while pre-trained models are available only for limited tasks. In this work, we propose to utilize prompt-based methods as weak sources to obtain the noisy labels on unannotated data. We show that task-agnostic prompts are generalizable and can be used to obtain noisy labels for different Spoken Language Understanding (SLU) tasks such as sentiment classification, disfluency detection and emotion classification. These prompts could additionally be updated to add task-specific contexts, thus providing flexibility to design task-specific prompts. We demonstrate that prompt-based methods generate reliable labels for the above SLU tasks and thus can be used as a universal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Topic Modeling · Text and Document Classification Technologies
