Joint Text and Label Generation for Spoken Language Understanding
Yang Li, Ben Athiwaratkun, Cicero Nogueira dos Santos, Bing Xiang

TL;DR
This paper introduces a method that leverages pretrained language models to generate synthetic data encoding intent and slot labels, improving generalization in spoken language understanding tasks with limited data.
Contribution
It proposes a novel approach to generate labeled synthetic data from pretrained models and employs mixout regularization to handle noisy labels, enhancing intent classification and slot labeling.
Findings
Outperforms baseline methods significantly.
Effective in low-data scenarios.
Proves robustness of mixout regularization against label noise.
Abstract
Generalization is a central problem in machine learning, especially when data is limited. Using prior information to enforce constraints is the principled way of encouraging generalization. In this work, we propose to leverage the prior information embedded in pretrained language models (LM) to improve generalization for intent classification and slot labeling tasks with limited training data. Specifically, we extract prior knowledge from pretrained LM in the form of synthetic data, which encode the prior implicitly. We fine-tune the LM to generate an augmented language, which contains not only text but also encodes both intent labels and slot labels. The generated synthetic data can be used to train a classifier later. Since the generated data may contain noise, we rephrase the learning from generated data as learning with noisy labels. We then utilize the mixout regularization for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Machine Learning and Data Classification
