Advancing Semi-Supervised Task Oriented Dialog Systems by JSA Learning of Discrete Latent Variable Models
Yucheng Cai, Hong Liu, Zhijian Ou, Yi Huang, Junlan Feng

TL;DR
This paper introduces JSA-TOD, a semi-supervised learning approach for task-oriented dialog systems using joint stochastic approximation, which outperforms traditional variational methods and achieves near full-supervision performance with limited labels.
Contribution
It is the first to apply JSA to semi-supervised learning of discrete latent variable models in task-oriented dialog systems, improving performance over variational methods.
Findings
JSA-TOD significantly outperforms variational learning methods.
Semi-supervised JSA-TOD with 20% labels approaches full-supervision performance.
JSA-TOD effectively leverages unlabeled dialog data for better dialog system training.
Abstract
Developing semi-supervised task-oriented dialog (TOD) systems by leveraging unlabeled dialog data has attracted increasing interests. For semi-supervised learning of latent state TOD models, variational learning is often used, but suffers from the annoying high-variance of the gradients propagated through discrete latent variables and the drawback of indirectly optimizing the target log-likelihood. Recently, an alternative algorithm, called joint stochastic approximation (JSA), has emerged for learning discrete latent variable models with impressive performances. In this paper, we propose to apply JSA to semi-supervised learning of the latent state TOD models, which is referred to as JSA-TOD. To our knowledge, JSA-TOD represents the first work in developing JSA based semi-supervised learning of discrete latent variable conditional models for such long sequential generation problems like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Advanced Text Analysis Techniques
