Spoken Language Understanding for Conversational AI: Recent Advances and Future Direction
Soyeon Caren Han, Siqu Long, Henry Weld, Josiah Poon

TL;DR
This paper reviews recent advances in spoken language understanding for conversational AI, focusing on deep learning models like Transformers and BERT, and discusses future directions and practical implementations.
Contribution
It provides a comprehensive overview of joint intent detection and slot filling models, highlighting recent deep learning techniques and offering practical code demonstrations.
Findings
Transformer-based models improve accuracy in intent detection and slot filling.
Pre-trained models like BERT enhance performance on SLU tasks.
Joint modeling boosts overall understanding in conversational AI.
Abstract
When a human communicates with a machine using natural language on the web and online, how can it understand the human's intention and semantic context of their talk? This is an important AI task as it enables the machine to construct a sensible answer or perform a useful action for the human. Meaning is represented at the sentence level, identification of which is known as intent detection, and at the word level, a labelling task called slot filling. This dual-level joint task requires innovative thinking about natural language and deep learning network design, and as a result, many approaches and models have been proposed and applied. This tutorial will discuss how the joint task is set up and introduce Spoken Language Understanding/Natural Language Understanding (SLU/NLU) with Deep Learning techniques. We will cover the datasets, experiments and metrics used in the field. We will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Dense Connections · Residual Connection · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Absolute Position Encodings
