Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Jatin Ganhotra, Samuel Thomas, Hong-Kwang J. Kuo, Sachindra Joshi,, George Saon, Zolt\'an T\"uske, Brian Kingsbury

TL;DR
This paper demonstrates that incorporating dialog history encoded with BERT into end-to-end spoken language understanding systems significantly improves performance on dialog action and intent recognition tasks.
Contribution
The paper introduces a method to integrate dialog history into end-to-end SLU systems using BERT embeddings, enhancing understanding of conversational context.
Findings
8% improvement in dialog action recognition
30% improvement in caller intent recognition
Effective use of BERT-encoded dialog history
Abstract
End-to-end spoken language understanding (SLU) systems that process human-human or human-computer interactions are often context independent and process each turn of a conversation independently. Spoken conversations on the other hand, are very much context dependent, and dialog history contains useful information that can improve the processing of each conversational turn. In this paper, we investigate the importance of dialog history and how it can be effectively integrated into end-to-end SLU systems. While processing a spoken utterance, our proposed RNN transducer (RNN-T) based SLU model has access to its dialog history in the form of decoded transcripts and SLU labels of previous turns. We encode the dialog history as BERT embeddings, and use them as an additional input to the SLU model along with the speech features for the current utterance. We evaluate our approach on a recently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · WordPiece · Softmax · Residual Connection · Attention Dropout · Layer Normalization · Dropout · Dense Connections
