TL;DR
This paper demonstrates that transformer models, especially with broader context and punctuation, significantly improve dialog act recognition accuracy, highlighting key factors influencing model performance in conversational understanding.
Contribution
The study shows the impact of context, punctuation, and label set specificity on transformer-based dialog act recognition, providing insights for improving spoken language understanding.
Findings
Broader context improves disambiguation of infrequent dialog acts.
Punctuation presence drastically enhances model performance.
Label set specificity does not affect segmentation accuracy.
Abstract
Dialog acts can be interpreted as the atomic units of a conversation, more fine-grained than utterances, characterized by a specific communicative function. The ability to structure a conversational transcript as a sequence of dialog acts -- dialog act recognition, including the segmentation -- is critical for understanding dialog. We apply two pre-trained transformer models, XLNet and Longformer, to this task in English and achieve strong results on Switchboard Dialog Act and Meeting Recorder Dialog Act corpora with dialog act segmentation error rates (DSER) of 8.4% and 14.2%. To understand the key factors affecting dialog act recognition, we perform a comparative analysis of models trained under different conditions. We find that the inclusion of a broader conversational context helps disambiguate many dialog act classes, especially those infrequent in the training data. The presence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · AdamW · How do I make a claim with Expedia?*Make FastClaimService · Attention Dropout · WordPiece · Byte Pair Encoding · Weight Decay · How do I get a human at Expedia immediately? (2025-2026)
