Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings
Sakshi Deo Shukla, Pavel Denisov, Tugtekin Turan

TL;DR
This paper introduces an end-to-end multilingual speech-based topic segmentation method that bypasses traditional transcript-based approaches, demonstrating competitive results on a new diverse broadcasted news dataset across six European languages and Hindi.
Contribution
The paper presents a novel end-to-end speech segmentation approach using multilingual semantic embeddings and establishes a new benchmark dataset for spoken news topic segmentation.
Findings
End-to-end model achieves a $P_k$ score of 0.2564 for English.
Multilingual training improves scores to 0.1988 and 0.2370.
Model and data scripts are publicly released for research use.
Abstract
Recent advancements in speech-based topic segmentation have highlighted the potential of pretrained speech encoders to capture semantic representations directly from speech. Traditionally, topic segmentation has relied on a pipeline approach in which transcripts of the automatic speech recognition systems are generated, followed by text-based segmentation algorithms. In this paper, we introduce an end-to-end scheme that bypasses this conventional two-step process by directly employing semantic speech encoders for segmentation. Focused on the broadcasted news domain, which poses unique challenges due to the diversity of speakers and topics within single recordings, we address the challenge of accessing topic change points efficiently in an end-to-end manner. Furthermore, we propose a new benchmark for spoken news topic segmentation by utilizing a dataset featuring approximately 1000…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
