Unsupervised Topic Segmentation of Meetings with BERT Embeddings
Alessandro Solbiati, Kevin Heffernan, Georgios Damaskinos, Shivani, Poddar, Shubham Modi, Jacques Cali

TL;DR
This paper presents an unsupervised method for meeting topic segmentation using BERT embeddings, significantly improving accuracy over previous approaches without requiring annotated data.
Contribution
It introduces a novel unsupervised approach leveraging pre-trained BERT embeddings for meeting topic segmentation, outperforming existing methods.
Findings
15.5% reduction in error rate over previous unsupervised methods
Effective on multiple meeting transcript datasets
Demonstrates the benefit of pre-trained neural architectures in unsupervised tasks
Abstract
Topic segmentation of meetings is the task of dividing multi-person meeting transcripts into topic blocks. Supervised approaches to the problem have proven intractable due to the difficulties in collecting and accurately annotating large datasets. In this paper we show how previous unsupervised topic segmentation methods can be improved using pre-trained neural architectures. We introduce an unsupervised approach based on BERT embeddings that achieves a 15.5% reduction in error rate over existing unsupervised approaches applied to two popular datasets for meeting transcripts.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Complex Network Analysis Techniques · Advanced Text Analysis Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Adam · Layer Normalization · WordPiece · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay
