MeetingBank: A Benchmark Dataset for Meeting Summarization
Yebowen Hu, Tim Ganter, Hanieh Deilamsalehy, Franck, Dernoncourt, Hassan Foroosh, Fei Liu

TL;DR
MeetingBank introduces a comprehensive benchmark dataset of city council meetings, enabling improved development of meeting summarization systems through its divide-and-conquer approach and rich annotations.
Contribution
This paper presents MeetingBank, a novel annotated dataset for meeting summarization, addressing the lack of corpora and providing a structured, manageable approach for summarizing lengthy meetings.
Findings
Dataset includes transcripts, summaries, and metadata for city council meetings.
The divide-and-conquer approach facilitates more effective summarization.
Publicly available to support future research in meeting summarization.
Abstract
As the number of recorded meetings increases, it becomes increasingly important to utilize summarization technology to create useful summaries of these recordings. However, there is a crucial lack of annotated meeting corpora for developing this technology, as it can be hard to collect meetings, especially when the topics discussed are confidential. Furthermore, meeting summaries written by experienced writers are scarce, making it hard for abstractive summarizers to produce sensible output without a reliable reference. This lack of annotated corpora has hindered the development of meeting summarization technology. In this paper, we present MeetingBank, a new benchmark dataset of city council meetings over the past decade. MeetingBank is unique among other meeting corpora due to its divide-and-conquer approach, which involves dividing professionally written meeting minutes into shorter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Service-Oriented Architecture and Web Services
