MUG: A General Meeting Understanding and Generation Benchmark
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang,, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao

TL;DR
This paper introduces MUG, a comprehensive benchmark for meeting understanding and generation, supported by a large annotated Mandarin meeting dataset, to advance NLP applications in meeting scenarios.
Contribution
It establishes the MUG benchmark and releases the AliMeeting4MUG Corpus, enabling large-scale evaluation of multiple SLP tasks in meeting scenarios.
Findings
AliMeeting4MUG is the largest meeting dataset for SLP tasks.
Baseline systems show room for improvement across tasks.
Benchmarking results highlight challenges and future directions.
Abstract
Listening to long video/audio recordings from video conferencing and online courses for acquiring information is extremely inefficient. Even after ASR systems transcribe recordings into long-form spoken language documents, reading ASR transcripts only partly speeds up seeking information. It has been observed that a range of NLP applications, such as keyphrase extraction, topic segmentation, and summarization, significantly improve users' efficiency in grasping important information. The meeting scenario is among the most valuable scenarios for deploying these spoken language processing (SLP) capabilities. However, the lack of large-scale public meeting datasets annotated for these SLP tasks severely hinders their advancement. To prompt SLP advancement, we establish a large-scale general Meeting Understanding and Generation Benchmark (MUG) to benchmark the performance of a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Natural Language Processing Techniques
