M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

Jie Zhu; Junhui Li; Yalong Wen; Xiandong Li; Lifan Guo; Feng Chen

arXiv:2506.02510·cs.CL·August 19, 2025

M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

Jie Zhu, Junhui Li, Yalong Wen, Xiandong Li, Lifan Guo, Feng Chen

PDF

Open Access

TL;DR

The paper introduces M$^3$FinMeeting, a comprehensive multilingual, multi-sector, multi-task dataset for evaluating large language models' understanding of financial meetings, highlighting current models' limitations.

Contribution

It presents a novel, diverse benchmark dataset for financial meeting understanding across languages, sectors, and tasks, addressing gaps in existing financial NLP evaluation methods.

Findings

01

LLMs show significant room for improvement on the benchmark.

02

The dataset covers English, Chinese, and Japanese languages.

03

It includes summarization, QA pair extraction, and question answering tasks.

Abstract

Recent breakthroughs in large language models (LLMs) have led to the development of new benchmarks for evaluating their performance in the financial domain. However, current financial benchmarks often rely on news articles, earnings reports, or announcements, making it challenging to capture the real-world dynamics of financial meetings. To address this gap, we propose a novel benchmark called $\texttt{M$ ^3 $FinMeeting}$ , which is a multilingual, multi-sector, and multi-task dataset designed for financial meeting understanding. First, $\texttt{M$ ^3 $FinMeeting}$ supports English, Chinese, and Japanese, enhancing comprehension of financial discussions in diverse linguistic contexts. Second, it encompasses various industry sectors defined by the Global Industry Classification Standard (GICS), ensuring that the benchmark spans a broad range of financial activities. Finally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinTech, Crowdfunding, Digital Finance