M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset
Jie Zhu, Junhui Li, Yalong Wen, Xiandong Li, Lifan Guo, Feng Chen

TL;DR
The paper introduces M$^3$FinMeeting, a comprehensive multilingual, multi-sector, multi-task dataset for evaluating large language models' understanding of financial meetings, highlighting current models' limitations.
Contribution
It presents a novel, diverse benchmark dataset for financial meeting understanding across languages, sectors, and tasks, addressing gaps in existing financial NLP evaluation methods.
Findings
LLMs show significant room for improvement on the benchmark.
The dataset covers English, Chinese, and Japanese languages.
It includes summarization, QA pair extraction, and question answering tasks.
Abstract
Recent breakthroughs in large language models (LLMs) have led to the development of new benchmarks for evaluating their performance in the financial domain. However, current financial benchmarks often rely on news articles, earnings reports, or announcements, making it challenging to capture the real-world dynamics of financial meetings. To address this gap, we propose a novel benchmark called \texttt{M^3FinMeeting}, which is a multilingual, multi-sector, and multi-task dataset designed for financial meeting understanding. First, \texttt{M^3FinMeeting} supports English, Chinese, and Japanese, enhancing comprehension of financial discussions in diverse linguistic contexts. Second, it encompasses various industry sectors defined by the Global Industry Classification Standard (GICS), ensuring that the benchmark spans a broad range of financial activities. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinTech, Crowdfunding, Digital Finance
