VotIE: Information Extraction from Meeting Minutes
Jos\'e Pedro Evans, Lu\'is Filipe Cunha, Purifica\c{c}\~ao Silvano, Al\'ipio Jorge, Nuno Guimar\~aes, S\'ergio Nunes, Ricardo Campos

TL;DR
VotIE introduces a new task and benchmark for extracting structured voting information from heterogeneous municipal meeting minutes, highlighting the strengths and limitations of fine-tuned encoders versus large language models.
Contribution
The paper establishes the first benchmark for voting information extraction from municipal minutes and compares the effectiveness of different models, including fine-tuned encoders and LLMs.
Findings
Fine-tuned XLM-R-CRF achieves 93.2% macro F1 in in-domain evaluation.
Few-shot LLMs show better robustness in cross-municipality transfer.
Generative models are computationally costly, limiting practical deployment.
Abstract
Municipal meeting minutes record key decisions in local democratic processes. Unlike parliamentary proceedings, which typically adhere to standardized formats, they encode voting outcomes in highly heterogeneous, free-form narrative text that varies widely across municipalities, posing significant challenges for automated extraction. In this paper, we introduce VotIE (Voting Information Extraction), a new information extraction task aimed at identifying structured voting events in narrative deliberative records, and establish the first benchmark for this task using Portuguese municipal minutes, building on the recently introduced CitiLink corpus. Our experiments yield two key findings. First, under standard in-domain evaluation, fine-tuned encoders, specifically XLM-R-CRF, achieve the strongest performance, reaching 93.2\% macro F1, outperforming generative approaches. Second, in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
