A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction
Erica Cai, Brendan O'Connor

TL;DR
This paper introduces a multi-stage, instruction-following generative language model pipeline with Monte Carlo sampling for zero-shot sociopolitical event extraction, improving accuracy, interpretability, and efficiency over existing methods.
Contribution
The paper presents a novel fine-grained, multi-stage pipeline with Monte Carlo sampling that enhances zero-shot event extraction control, interpretability, and computational efficiency.
Findings
Outperforms other zero-shot EE methods by at least 17 F1 points.
Reduces query processing to as low as 12% of previous methods.
Demonstrates effective application to international relations analysis.
Abstract
Current social science efforts automatically populate event databases of "who did what to whom?" tuples, by applying event extraction (EE) to text such as news. The event databases are used to analyze sociopolitical dynamics between actor pairs (dyads) in, e.g., international relations. While most EE methods heavily rely on rules or supervised learning, \emph{zero-shot} event extraction could potentially allow researchers to flexibly specify arbitrary event classes for new research questions. Unfortunately, we find that current zero-shot EE methods, as well as a naive zero-shot approach of simple generative language model (LM) prompting, perform poorly for dyadic event extraction; most suffer from word sense ambiguity, modality sensitivity, and computational inefficiency. We address these challenges with a new fine-grained, multi-stage instruction-following generative LM pipeline,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
