Learning to Regulate: A New Event-Level Dataset of Capital Control Measures
Geyue Sun, Xiao Liu, Tomas Williams, Roberto Samaniego

TL;DR
This paper introduces a comprehensive event-level dataset of capital control measures from 196 countries (1999-2023), utilizing large language models for extraction and classification, enabling detailed policy analysis and cross-country comparisons.
Contribution
It presents a novel dataset and a framework using LLMs for automated extraction and classification of capital control events, advancing empirical economic research tools.
Findings
Inward capital controls reduce fund inflows within one month.
Restrictive policies have stronger effects than liberalizing ones.
Heterogeneity in policy impacts across countries.
Abstract
We construct a novel event-level Capital Control Measures (CCM) dataset covering 196 countries from 1999 to 2023 by leveraging prompt-based large language models (LLMs). The dataset enables event study analysis and cross-country comparisons based on rich policy attributes, including action type, intensity, direction, implementing entity, and other multidimensional characteristics. Using a two-step prompt framework with GPT-4.1, we extract structured information from the IMF's Annual Report on Exchange Arrangements and Exchange Restrictions (AREAER), resulting in 5,198 capital control events with 27 annotated fields and corresponding model reasoning. Secondly, to facilitate real-time classification and extension to external sources, we fine-tune an open-source Meta Llama 3.1-8B model, named CCM-Llama, trained on AREAER change logs and final status reports. The model achieves 90.09\%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Distress and Bankruptcy Prediction · Explainable Artificial Intelligence (XAI)
MethodsLinear Layer · Adam · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Multi-Head Attention · Byte Pair Encoding · Dropout
