Russian Financial Statements Database: A firm-level collection of the universe of financial statements
Sergey Bondarkov, Victor Ledenev, Dmitriy Skougarevskiy

TL;DR
The RFSD is a comprehensive, harmonized database of Russian firms' financial statements from 2011 to 2023, enabling diverse economic analyses and improving data quality and coverage.
Contribution
It provides the first open, complete collection of Russian firm financial data, including non-filers, with extensive validation and data enhancements.
Findings
Most statements articulate well and correlate with regional GDP.
Data validation confirms high quality and consistency.
The database reveals reporting biases among firms.
Abstract
The Russian Financial Statements Database (RFSD) is an open, harmonized collection of annual unconsolidated financial statements of the universe of Russian firms in 2011-2023. It is the first open data set with information on every active firm in the country, including non-filing firms. With 56.6 million geolocated firm-year observations gathered from two official sources, the RFSD features multiple end-user quality-of-life improvements such as data imputation, statement articulation, harmonization across data providers and formats, and data enrichment. Extensive internal and external validation shows that most statements articulate well while their aggregates display higher correlation with the regional GDP than the previous gridded GDP data products. We also examine the direction and magnitude of the reporting bias by comparing the universe of firms that are required to file with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
