METBRA25Y: Brazil Surface Meteorology Archive with Harmonized Variables and Quality Control
Matheus Lima Castro, William Dantas Vichete, Leopoldo Lusquino Filho

TL;DR
METBRA25Y is a comprehensive, quality-controlled, harmonized archive of hourly meteorological data from Brazil, supporting diverse environmental and climate research with standardized variables and detailed metadata.
Contribution
It introduces a reproducible workflow for harmonizing and quality-controlling Brazil's surface meteorological data, enabling consistent station-level time series analysis.
Findings
Includes 616 stations with 2000-2025 data coverage.
Implements a two-stage quality control process for data reliability.
Provides detailed metadata, quality flags, and validation summaries.
Abstract
This data paper describes METBRA25Y, a harmonized archive of hourly surface meteorological observations from Brazil derived from public historical records of the Instituto Nacional de Meteorologia (INMET). The dataset was designed to support reproducible environmental, climatological, hydrological, agricultural, urban-risk, and machine-learning studies that require station-level meteorological time series with standardized variable names and explicit quality-control metadata. The processing workflow ingests annual INMET archives, parses station metadata from raw file headers, normalizes heterogeneous Portuguese column names into a canonical schema, constructs hourly timestamps, consolidates observations by city and station, and exports compressed CSV files together with station manifests, per-station quality flags, daily precipitation aggregates, variable-level failure summaries, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
