Integrating Meteorological and Operational Data: A Novel Approach to Understanding Railway Delays in Finland
Vinicius Pozzobon Borin, Jean Michel de Souza Sant'Ana, Usama Raheel, Nurul Huda Mahmood

TL;DR
This paper introduces a comprehensive dataset combining Finnish railway operational data with meteorological observations, enabling advanced analysis of weather impacts on train delays and supporting machine learning applications in railway reliability studies.
Contribution
The study presents the first publicly available dataset integrating Finnish railway operations with synchronized meteorological data, along with preprocessing methods and initial analysis demonstrating its utility.
Findings
Winter delays exceed 25% during peak months
High-delay corridors are geographically clustered in Finland
Baseline machine learning model predicts delays with MAE of 2.73 minutes
Abstract
Train delays result from complex interactions between operational, technical, and environmental factors. While weather impacts railway reliability, particularly in Nordic regions, existing datasets rarely integrate meteorological information with operational train data. This study presents the first publicly available dataset combining Finnish railway operations with synchronized meteorological observations from 2018-2024. The dataset integrates operational metrics from Finland Digitraffic Railway Traffic Service with weather measurements from 209 environmental monitoring stations, using spatial-temporal alignment via Haversine distance. It encompasses 28 engineered features across operational variables and meteorological measurements, covering approximately 38.5 million observations from Finland's 5,915-kilometer rail network. Preprocessing includes strategic missing data handling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRailway Systems and Energy Efficiency · Railway Engineering and Dynamics · Transport and Economic Policies
