Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance

Yue Xie; Zizhen Xu; William Beazley; Fumiya Iida

arXiv:2602.24097·cs.AI·March 2, 2026

Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance

Yue Xie, Zizhen Xu, William Beazley, Fumiya Iida

PDF

Open Access

TL;DR

This paper introduces a scalable bi-level reinforcement learning framework for large-scale winter road maintenance routing, improving efficiency, reducing emissions, and balancing workloads on real UK road networks.

Contribution

It presents a novel AI-driven bi-level optimization approach that effectively manages large real-world road networks for winter maintenance planning.

Findings

01

Reduced maximum vehicle travel time below two hours

02

Lowered carbon emissions in routing solutions

03

Achieved significant cost savings and workload balance

Abstract

Winter road maintenance is critical for ensuring public safety and reducing environmental impacts, yet existing methods struggle to manage large-scale routing problems effectively and mostly reply on human decision. This study presents a novel, scalable bi-level optimization framework, validated on real operational data on UK strategic road networks (M25, M6, A1), including interconnected local road networks in surrounding areas for vehicle traversing, as part of the highway operator's efforts to solve existing planning challenges. At the upper level, a reinforcement learning (RL) agent strategically partitions the road network into manageable clusters and optimally allocates resources from multiple depots. At the lower level, a multi-objective vehicle routing problem (VRP) is solved within each cluster, minimizing the maximum vehicle travel time and total carbon emissions. Unlike…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Forest Biomass Utilization and Management · Infrastructure Resilience and Vulnerability Analysis