Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance
Yue Xie, Zizhen Xu, William Beazley, Fumiya Iida

TL;DR
This paper introduces a scalable bi-level reinforcement learning framework for large-scale winter road maintenance routing, improving efficiency, reducing emissions, and balancing workloads on real UK road networks.
Contribution
It presents a novel AI-driven bi-level optimization approach that effectively manages large real-world road networks for winter maintenance planning.
Findings
Reduced maximum vehicle travel time below two hours
Lowered carbon emissions in routing solutions
Achieved significant cost savings and workload balance
Abstract
Winter road maintenance is critical for ensuring public safety and reducing environmental impacts, yet existing methods struggle to manage large-scale routing problems effectively and mostly reply on human decision. This study presents a novel, scalable bi-level optimization framework, validated on real operational data on UK strategic road networks (M25, M6, A1), including interconnected local road networks in surrounding areas for vehicle traversing, as part of the highway operator's efforts to solve existing planning challenges. At the upper level, a reinforcement learning (RL) agent strategically partitions the road network into manageable clusters and optimally allocates resources from multiple depots. At the lower level, a multi-objective vehicle routing problem (VRP) is solved within each cluster, minimizing the maximum vehicle travel time and total carbon emissions. Unlike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Forest Biomass Utilization and Management · Infrastructure Resilience and Vulnerability Analysis
