A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems
Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang and, Longbo Huang

TL;DR
This paper introduces a deep reinforcement learning framework, HRP, for efficiently rebalancing dockless bike sharing systems by considering spatial and temporal factors, improving service quality and revenue.
Contribution
It presents a novel HRP algorithm that captures spatial-temporal dependencies for bike rebalancing, outperforming existing methods and generalizing well to new areas.
Findings
HRP achieves near 24-timeslot look-ahead optimization performance.
HRP outperforms state-of-the-art methods in service level and bike distribution.
HRP generalizes effectively to unseen areas.
Abstract
Bike sharing provides an environment-friendly way for traveling and is booming all over the world. Yet, due to the high similarity of user travel patterns, the bike imbalance problem constantly occurs, especially for dockless bike sharing systems, causing significant impact on service quality and company revenue. Thus, it has become a critical task for bike sharing systems to resolve such imbalance efficiently. In this paper, we propose a novel deep reinforcement learning framework for incentivizing users to rebalance such systems. We model the problem as a Markov decision process and take both spatial and temporal features into consideration. We develop a novel deep reinforcement learning algorithm called Hierarchical Reinforcement Pricing (HRP), which builds upon the Deep Deterministic Policy Gradient algorithm. Different from existing methods that often ignore spatial information and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUrban Transport and Accessibility · Smart Parking Systems Research · Human Mobility and Location-Based Analysis
