Large-scale Charging of Electric Vehicles: A Multi-Armed Bandit Approach
Zhe Yu, Yunjian Xu, Lang Tong

TL;DR
This paper models large-scale EV charging scheduling as a restless multi-armed bandit problem, deriving index policies that outperform existing algorithms, especially with time-varying costs.
Contribution
It introduces a Markov decision process model for EV charging, proves indexability, derives a closed-form Whittle's index, and proposes an improved policy for better performance.
Findings
Whittle's index policy is effective but not always optimal.
Enhanced policy with spatial interchange improves charging efficiency.
Proposed algorithms outperform existing methods, especially with variable costs.
Abstract
The successful launch of electric vehicles (EVs) depends critically on the availability of convenient and economic charging facilities. The problem of scheduling of large-scale charging of EVs by a service provider is considered. A Markov decision process model is introduced in which EVs arrive randomly at a charging facility with random demand and completion deadlines. The service provider faces random charging costs, convex non-completion penalties, and a peak power constraint that limits the maximum number of simultaneous activation of EV chargers. Formulated as a restless multi-armed bandit problem, the EV charging problem is shown to be indexable. A closed-form expression of the Whittle's index is obtained for the case when the charging costs are constant. The Whittle's index policy, however, is not optimal in general. An enhancement of the Whittle's index policy based on spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
