Approximate policy iteration using neural networks for storage problems

Trivikram Dokka; Richlove Frimpong

arXiv:1910.01895·math.OC·October 7, 2019

Approximate policy iteration using neural networks for storage problems

Trivikram Dokka, Richlove Frimpong

PDF

Open Access

TL;DR

This paper enhances approximate policy iteration for energy storage problems by integrating neural networks, especially effective when storage charging efficiency is low, using a novel aggregate variable model to reduce decision complexity.

Contribution

Introduces a new aggregate variable model enabling neural networks to improve policy evaluation in SNES, demonstrating significant performance gains, notably with low charging efficiency.

Findings

01

Neural networks improve API performance in SNES.

02

Aggregate variables reduce decision vector dimensionality.

03

Performance gains are more pronounced with low charging efficiency.

Abstract

We consider the stochastic single node energy storage problem (SNES) and revisit Approximate Policy Iteration (API) to solve SNES. We show that the performance of API can be boosted by using neural networks as an approximation architecture at the policy evaluation stage. To achieve this, we use a model different to that in literature with aggregate variables reducing the dimensionality of the decision vector, which in turn makes it viable to use neural network predictions in the policy improvement stage. We show that performance improvement by neural networks is even more significant in the case when charging efficiency of storage systems is low.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research