# Ensemble Control of Cycling Energy Loads: Markov Decision Approach

**Authors:** Michael Chertkov, Vladimir Y. Chernyak, Deepjyoti Deka

arXiv: 1701.04941 · 2017-10-24

## TL;DR

This paper develops a Markov decision process framework for controlling ensembles of cycling energy loads, optimizing operational costs and welfare penalties, with potential applications in demand response and energy management.

## Contribution

It introduces a class of linearly solvable MDP models for ensemble control, extending to demand response scenarios with modifications affecting linear solvability.

## Key findings

- Optimal strategies balance cost and welfare penalties.
- Modified MDPs can encourage or constrain state transitions.
- Numerical experiments demonstrate the framework's potential.

## Abstract

A Markov decision process (MDP) framework is adopted to represent ensemble control of devices with cyclic energy consumption patterns, e.g., thermostatically controlled loads. Specifically we utilize and develop the class of MDP models previously coined linearly solvable MDPs, that describe optimal dynamics of the probability distribution of an ensemble of many cycling devices. Two principally different settings are discussed. First, we consider optimal strategy of the ensemble aggregator balancing between minimization of the cost of operations and minimization of the ensemble welfare penalty, where the latter is represented as a KL-divergence between actual and normal probability distributions of the ensemble. Then, second, we shift to the demand response setting modeling the aggregator's task to minimize the welfare penalty under the condition that the aggregated consumption matches the targeted time-varying consumption requested by the system operator. We discuss a modification of both settings aimed at encouraging or constraining the transitions between different states. The dynamic programming feature of the resulting modified MDPs is always preserved; however, `linear solvability' is lost fully or partially, depending on the type of modification. We also conducted some (limited in scope) numerical experimentation using the formulations of the first setting. We conclude by discussing future generalizations and applications.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1701.04941/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1701.04941/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1701.04941/full.md

---
Source: https://tomesphere.com/paper/1701.04941