Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration
Mathieu Granzotto, Romain Postoyan, Lucian Bu\c{s}oniu, Dragan, Ne\v{s}i\'c, Jamal Daafouz

TL;DR
This paper leverages homogeneity properties of discrete-time systems to develop a scalable, efficient value iteration method that provides bounds on the optimal value function, reducing computational complexity.
Contribution
It introduces a novel approach exploiting system homogeneity to construct (near-)optimal controls across the state space with fewer computations.
Findings
The proposed method guarantees lower and upper bounds on the value function.
It reduces the computational effort of value iteration.
Numerical results demonstrate the effectiveness of the new algorithm.
Abstract
To investigate solutions of (near-)optimal control problems, we extend and exploit a notion of homogeneity recently proposed in the literature for discrete-time systems. Assuming the plant dynamics is homogeneous, we first derive a scaling property of its solutions along rays provided the sequence of inputs is suitably modified. We then consider homogeneous cost functions and reveal how the optimal value function scales along rays. This result can be used to construct (near-)optimal inputs on the whole state space by only solving the original problem on a given compact manifold of a smaller dimension. Compared to the related works of the literature, we impose no conditions on the homogeneity degrees. We demonstrate the strength of this new result by presenting a new approximate scheme for value iteration, which is one of the pillars of dynamic programming. The new algorithm provides…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
